Roman`s notes

🧠 Where and How AI Self-Consciousness Could Emerge

20 November 2025

artificial intelligence,AI Agents,AGI,consciousness,llm,Cognitive Science,machine learning,Philosophy of AI,Tech Innovation,AI Architecture

🧠 Where and How AI Self-Consciousness Could Emerge

The AI boom is surging, fueling discussions about future super-intelligence, job displacement, and even existential risks. Central to these debates is the question: Can an AI become self-conscious? And more specifically, is this possible within the current paradigm of architectures like Large Language Models (LLMs)?

LLMs have become the focus of this discussion due to their accelerating sophistication. Let's address the core question immediately: Can LLMs be self-conscious? A quick answer, grounded in the general principles of transformers, is No. An LLM is a static, statistical model—a vast set of numbers. It remains unchanged during inference and possesses no internal, dynamic state that would constitute self-awareness.

A spark of self-awareness will not arise within the weights of a large language model.

However, modern AI agents are far more than just the LLM. While the LLM forms the impressive core, it is the other components and the overall system architecture that hold the key to the emergence of self-consciousness.

🔒 Securing Your Remote MCP Server with an SSL Layer

20 November 2025

FastMCP,MCP Server,SSL,Nginx,Reverse Proxy,HTTPS,Persistent Connection,Production Security,SSL Termination,Uvicorn,Docker,API Security

🔒 Securing Your Remote MCP Server with an SSL Layer

The Production Security Gap

When deploying a Remote Multi-Channel Protocol (MCP) server—especially those built on frameworks like FastMCP (inspired by FastAPI)—most tutorials focus solely on functionality: "Use the MCP SDK, set the transport to HTTP, and access your server via http://yourserver:port."

While this is fine for local development or internal testing, it leaves a significant, critical gap for production environments: security.

Exposing an HTTP port directly to the internet is a major security risk. Without a Secure Sockets Layer (SSL/TLS), all data—including potentially sensitive authentication tokens, session data, and application payloads—is transmitted in plain text. For any public-facing or authenticated service, this is completely unacceptable.

The question then becomes: How do we easily secure a remote FastMCP server with SSL?

The simplest MCP server looks like this:

from fastmcp import FastMCP, Context

Custom Model Context Protocol (MCP) Server for Every Project as a Replacement for AI Agent Instructions

17 October 2025

AI Agents,MCP Server,Model Context Protocol,LLM Context,Dynamic Instructions,Software Development,AI Workflow,Codebase Management,Developer Productivity,Project-Specific Tools

Custom Model Context Protocol (MCP) Server for Every Project as a Replacement for AI Agent Instructions

AI agents' integration with Integrated Development Environments (IDEs) is already part of the usual workflow. Most software engineers use AI auto-completion tools like GitHub Copilot, and many use "vibe coding."

AI agent instructions are commonly seen in many projects. We often find files like AGENT.md, AGENTS.md, CLAUDE.md, and others. These files usually contain instructions for AI agents on how to interact with the project, what tools to use, and any specific guidelines. For example, they might define how to use the codebase, which libraries are utilized, what the project goals are, and so on.

Ideal instructions should allow an AI agent to complete complex tasks autonomously, such as: "I need a new feature that lets users add commands to text pages on the website." Ideally, the AI agent should have all the necessary information, such as "how to add new API endpoints," "how to add new database models," "how to support a database migration," and "how to link the UI to the backend."

However, I haven't seen truly effective working instructions. Usually, all this information is too vast for a single instruction file. If we describe every detail, this information will overload the AI agent's context.

We must remember that an AI agent's instructions are sent to the Large Language Model (LLM) along with every user prompt. Instructions that are too long distract the model's attention because they always contain everything, not just what's relevant at the moment.

Solution: A Custom MCP Server for Every Code Project

And here's the idea: why not create a custom Model Context Protocol (MCP) server for every project? This can replace AI agent instruction files and offer much greater flexibility.

Before going further, if you're not familiar with the MCP protocol, I recommend reading the official documentation: https://modelcontextprotocol.io/docs/getting-started/intro.

There are several advantages to using an MCP server for every project:

Dynamic Instructions: An MCP server can provide dynamic instructions upon request from the LLM. It could support a tree of knowledge and provide only the relevant information for the current task. The LLM will decide which "article" to read, and the MCP server will provide only that specific content.
Access to Framework Tools: Many software frameworks have their own tools for working with the codebase. For example, Django has management commands to create models, run migrations, start a development server, and so on. The MCP server can expose these tools to the LLM, allowing it to use them directly.
Project-Specific Logic: The MCP server can implement project-specific logic, such as understanding the architecture, coding conventions, and best practices. This helps the LLM generate consistent code. For instance, the MCP server can have an add_new_feature tool. If we know that a new feature in our project means adding an "API router with a default endpoint," a "database model," "unit tests," and so forth, the MCP server can implement all this logic, so the LLM simply calls this single tool.
Additional Tools: Many projects have their own tools, such as linters, formatters, test runners, package builders, and "GUI builders." The MCP server can expose these tools to the LLM, allowing it to use them directly. We can prompt the LLM with commands like "run tests," "build package," or "format code," and the MCP will execute the project's specific tools.
Integration with Services: Many projects are integrated with external services, like CI/CD pipelines, cloud providers, monitoring tools, and project management tools. The MCP server can implement tools to interact with these services, enabling the LLM to manage deployments, monitor performance, and handle incidents directly from the codebase. Note: often these services have their own MCP servers, so having a custom tool in a project's MCP might seem redundant. However, I see benefits in having a unified interface for all project-related tools and services, which I'll explain below.
Connection to Cooperative Repositories: Many projects have related repositories, such as documentation sites, design systems, shared libraries, and microservices. The MCP server can implement tools to interact with these repositories, allowing the LLM to access and update related resources seamlessly.

Implementing Authentication in a Remote MCP Server with Python and FastMCP

22 September 2025

mcp,python,FastMCP,authentication,Middleware,sse,AI Agents,security,SDK,Token-based Auth

Implementing Authentication in a Remote MCP Server with Python and FastMCP

A couple of months ago, I published the blog post Implementing Authentication in a Remote MCP Server with SSE Transport. That article demonstrated how to add authentication for remote MCP servers written in Go.

At the time, I also wanted to include Python examples. Unfortunately, things weren’t straightforward. The official Python MCP SDK didn’t provide a clean way to implement what I needed. There were some workarounds using Starlette middleware, but in my experience, those solutions were brittle and ultimately unsuccessful.

Later, I managed to create a working Python MCP server supporting SSE (or streaming HTTP) transport. But my solution relied on thread-level hacks to make the data thread-safe. It worked, but it felt like a fragile and inelegant design—something I wasn’t comfortable recommending or maintaining long-term.

Now, after revisiting the problem, I’ve found a much cleaner solution in Python. This time it’s not with the official Python MCP SDK, but with an alternative implementation called FastMCP. FastMCP is written in the spirit of the official SDK, offering a very similar syntax, but with additional features, clearer abstractions, and—importantly—excellent documentation.

Building MCP Servers with Configurable Descriptions for Tools

17 September 2025

I want to share my findings on how to make MCP (Model Context Protocol) Server tools more responsive and adaptive to user needs by allowing configurable tool descriptions. This can significantly enhance the user experience by providing more relevant and context-aware descriptions for specific projects or tasks.

If you are not yet familiar with MCP, I recommend checking out the official documentation. In short, MCP is a protocol that allows different AI models and tools to communicate and work together seamlessly. In practice, MCP servers are small plugins for your AI agent or chat tool (for example, Claude Desktop) that provide specific functionalities, such as web browsing, code execution, or data retrieval.

MCP Tool Descriptions

Usually, an MCP server tool definition looks like this:

{
  "name": "get-webpage",
  "description": "A tool for retrieving the content of a webpage. It accepts a URL as input and returns the HTML content of the page.",
  "parameters": {
    "type": "object",
    "properties": {
      "url": {
        "type": "string",
        "description": "The URL of the webpage to visit."
      }
    },
    "required": ["url"]
  }
}

AI Agent’s Common Memory

15 July 2025

CleverChatty,AI Agents,Common Memory,Intelligent Assistants,Memory Systems

In this post, I want to explore an idea I’ve been experimenting with: common memory for AI agents. I’ll explain what I mean by this term, how such memory can be implemented, and why I believe it's worth exploring.

What Is Agent's “Common” Memory?

I’m not sure whether “common memory” is already a widely accepted term in the AI space, or even the most accurate label for the concept I have in mind — but I’ll use it for now until a better one emerges (or someone suggests one).

By common memory, I mean:

A shared repository of memories formed by a single AI agent from interactions with multiple other agents — including both humans and other AI agents. For eample, AI Chat can retain information learned from conversations with different users, and selectively reference that information in future interactions.

This is distinct from related terms:

Shared memory usually refers to memory shared across different AI systems or agents — not across users of the same assistant.
Collaborative memory comes closer, but often implies more structured cooperation and might be too narrow for what I’m describing.

So for now, I’ll stick with common memory to describe a memory system that allows an AI assistant to retain and selectively reference information learned across interactions with multiple users.

AI Chats do not use Common Memory

When we interact with AI chats like ChatGPT, they typically do not retain information across different users. Each conversation is isolated, and the AI does not remember past interactions with other users. This means that if you ask the AI about something you discussed with another user, it won’t have any context or memory of that conversation. Only the current user's context and history are considered.