Gene Information |
Locus tag | EcSMS35_1849 |
Symbol | |
ID | 6143885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1871236 |
End bp | 1872405 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616725 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_001743903 |
Protein GI | 170683676 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
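The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The helper name `cds_lengths` is hypothetical and is not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 1871236, End bp 1872405.
gene_len, protein_len = cds_lengths(1871236, 1872405)
print(gene_len, protein_len)  # 1170 389
```

The strand does not affect this calculation, because the coordinates are given on the replicon regardless of orientation.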
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 2.45858e-09 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 5.61509e-21 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
Sequence |
Gene sequence | ATGCTGGAGT TGTTGTTTCT GCTTTTGCCT GTAGCCGCTG CCTATGGCTG GTATATGGGC CGCAGAAGTG CGCAACAAAA CAAGCAAGAT GAAGCCAACC GCTTGTCGCG TGATTACGTA GCGGGGGTTA ACTTCCTGCT TAGTAATCAA CAGGATAAAG CGGTAGATCT GTTTCTCGAT ATGCTTAAAG AGGATACAGG TACCGTTGAA GCCCACCTTA CGCTCGGAAA CCTGTTCCGT TCGCGTGGCG AAGTTGATCG CGCTATTCGC ATCCATCAGA CCCTAATGGA AAGCGCCTCG CTGACCTATG AACAGCGGCT TTTGGCGATT CAACAACTGG GGCGTGATTA CATGGCTGCC GGGTTATACG ACCGCGCGGA AGACATGTTC AATCAGCTGA CCGATGAAAC TGACTTCCGC ATTGGCGCGC TGCAACAGTT GCTACAAATC TACCAGGCTA CCAGCGAGTG GCAGAAAGCA ATTGATGTTG CCGAACGCCT GGTGAAGCTG GGTAAAGATA AACAGCGCGT CGAAATTGCC CATTTCTACT GTGAGTTAGC TCTGCAGCAT ATGGCCAGCG ACGATCTCGA TCGTGCCATG ACTTTGCTAA AAAAAGGTGC GGCGGCAGAT AAAAACAGCG CCCGCGTATC CATCATGATG GGACGCGTGT TTATGGCGAA AGGAGAATAC GCCAAAGCCG TCGAAAGTCT GCAACGTGTC ATATCCCAGG ACAGAGAACT GGTCAGCGAA ACGCTGGAAA TGCTGCAAAC CTGCTACCAG CAGTTGGGTA AAACTGCCGA ATGGGCAGAG TTCCTGCAGC GCGCGGTGGA AGAGAACACC GGTGCCGATG CTGAACTGAT GCTTGCGGAC ATCATCGAAG CGCGCGACGG TAGTGAGGCC GCACAGGTCT ATATTACGCG TCAGCTTCAG CGTCATCCGA CCATGCGTGT GTTCCATAAG CTAATGGATT ACCACTTAAA TGAAGCGGAA GAAGGGCGTG CCAAAGAGAG TCTGATGGTG CTGCGTGACA TGGTTGGCGA GAAGGTGCGG AGTAAGCCTC GTTATCGCTG CCAGAAATGT GGGTTTACCG CATACACTCT CTACTGGCAT TGTCCGTCTT GTCGGGCCTG GTCAACCATT AAACCGATTC GCGGTCTTGA TGGCCTGTAA
Protein sequence | MLELLFLLLP VAAAYGWYMG RRSAQQNKQD EANRLSRDYV AGVNFLLSNQ QDKAVDLFLD MLKEDTGTVE AHLTLGNLFR SRGEVDRAIR IHQTLMESAS LTYEQRLLAI QQLGRDYMAA GLYDRAEDMF NQLTDETDFR IGALQQLLQI YQATSEWQKA IDVAERLVKL GKDKQRVEIA HFYCELALQH MASDDLDRAM TLLKKGAAAD KNSARVSIMM GRVFMAKGEY AKAVESLQRV ISQDRELVSE TLEMLQTCYQ QLGKTAEWAE FLQRAVEENT GADAELMLAD IIEARDGSEA AQVYITRQLQ RHPTMRVFHK LMDYHLNEAE EGRAKESLMV LRDMVGEKVR SKPRYRCQKC GFTAYTLYWH CPSCRAWSTI KPIRGLDGL
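The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch that translates the DNA using the amino-acid assignments of translation table 11. This assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the 10-base spacing is only display formatting. The names `translate`, `gc_percent`, and `clean` are hypothetical helpers, and the compact codon-table construction is a common convention, not something taken from this record.

```python
# Codon assignments shared by translation tables 1 and 11, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
gene_start = "ATGCTGGAGT TGTTGTTTCT GCTTTTGCCT"
print(translate(gene_start))  # MLELLFLLLP
```

Passing the full gene sequence to `translate` should reproduce the protein sequence shown above, and `gc_percent` gives a figure to compare against the reported GC content. Table 11 also permits alternative start codons that are read as methionine at initiation; this sketch does not handle that case, which does not matter here because the gene starts with ATG.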