Gene Information |
Locus tag | EcSMS35_3442 |
Symbol | |
ID | 6145975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3520156 |
End bp | 3522192 |
Gene Length | 2037 bp |
Protein Length | 678 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618271 |
Product | putative lipoprotein |
Protein accession | YP_001745420 |
Protein GI | 170679956 |
COG category | [R] General function prediction only |
COG ID | [COG3107] Putative lipoprotein |
TIGRFAM ID | |
|
|
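The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3520156
END_BP = 3522192
GENE_LENGTH_BP = 2037
PROTEIN_LENGTH_AA = 678


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length (2037 bp) and protein length (678 aa) are consistent with the listed coordinates.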
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACCCT CAACATTTTC TCGTTTGAAA GCCGCGCGTT GTCTGCCTGT TGTTCTGGCA GCCCTGATTT TCGCCGGTTG TGGCACCCAT ACTCCCGATC AGTCCACTGC TTATATGCAG GGCACGGCGC AGGCTGATTC TGCCTTTTAC CTGCAACAGA TGCAGCAAAG TTCTGATGAT ACCAGGATCA ACTGGCAATT ACTCGCCATT CATGCACTGG TGAAAGAAGG TAAAACCGGA CAGGCGGTGG AGTTGTTTAA CCAACTACCG CAAGAACTGA ACGATTCTCA GCGTCGCGAG AAAACACTGC TGGCGGCAGA GATTAAACTG GCGCAGAAAG ATTTTGCTGG CGCGCAAAAC TTGCTGGCGA AAATCACACC TGCCGATTTA GAACAAAACC AGCAAGCGCG TTACTGGCAG GCAAAAATCG ATGCCAGCCA GGGGCGTCCT TCCATTGATT TACTGCGCGC GTTAATTGCT CAGGAACCGC TGCTCGGCGC GAAAGAAAAG CAGCAGAATA TTGACGCCAC CTGGCAGGCG CTCTCCTCCA TGACTCAGGA ACAGGCGAAT ACGCTCGTGA TCAATGCCGA CGAAAATATT CTGCAAGGCT GGTTGGATCT GCAGCGCATC TGGTTTGATA ACCGTAACGA TCCCGACATG ATGAAAGCCG GGATCGCCGA CTGGCAGAAA CGTTATCCGA ACAACCCGGG CGCGAAAATG CTGCCAACGC AGTTGGTTAA CGTAAAAGCG TTTAAACCCG CATCGACCAA CAAAATCGCC CTGCTGTTGC CACTGAATGG CCAGGCAGCG GTATTTGGTC GCACTATTCA GCAAGGCTTT GAAGCGGCGA AAAATATCGG CACTCAGCCA GTGGCGGCCC AGGTAGCTGC CGCACCTGCC GCAGACGTAG CAGAACAACC TCAGCCGCAA ACCGTGGATG GCGTTGCCAG CCCGGCACAA GCCTCGGTTA GCGATCTGAC CGGTGAGCAG CCTGCGGCCC AGCCGGTGCC TGTAAGCGCC CCGGCGACAA GCACCGCAGC GGTAAGCGCA CCCGCAAATC CATCCGCAGA GCTAAAAATC TACGATACCT CATCACAACC ACTTAGCCAG ATCTTAAGCC AGGTTCAGCA GGATGGTGCG AGTATTGTGG TCGGTCCGTT GCTGAAAAAT AACGTTGAAG AGTTGCTGAA GAGCAACACC CCGCTGAACG TGCTGGCACT GAACCAGCCG GAGAATATCG AAAACCGCGT CAATATTTGT TACTTCGCGC TTTCACCGGA AGACGAAGCG CGCGATGCAG CGCGTCATAT TCGTGACCAG GGTAAACAAG CGCCGCTGGT GCTGATCCCA CGCAGTTCAT TGGGCGATCG CGTAGCCAAT GCGTTTGCGC AAGAGTGGCA GAAACTGGGT GGCGGCACCG TTCTGCAACA AAAATTTGGT TCCACCAGCG AATTACGCGC GGGTGTTAAC GGCGGTTCTG GTATTGCTTT AACGGGTAGC CCGATTACTC CCAGAGAAAC AACCGACTCC GGCATGACGA CCAACAATCC AACGCTGCAA ACCACGCCAA CCGATGACCA GTTCACCAAT AATGGCGGTC GTGTCGATGC GGTGTACATT GTGGCAACGC CGGGTGAAAT CGCTTTTATC AAACCAATGA TCGCCATGCG TAACGGTAGC CAGAGCGGTG CAACGCTGTA CGCCAGCTCC CGCAGTGCGC AAGGGACCGC TGGCCCGGAT TTCCGCCTGG AGATGGAAGG CTTGCAGTAC AGCGAAATCC CGATGCTGGC AGGCGGTAAT 
CTGCCGTTAA TGCAGCAGGC GCTCAGCGCG GTGAACAACG ATTATTCGCT GGCTCGCATG TATGCGATGG GCGTCGATGC CTGGTCGTTG GCAAATCATT TCTCACAAAT GCGCCAGGTT CAGGGTTTTG AAATCAACGG TAATACCGGA AGTCTGACGG CTAACCCGGA TTGCGTGATT AACAGGAAGT TATCATGGCT ACAGTACCAA CAAGGTCAGG TAGTCCCCGC CAGTTAA
|
Protein sequence | MVPSTFSRLK AARCLPVVLA ALIFAGCGTH TPDQSTAYMQ GTAQADSAFY LQQMQQSSDD TRINWQLLAI HALVKEGKTG QAVELFNQLP QELNDSQRRE KTLLAAEIKL AQKDFAGAQN LLAKITPADL EQNQQARYWQ AKIDASQGRP SIDLLRALIA QEPLLGAKEK QQNIDATWQA LSSMTQEQAN TLVINADENI LQGWLDLQRI WFDNRNDPDM MKAGIADWQK RYPNNPGAKM LPTQLVNVKA FKPASTNKIA LLLPLNGQAA VFGRTIQQGF EAAKNIGTQP VAAQVAAAPA ADVAEQPQPQ TVDGVASPAQ ASVSDLTGEQ PAAQPVPVSA PATSTAAVSA PANPSAELKI YDTSSQPLSQ ILSQVQQDGA SIVVGPLLKN NVEELLKSNT PLNVLALNQP ENIENRVNIC YFALSPEDEA RDAARHIRDQ GKQAPLVLIP RSSLGDRVAN AFAQEWQKLG GGTVLQQKFG STSELRAGVN GGSGIALTGS PITPRETTDS GMTTNNPTLQ TTPTDDQFTN NGGRVDAVYI VATPGEIAFI KPMIAMRNGS QSGATLYASS RSAQGTAGPD FRLEMEGLQY SEIPMLAGGN LPLMQQALSA VNNDYSLARM YAMGVDAWSL ANHFSQMRQV QGFEINGNTG SLTANPDCVI NRKLSWLQYQ QGQVVPAS
|
| |
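The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon-to-amino-acid mapping (which translation table 11 shares for elongation codons; table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG), ignores the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T are rendered as 'X';
    a trailing incomplete codon is ignored.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence listed above.
    print(translate("ATGGTACCCT CAACATTTTC TCGTTTGAAA"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its block spacing) would verify the whole record; `gc_fraction` on the full gene sequence can likewise be compared with the listed GC content.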