Gene Information |
Locus tag | ECH74115_4466 |
Symbol | |
ID | 6969007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4137665 |
End bp | 4139701 |
Gene Length | 2037 bp |
Protein Length | 678 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388181 |
Product | putative lipoprotein |
Protein accession | YP_002272618 |
Protein GI | 209397195 |
COG category | [R] General function prediction only |
COG ID | [COG3107] Putative lipoprotein |
TIGRFAM ID | |
|
|
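The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the assumed stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 4137665, End bp 4139701.
gene_bp, protein_aa = cds_lengths(4137665, 4139701)
print(gene_bp, protein_aa)
```

Under these assumptions the result agrees with the listed Gene Length (2037 bp) and Protein Length (678 aa).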
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACCCT CAACATTTTC TCGTTTGAAA GCCGCGCGTT GTCTGCCTGT TGTTCTGGCA GCCCTGATTT TCGTCGGTTG TGGCACCCAT ACTCCCGATC AGTCCACTGC TTATATGCAG GGCACGGCGC AGGCTGATTC TGCCTTTTAT CTTCAGCAGA TGCAGCAAAG CTCTGATGAT ACCAGGATCA ACTGGCAATT ACTCGCCATT CGTGCACTGG TGAAAGAAGG TAAAACCGGG CAGGCGGTTG AGTTGTTTAA CCAACTACCG CAAGAACTGA ACGATTCTCA GCGTCGCGAG AAAACACTGC TGGCGGTAGA GATTAAACTG GCGCAGAAAG ATTTTGCTGG CGCGCAAAAC TTGCTGGCGA AAATTACACC CGCCGATTTA GAACAAAACC AGCAAGCCCG TTACTGGCAG GCAAAAATCG ATGCCAGCCA GGGGCGTCCT TCCATTGATT TACTGCGCGC GTTAATTGCT CAGGAACCGC TGCTTGGCGC GAAAGAAAAA CAGCAGAATA TTGATGCCAC CTGGCAGGCG CTCTCCTCCA TGACTCAGGA ACAGGCGAAT ACGCTGGTGA TCAACGCCGA CGAAAATATT CTGCAAGGCT GGCTGGATCT GCAGCGCGTC TGGTTTGATA ACCGTAACGA TCCCGACATG ATGAAAGCCG GGATCGCCGA CTGGCAGAAA CGTTATCCGA ACAATCCGGG CGCGAAAATG CTGCCAACGC AGTTGGTTAA CGTAAAAGCG TTTAAACCAG CCTCGACCAA CAAAATCGCC CTGCTGTTGC CACTGAATGG CCAGGCAGCG GTATTTGGTC GCACTATTCA GCAAGGCTTT GAAGCGGCGA AAAATATCGG CACTCAGCCA GTGGCGGCTC AGGTAGCTGC CGCACCTGCC GCAGACGTAG CAGAACAACC TCAGCCGCAA ACTGCGGATG GCGTTGCCAG CCCGGCACAA GCCTCGGTTA GCGATCTGAC CGGTGATCAG CCTGCGGCCC AGCCGGTGCC TGTAAGCGCC CCGGCGACAA GCACCGCAGC GGTAAGCGCA CCCGCAAATC CATCCGCAGA GCTGAAAATC TACGATACCT CATCACAACC ACTTAGCCAG ATCTTAAGCC AGGTTCAGCA GGATGGCGCG AGTATTGTGG TTGGTCCGTT GCTGAAAAAT AACGTTGAAG AGTTGCTGAA GAGCAACACT CCGCTGAACG TACTAGCACT GAACCAGCCG GAGAATATCG AAAATCGCGT CAATATTTGT TACTTCGCGC TTTCACCGGA AGACGAAGCG CGCGATGCAG CGCGTCATAT TCGTGACCAG GGTAAACAAG CGCCGCTGGT GCTGATCCCA CGCAGTTCAT TGGGCGATCG CGTAGCCAAT GCGTTTGCGC AAGAGTGGCA GAAATTGGGC GGCGGCACCG TTCTGCAACA AAAATTTGGT TCCACCAGCG AATTACGCGC GGGTGTTAAC GGCGGTTCTG GTATTGCTTT AACGGGTACC CCGATTACTC CCAGAGCGAC AACCGACTCC GGCATGACGA CCAACAATCC AACGCTGCAA ACCACGCCAA CCGATGACCA GTTCACCAAT AATGGCGGTC GTGTCGATGC GGTGTACATT GTGGCAACGC CGGGTGAAAT CGCTTTTATT AAACCGATGA TCGCCATGCG TAACGGTAGC CAGAGCGGTG CAACGCTGTA CGCCAGCTCC CGCAGTGCGC AAGGGACCGC TGGCCCGGAT TTCCGTCTGG AGATGGAAGG CTTGCAGTAC AGCGAAATCC CGATGCTGGC GGGCGGTAAT 
CTGCCGTTAA TGCAGCAGGC ACTCAGCGCG GTGAATAACG ATTATTCACT GGCTCGCATG TATGCGATGG GCGTCGATGC CTGGTCGCTG GCAAATCATT TCTCACAAAT GCGCCAGGTT CAGGGTTTTG AAATCAACGG TAATACCGGA AGCCTGACGG CTAACCCGGA TTGCGTGATT AACAGGAAGT TATCATGGCT ACAGTACCAA CAAGGTCAGG TAGTCCCCGC CAGTTAA
|
Protein sequence | MVPSTFSRLK AARCLPVVLA ALIFVGCGTH TPDQSTAYMQ GTAQADSAFY LQQMQQSSDD TRINWQLLAI RALVKEGKTG QAVELFNQLP QELNDSQRRE KTLLAVEIKL AQKDFAGAQN LLAKITPADL EQNQQARYWQ AKIDASQGRP SIDLLRALIA QEPLLGAKEK QQNIDATWQA LSSMTQEQAN TLVINADENI LQGWLDLQRV WFDNRNDPDM MKAGIADWQK RYPNNPGAKM LPTQLVNVKA FKPASTNKIA LLLPLNGQAA VFGRTIQQGF EAAKNIGTQP VAAQVAAAPA ADVAEQPQPQ TADGVASPAQ ASVSDLTGDQ PAAQPVPVSA PATSTAAVSA PANPSAELKI YDTSSQPLSQ ILSQVQQDGA SIVVGPLLKN NVEELLKSNT PLNVLALNQP ENIENRVNIC YFALSPEDEA RDAARHIRDQ GKQAPLVLIP RSSLGDRVAN AFAQEWQKLG GGTVLQQKFG STSELRAGVN GGSGIALTGT PITPRATTDS GMTTNNPTLQ TTPTDDQFTN NGGRVDAVYI VATPGEIAFI KPMIAMRNGS QSGATLYASS RSAQGTAGPD FRLEMEGLQY SEIPMLAGGN LPLMQQALSA VNNDYSLARM YAMGVDAWSL ANHFSQMRQV QGFEINGNTG SLTANPDCVI NRKLSWLQYQ QGQVVPAS
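The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch of how the two can be compared, the code below strips the spacing, translates the first two blocks of the gene sequence, and checks that the gene ends in a stop codon. The codon table is deliberately partial (only the codons needed for this prefix), the stop codon set is the one used by translation table 11, and the helper names clean and translate_prefix are hypothetical.

```python
# Partial codon table: only the codons that occur in the prefix used below.
PARTIAL_TABLE = {
    "ATG": "M", "GTA": "V", "CCC": "P",
    "TCA": "S", "ACA": "T", "TTT": "F",
}
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq):
    """Remove the block spacing and line breaks from a printed sequence."""
    return "".join(seq.split()).upper()


def translate_prefix(dna, table):
    """Translate codon by codon, stopping at the first codon missing from table."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = table.get(dna[i:i + 3])
        if aa is None:
            break
        residues.append(aa)
    return "".join(residues)


gene_prefix = clean("ATGGTACCCT CAACATTTTC")  # first two blocks of the gene sequence
gene_suffix = clean("TAGTCCCCGC CAGTTAA")     # last two blocks of the gene sequence

print(translate_prefix(gene_prefix, PARTIAL_TABLE))  # compare with the start of the protein sequence
print(gene_suffix[-3:] in STOP_CODONS)
```

The translated prefix, MVPSTF, matches the first six residues of the protein sequence, and the final codon of the gene sequence is TAA. Translating the whole CDS would need a complete codon table for translation table 11.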