Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2057 |
Symbol | |
ID | 6969337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1951317 |
End bp | 1953419 |
Gene Length | 2103 bp |
Protein Length | 700 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385969 |
Product | TonB-dependent receptor |
Protein accession | YP_002270458 |
Protein GI | 209398339 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01783] TonB-dependent siderophore receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.980298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.0003174 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGATTG TTTTCGTTCG ACAGACCGTT TTGCCCGCCC TGCTTGTCCT TTCCCCCGTT GTTTTTGCCG CTGATGAACA GACCATGATT GTCAGTGCCG CGCCGCAGGT GGTTTCAGAA CTGGATACCC CTGCAGCAGT AAGCGTGGTG GATGGCGAGG AGATGCGCCT GGCAACACCG CGCATTAACT TGTCCGAATC ACTGACCGGC GTGCCTGGTT TGCAGGTACA AAACCGGCAG AACTATGCGC AAGATTTACA GCTGTCGATT CGTGGATTTG GCTCCCGCTC CACTTACGGA ATTCGCGGTA TTCGCCTGTA TGTGGACGGT ATTCCCGCTA CCATGCCCGA CGGGCAAGGG CAAACATCCA ACATCGATTT AAGCAGTGTG CAAAATGTGG AAGTGCTGCG TGGCCCCTTC TCTGCCCTGT ATGGCAACGC GTCTGGCGGT GTAATGAATG TCACCACCCA GACCGGGCAA CAGCCACCAA CCATTGAAGC CAGTAGTTAC TACGGCAGTT TTGGCAGCTG GCGCTATGGG CTGAAAGCAA CGGGCGCAAC GGGAGACGGC ACACAGCCTG GCGATGTTGA TTACACCGTC TCAACCACGC GTTTTACGAC CCACGGCTAT CGTGACCATA GTGGCGCACA GAAAAATTTA GCTAATGCCA AACTGGGCGT ACGCATTGAT GACGCCAGCA AATTAAGCCT GATTTTCAAT AGCGTGGATA TCAAAGCAGA TGACCCAGGT GGGCTAACCA AAGCAGAATG GAAAGCGAAT CCGCAACAAG CGCCTCGTGC AGAACAGTAC GACACGCGAA AAACCATCAA GCAAACTCAG GCTGGGTTGC GCTATGAACG TAGCCTGAGT TCGCGGGATG ATATGAGTGT GATGATGTAT GCCGGAGAGC GAGAAACGAC CCAGTACCAG TCAATACCCA TGGCACCACA ACTTAACCCG TCACATGCGG GCAGCGTGAT TACCCTGCAA CGCCATTACC AGGGAATAGA CAGCCGCTGG ACACACCGTG GTGAACTGGG CGTTCCGGTC ACGTTCACTA CCGGCCTGAA CTACGAAAAC ATGAGTGAAA ACCGCAAGGG CTACAATAAC TTCCGCCTGA ATAGCGGCAT GCCGGAGTAC GGGCAAAAAG GTGAGTTGCG TCGCGACGAA CGCAATCTGA TGTGGAACAT CGATCCCTAT TTACAGACGC AGTGGCAGCT GAGCGAAAAA CTGTCGCTGG ATGCTGGCGT GCGCTACAGC TCCGTGTGGT TTGATTCCAA CGACCATTAC GTTACTCCGG GTAACGGCGA TGACAGCGGT GATGCCAGTT ATCACAAATG GCTACCAGCC GGATCGTTAA AATATGCAAT GACCGATGCC TGGAATATCT ATCTGGCAGC CGGGCGTGGT TTTGAAACGC CGACGATTAA TGAGCTGTCT TATCGCGCTG ATGGGCAAAG CGGTATGAAC TTTGGTTTAA AACCATCCAC CAACGATACA ATTGAGATCG GCAGTAAAAC GCGTATTGGT GATGGGCTGC TTAGTCTCGC ATTGTTCCAG ACCGACACCG ATGATGAAAT TGTTGTCGAT AGCAGTAGCG GTGGGCGTAC GACTTACAAA AATGCTGGAA AGACCCGTCG TCAAGGCGCT GAACTGGCAT GGGATCAACG TTTCGCGGGA GATTTTCGCG TAAACGCGTC CTGGACCTGG CTTGATGCGA CCTATCGCAG CAATGTTTGC AACGAACAGG ATTGTAACGG TAATCGGATG CCAGGGATCG CCCGTAATAT GGGCTTTGCG TCGATAGGTT ATGTACCGGA AGATGGTTGG TATGCAGGCA TGGAAGCGCG TTATATGGGC GATATTATGG CAGATGATGA AAATACGGCC AAAGCGCCGT CTTATACTCT CGTCGGCTTA TTCACCGGGT ATAAATACAA TTACCACAAT TTAACTGTGG ATTTGTTTGG TCGTGTCGAT AATTTATTCG ATAAAGAATA CGTTGGTTCT GTCATTGTCA ATGAGTCAAA CGGGCGATAT TACGAACCTG CGCCCGGGCG AAATTATGGT GTCGGCGTGA ATATCGCATG GCGATTTGAG TAA
|
Protein sequence | MKIVFVRQTV LPALLVLSPV VFAADEQTMI VSAAPQVVSE LDTPAAVSVV DGEEMRLATP RINLSESLTG VPGLQVQNRQ NYAQDLQLSI RGFGSRSTYG IRGIRLYVDG IPATMPDGQG QTSNIDLSSV QNVEVLRGPF SALYGNASGG VMNVTTQTGQ QPPTIEASSY YGSFGSWRYG LKATGATGDG TQPGDVDYTV STTRFTTHGY RDHSGAQKNL ANAKLGVRID DASKLSLIFN SVDIKADDPG GLTKAEWKAN PQQAPRAEQY DTRKTIKQTQ AGLRYERSLS SRDDMSVMMY AGERETTQYQ SIPMAPQLNP SHAGSVITLQ RHYQGIDSRW THRGELGVPV TFTTGLNYEN MSENRKGYNN FRLNSGMPEY GQKGELRRDE RNLMWNIDPY LQTQWQLSEK LSLDAGVRYS SVWFDSNDHY VTPGNGDDSG DASYHKWLPA GSLKYAMTDA WNIYLAAGRG FETPTINELS YRADGQSGMN FGLKPSTNDT IEIGSKTRIG DGLLSLALFQ TDTDDEIVVD SSSGGRTTYK NAGKTRRQGA ELAWDQRFAG DFRVNASWTW LDATYRSNVC NEQDCNGNRM PGIARNMGFA SIGYVPEDGW YAGMEARYMG DIMADDENTA KAPSYTLVGL FTGYKYNYHN LTVDLFGRVD NLFDKEYVGS VIVNESNGRY YEPAPGRNYG VGVNIAWRFE
|
| |