Gene Information |
Locus tag | EcSMS35_1952 |
Symbol | |
ID | 6144049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1972870 |
End bp | 1974840 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616828 |
Product | TonB-dependent receptor |
Protein accession | YP_001744004 |
Protein GI | 170682916 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
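
The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for EcSMS35_1952.
length = gene_length(1972870, 1974840)
print(length, protein_length(length))
```

With the values in this record, the sketch gives 1971 bp and 656 aa, matching the Gene Length and Protein Length fields.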
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.446334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0364758 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGACTCA AAAAACGTTA TCTCTGCACA GCATTATCAC TTGCCTTTAC CCAGCAGGCC GTAGCGGCTC AGGAGAGTGA CACGCTGACC GTATGGTCCA GTCCGGTATC ATCGACGACG ACCACCGTTC TCGATCAACC CACCATGAAG GCCCTGGATA AACAGAATGT CGCTCAGGCA TTAAGTGTCG TCCCCGGCGT GGTGCTGCAA AAGTCAGGTA GCCGCAACGA AGAACAGGTC AAAGTTCGCG GCTTTGATAG TCGTCAGGTG CCAGTCTATT TCGAGGGTGT GCCCATTTAT GTTCCCTATG ACGGCAACCT CGATCTGGCG CGGATTCTGA CCAACAATTT GGGGGCAGTT GAAGTTTCCA AAGGGTATTC GTCGCTGCTT CAGGGGCCTA ATCAGATGGG CGGAGCCATT AATATCACCA CCCAGAAGCC AACAAAACCT CTGGAAGCAA ATCTGGGATA TCGCCAGGGA TGGAGCCGTA GCCAGGACAA TGCCTACGAT ATGCATGCTT CATTTGCCGC CAGCAGCGAT CTGGGGTATT TGCAAGTCAG CGGTAGCCAG CTAAAGCAGG ATTTTCTCGG CCTGCCGCAT GGTGTAAATA ATGATATTGC AGGCAAACAC GGCAAGATGA TTAATTCATC GGCTGATGAT AAACGCGGCA TTGTGAAGCT CGGTTTTACA CCACGCGAAA ACGATGAATA CACATTGACT TACATTAAGC AGGATGGTGA AAAAGATAAC CCGCCATACA GTGGAAATAG TGGTCAAAAA TCACGCTACT GGCAGTGGCC AGAGTATGAC AAAGAAAGTT TTTATTATCA GGGAACGACC CAACTAAACG ATCGTTTTAC CCTGAAAAGT CGGCTGTATC GCGACACCTT TGAAAATACG CTAATGATGT ACAACTCGCT GGCTGATTTG AAAAATAAAA AAGGCAGCTA CAGCCATTAT TCCGATTACA GCGACGGTGC CGGGTTACAA CTGGCAGCCG ATGTGCGTGA AAACGATCTG CTGTCGTTTG CCGTTAACTG GAAAGATGAC GTACACCGGG AAAAAGGTGC GCCACACGCC GCTTACGATC GCTATGAAGA TCGTACCTGG TCGCTCGCCA GTGAATATCA ATGGGCTGCT GCCGATAATG TCGATGTCGT GGCTGGAATC AGCTATGACT GGCGCGATAG CGTAGAAGCG AAAAAACATG AGAAAGATGG CAGTATCACC CATTATGACG ACAACAATCA GTCAGCTTTT AACTGGCAGG TGATGGGGAA ATACCACTTT GCCAATGAAG ACACGCTGGC GCTTTCGTAC TATGACCGCA CACGCTTTCC GACGCTGAAA GAACGCTATA CCACGTCCAA ACCTGCGTAT AACCAGATAG CGATTGTTAA CCCGCAGCTC AAACCGGAAC GCGCGCGCGG GGTGGATTTA ACCTGGAATG GTGCCTTCAC GCACGACTGG GGGTTTGAGG TCAGCGTTTA CTATAACCGG GTGAGTGATG CCATCCTCTC GCACAATATC GATGCCGATA CCATTCAAAA TCAGAACAGC GGCACGGTGG ATTACAGCGG TCTGGATGCC GGTATTAAGG GGAAAATCAG CAATATACTG GATGTAGGAT TGAGCTACGC CCTGATCCAC GCTGACGCCA AACGCAAAGA CATCGGCAAG ATAACCGATC TGCCAACGCA GACAATGACC GCATGGATGA CTCTCAAACC GTGGGAGCCG TTAAGCGTAA CGCTGTCGGA AGAGGCGCGT TCCTCCAGCT ACAGCAACAG TGACGGTTCA 
CAAAAAGCCG CCGGTTTTGC GGTGACCCAC ATTCGAGCCG ATTACACCTT AGGTCATGGC TTCAGCGTCA ATGCGTCGGT CAATAACCTG TTTGATACCA AATATGCCTA CAGTGAAGGG TTTATTGAAG AAGGTCGCAA TTTCTGGGCA GGTATCGCGT ATACGTTCTG A
|
Protein sequence | MRLKKRYLCT ALSLAFTQQA VAAQESDTLT VWSSPVSSTT TTVLDQPTMK ALDKQNVAQA LSVVPGVVLQ KSGSRNEEQV KVRGFDSRQV PVYFEGVPIY VPYDGNLDLA RILTNNLGAV EVSKGYSSLL QGPNQMGGAI NITTQKPTKP LEANLGYRQG WSRSQDNAYD MHASFAASSD LGYLQVSGSQ LKQDFLGLPH GVNNDIAGKH GKMINSSADD KRGIVKLGFT PRENDEYTLT YIKQDGEKDN PPYSGNSGQK SRYWQWPEYD KESFYYQGTT QLNDRFTLKS RLYRDTFENT LMMYNSLADL KNKKGSYSHY SDYSDGAGLQ LAADVRENDL LSFAVNWKDD VHREKGAPHA AYDRYEDRTW SLASEYQWAA ADNVDVVAGI SYDWRDSVEA KKHEKDGSIT HYDDNNQSAF NWQVMGKYHF ANEDTLALSY YDRTRFPTLK ERYTTSKPAY NQIAIVNPQL KPERARGVDL TWNGAFTHDW GFEVSVYYNR VSDAILSHNI DADTIQNQNS GTVDYSGLDA GIKGKISNIL DVGLSYALIH ADAKRKDIGK ITDLPTQTMT AWMTLKPWEP LSVTLSEEAR SSSYSNSDGS QKAAGFAVTH IRADYTLGHG FSVNASVNNL FDTKYAYSEG FIEEGRNFWA GIAYTF
|
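
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating codons with translation table 11. The sketch below is a minimal, hedged illustration rather than the database's own method: the names `CODON_TABLE` and `translate` are hypothetical, and the codon-to-amino-acid assignments are assumed to be those of the standard genetic code, which table 11 shares. Table 11 additionally permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# ASSUMPTION: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a plus-strand CDS fragment, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Trailing bases that do not complete a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate("ATGAGACTCA AAAAACGTTA TCTCTGCACA"))
print(translate("GGTATCGCGT ATACGTTCTG A"))
```

The two fragments translate to MRLKKRYLCT and GIAYTF, which are the first ten and last six residues of the listed protein sequence.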