Gene EcSMS35_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1952 
Symbol 
ID6144049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1972870 
End bp1974840 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content50% 
IMG OID641616828 
ProductTonB-dependent receptor 
Protein accessionYP_001744004 
Protein GI170682916 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.446334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0364758 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCA AAAAACGTTA TCTCTGCACA GCATTATCAC TTGCCTTTAC CCAGCAGGCC 
GTAGCGGCTC AGGAGAGTGA CACGCTGACC GTATGGTCCA GTCCGGTATC ATCGACGACG
ACCACCGTTC TCGATCAACC CACCATGAAG GCCCTGGATA AACAGAATGT CGCTCAGGCA
TTAAGTGTCG TCCCCGGCGT GGTGCTGCAA AAGTCAGGTA GCCGCAACGA AGAACAGGTC
AAAGTTCGCG GCTTTGATAG TCGTCAGGTG CCAGTCTATT TCGAGGGTGT GCCCATTTAT
GTTCCCTATG ACGGCAACCT CGATCTGGCG CGGATTCTGA CCAACAATTT GGGGGCAGTT
GAAGTTTCCA AAGGGTATTC GTCGCTGCTT CAGGGGCCTA ATCAGATGGG CGGAGCCATT
AATATCACCA CCCAGAAGCC AACAAAACCT CTGGAAGCAA ATCTGGGATA TCGCCAGGGA
TGGAGCCGTA GCCAGGACAA TGCCTACGAT ATGCATGCTT CATTTGCCGC CAGCAGCGAT
CTGGGGTATT TGCAAGTCAG CGGTAGCCAG CTAAAGCAGG ATTTTCTCGG CCTGCCGCAT
GGTGTAAATA ATGATATTGC AGGCAAACAC GGCAAGATGA TTAATTCATC GGCTGATGAT
AAACGCGGCA TTGTGAAGCT CGGTTTTACA CCACGCGAAA ACGATGAATA CACATTGACT
TACATTAAGC AGGATGGTGA AAAAGATAAC CCGCCATACA GTGGAAATAG TGGTCAAAAA
TCACGCTACT GGCAGTGGCC AGAGTATGAC AAAGAAAGTT TTTATTATCA GGGAACGACC
CAACTAAACG ATCGTTTTAC CCTGAAAAGT CGGCTGTATC GCGACACCTT TGAAAATACG
CTAATGATGT ACAACTCGCT GGCTGATTTG AAAAATAAAA AAGGCAGCTA CAGCCATTAT
TCCGATTACA GCGACGGTGC CGGGTTACAA CTGGCAGCCG ATGTGCGTGA AAACGATCTG
CTGTCGTTTG CCGTTAACTG GAAAGATGAC GTACACCGGG AAAAAGGTGC GCCACACGCC
GCTTACGATC GCTATGAAGA TCGTACCTGG TCGCTCGCCA GTGAATATCA ATGGGCTGCT
GCCGATAATG TCGATGTCGT GGCTGGAATC AGCTATGACT GGCGCGATAG CGTAGAAGCG
AAAAAACATG AGAAAGATGG CAGTATCACC CATTATGACG ACAACAATCA GTCAGCTTTT
AACTGGCAGG TGATGGGGAA ATACCACTTT GCCAATGAAG ACACGCTGGC GCTTTCGTAC
TATGACCGCA CACGCTTTCC GACGCTGAAA GAACGCTATA CCACGTCCAA ACCTGCGTAT
AACCAGATAG CGATTGTTAA CCCGCAGCTC AAACCGGAAC GCGCGCGCGG GGTGGATTTA
ACCTGGAATG GTGCCTTCAC GCACGACTGG GGGTTTGAGG TCAGCGTTTA CTATAACCGG
GTGAGTGATG CCATCCTCTC GCACAATATC GATGCCGATA CCATTCAAAA TCAGAACAGC
GGCACGGTGG ATTACAGCGG TCTGGATGCC GGTATTAAGG GGAAAATCAG CAATATACTG
GATGTAGGAT TGAGCTACGC CCTGATCCAC GCTGACGCCA AACGCAAAGA CATCGGCAAG
ATAACCGATC TGCCAACGCA GACAATGACC GCATGGATGA CTCTCAAACC GTGGGAGCCG
TTAAGCGTAA CGCTGTCGGA AGAGGCGCGT TCCTCCAGCT ACAGCAACAG TGACGGTTCA
CAAAAAGCCG CCGGTTTTGC GGTGACCCAC ATTCGAGCCG ATTACACCTT AGGTCATGGC
TTCAGCGTCA ATGCGTCGGT CAATAACCTG TTTGATACCA AATATGCCTA CAGTGAAGGG
TTTATTGAAG AAGGTCGCAA TTTCTGGGCA GGTATCGCGT ATACGTTCTG A
 
Protein sequence
MRLKKRYLCT ALSLAFTQQA VAAQESDTLT VWSSPVSSTT TTVLDQPTMK ALDKQNVAQA 
LSVVPGVVLQ KSGSRNEEQV KVRGFDSRQV PVYFEGVPIY VPYDGNLDLA RILTNNLGAV
EVSKGYSSLL QGPNQMGGAI NITTQKPTKP LEANLGYRQG WSRSQDNAYD MHASFAASSD
LGYLQVSGSQ LKQDFLGLPH GVNNDIAGKH GKMINSSADD KRGIVKLGFT PRENDEYTLT
YIKQDGEKDN PPYSGNSGQK SRYWQWPEYD KESFYYQGTT QLNDRFTLKS RLYRDTFENT
LMMYNSLADL KNKKGSYSHY SDYSDGAGLQ LAADVRENDL LSFAVNWKDD VHREKGAPHA
AYDRYEDRTW SLASEYQWAA ADNVDVVAGI SYDWRDSVEA KKHEKDGSIT HYDDNNQSAF
NWQVMGKYHF ANEDTLALSY YDRTRFPTLK ERYTTSKPAY NQIAIVNPQL KPERARGVDL
TWNGAFTHDW GFEVSVYYNR VSDAILSHNI DADTIQNQNS GTVDYSGLDA GIKGKISNIL
DVGLSYALIH ADAKRKDIGK ITDLPTQTMT AWMTLKPWEP LSVTLSEEAR SSSYSNSDGS
QKAAGFAVTH IRADYTLGHG FSVNASVNNL FDTKYAYSEG FIEEGRNFWA GIAYTF