Gene EcSMS35_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3802 
SymbolchuA 
ID6146648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3867974 
End bp3869956 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content50% 
IMG OID641618628 
Productouter membrane heme/hemoglobin receptor ChuA 
Protein accessionYP_001745768 
Protein GI170681293 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID[TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
[TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.241464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGTC CGCAATTTAC CTCGTTGCGT TTGAGTTTGT TGGCTTTGGC TGTTTCTGCC 
ACCTTGCCAA CGTTTGCTTT TGCTACTGAA ACCATGACCG TTACGGCAAC GGGGAATGCA
CGTAGTTCCT TCGAAGCGCC TATGATGGTC AGCGTTATCG ACACTTCCGC TCCTGAAAAT
CAAACGGCTA CTTCAGCCAC TGATTTGCTG CGTCATGTTC CTGGAATTAC TCTTGATGGT
ACCGGACGAA CCAACGGTCA GGATGTAAAT ATGCGTGGCT ATGATCATCG CGGCGTGCTG
GTTCTTGTCG ATGGTGTTCG CCAGGGAACG GATACCGGAC ACCTGAATGG CACTTTTCTC
GATCCGGCGC TGATCAAGCG TGTTGAGATT GTTCGCGGAC CTTCAGCATT ACTGTATGGC
AGTGGCGCGC TGGGTGGAGT GATCTCCTAC GATACGGTCG ATGCAAAAGA TTTATTGCAG
GAAGGACAAA GCAGTGGTTT TCGTGTCTTT GGTACTGGCG GCACGGGGGA CCATAGCCTG
GGATTAGGCG CGAGCGCGTT TGGGCGAACT GAAAATCTGG ATGGTATTGT GGCCTGGTCC
AGTCGCGATC GGGGTGATTT ACGCCAGAGC AATGGTGAAA CCGCGCCGAA TGACGAGTCC
ATTAATAACA TGCTGGCGAA AGGGACTTGG CAAATTGATT CAGCCCAGTC TCTGAGCGGT
TTAGTGCGTT ACTACAACAA CGACGCGCGT GAACCAAAAA ATCCGCAGAC TGTTGAGGCT
TCTGATAGCA GCAACCCGAT GGTTGATCGT TCAACAATTC AACGCGATGC GCAGCTTTCT
TATAAACTCG CCCCGCAGGG CAACGACTGG TTAAATGCAG ATGCAAAAAT TTACTGGTCG
GAAGTCCGTA TTAATGCGCA AAACACGGGG AGTTCCGGCG AGTATCGTGA ACAGATAACA
AAAGGAGCCA GGCTGGAGAA CCGTTCCACT CTATTTGCCG ACAGTTTCGC TTCTCACTTA
CTGACATATG GCGGTGAGTA TTATCGTCAG GAACAACATC CGGGCGGCGC GACCACGGGC
TTCCCGCAAG CAAAAATCGA TTTTAGCTCT GGCTGGCTAC AGGATGAGAT CACCTTACGC
GATCTGCCGA TTACCCTGCT TGGCGGAACC CGCTATGACA GTTATCGCGG TAGCAGCGAC
GGCTACAAAG ATGTTGATGC TGACAAATGG TCATCTCGTG CGGGGATGAC TATCAACCCG
ACCAACTGGC TGATGTTATT TGGCTCATAT GCCCAGGCAT TCCGCGCCCC GACGATGGGC
GAAATGTATA ACGATTCTAA GCACTTCTCG ATTGGTCGCT TCTATACCAA CTATTGGGTG
CCAAACCCGA ACTTACGTCC GGAAACTAAC GAAACTCAGG AGTACGGTTT TGGGCTGCGT
TTTGATGACC TGATGTTGTC CAATGATGCT CTGGAATTTA AAGCCAGCTA CTTTGATACC
AAAGCGAAGG ATTACATCTC CACGACCGTC GATTTCGCGG CGGCGACAAC TATGTCGTAT
AACGTCCCGA ACGCCAAAAT CTGGGGCTGG GATGTAATGA CGAAATATAC CACTGATCTG
TTTAGCCTTG ATGTGGCCTA TAACCGTACC CGCGGCAAAG ACACCGATAC TGGGGAATAT
ATCTCCAGCA TTAACCCGGA TACCGTTACC AGTACCCTGA ATATTCCGAT CGCTCACAGT
GGCTTCTCTG TTGGGTGGGT TGGTACGTTT GCCGATCGCT CAACACATAT CAGCAGCAGT
TACAGCAAAC AACCAGGCTA TGGCGTGAAT GATTTCTACG TCAGTTATCA AGGACAACAG
GCGCTCAAAG GTATGACCAC TACTTTGGTG TTGGGTAACG CTTTCGACAA AGAGTACTGG
TCACCACAAG GCATCCCACA GGATGGTCGT AACGGAAAAA TTTTCGTGAG TTATCAATGG
TAA
 
Protein sequence
MSRPQFTSLR LSLLALAVSA TLPTFAFATE TMTVTATGNA RSSFEAPMMV SVIDTSAPEN 
QTATSATDLL RHVPGITLDG TGRTNGQDVN MRGYDHRGVL VLVDGVRQGT DTGHLNGTFL
DPALIKRVEI VRGPSALLYG SGALGGVISY DTVDAKDLLQ EGQSSGFRVF GTGGTGDHSL
GLGASAFGRT ENLDGIVAWS SRDRGDLRQS NGETAPNDES INNMLAKGTW QIDSAQSLSG
LVRYYNNDAR EPKNPQTVEA SDSSNPMVDR STIQRDAQLS YKLAPQGNDW LNADAKIYWS
EVRINAQNTG SSGEYREQIT KGARLENRST LFADSFASHL LTYGGEYYRQ EQHPGGATTG
FPQAKIDFSS GWLQDEITLR DLPITLLGGT RYDSYRGSSD GYKDVDADKW SSRAGMTINP
TNWLMLFGSY AQAFRAPTMG EMYNDSKHFS IGRFYTNYWV PNPNLRPETN ETQEYGFGLR
FDDLMLSNDA LEFKASYFDT KAKDYISTTV DFAAATTMSY NVPNAKIWGW DVMTKYTTDL
FSLDVAYNRT RGKDTDTGEY ISSINPDTVT STLNIPIAHS GFSVGWVGTF ADRSTHISSS
YSKQPGYGVN DFYVSYQGQQ ALKGMTTTLV LGNAFDKEYW SPQGIPQDGR NGKIFVSYQW