Gene EcSMS35_0162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0162 
SymbolfhuA 
ID6143743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp177787 
End bp180045 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content51% 
IMG OID641615063 
Productferrichrome outer membrane transporter 
Protein accessionYP_001742279 
Protein GI170681393 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTT CCAAAACTGC TCAGCCAAAA CACTCACTGC GTAAAATCGC AGTTGTAGTA 
GCCACAGCGG TTAGCGGCAT GTCTGTTTAT GCACAGGCAG CGGTTGAACC GAAAGAAGAC
ACTATCACCG TTACCGCTGC ACCTGCGCCG CAAGAAAGCG CATGGGGGCC AGCTGCAACT
ATTGCGGCGC GACAGTCCGC TACCGGCACT AAAACCGATA CGCCGATTCA AAAAGTGCCA
CAGTCTATTT CTGTTGTGAC CGCCGAAGAG ATGGCGCTGC ATCAGCCGAA GTCGGTAAAA
GAAGCTCTTA GCTACACTCC TGGCGTTGCC GTGGGAACTC GTGGCGCATC CAACACTTAC
GATTACCTGA TCATCCGCGG ATTTGCCGCT GACGGCCAAA GCCAGAACAA CTATCTGAAT
GGCCTGAAGA TGCAGGGCAA CTTCTATAAC GATGCAGTTA TCGATCCGTA TATGCTGGAG
CGCGCTGAAA TCATGCGTGG TCCGGTATCC GTGCTGTACG GGAAAAGCAG CCCTGGCGGC
CTGTTGAACA TGGTCAGCAA GCGCCCGACT ACAGAACCGC TGAAAGAAAT TCAGTTTAAA
GCCGGTACTG ACAGCCTGTT CCAGACTGGT TTTGACTTCA GTGATGCGCT GGATGATGAC
GGCGTTTACT CTTATCGTCT GACTGGTATT GCGCGTTCAG CCAATGCTCA GCAGAAGGGG
GCAGAAGAGC AGCGTTATGC TATTGCTCCA GCGTTCACCT GGCGCCCGGA TGATAAAACC
AATTTCACTT TCCTTTCTTA CTTCCAGAAC GAACCGGAAA CCGGTTATTA CGGCTGGTTG
CCGAAAGAGG GAACCGTTGA GCCGCTGCCG AACGGTAAGC GTCTGCCGAC AGACTTTAAT
GAAGGGGCGA AGAACAACAC CTATTCTCGT AATGAGAAGA TGGTGGGCTA CAGCTTCGAT
CACGAATTTA ACGACACCTT TACTGTGCGT CAGAACCTGC GCTTTGCGCA AAATAAAGTC
TCGCAAAAGA GCGTATATGG CTACGGCATG TGCTCGGATC CGCTGTATAC CAAAGACGAC
GATGCACTCA AGGCGAGTCC ATGTTTAAGT ATCCCGCAGT CAGAATGGAA TCATACACTG
ACTCGTCAGT ACGTTATCGA TAATGAGAAA TTAGAAAACT TCTCTGTTGA TACTCAGTTG
CAAAGTAAAT TCGCAACCGG CTCGGTTGAA CACACTCTGC TGACCGGCGT TGACTTTATG
CGTATGCGTA ATGATATTGA CTCCTGGTTC GGTTACGCCG GTTCCGTTGC ACCGTCTGAT
ATCTATAATT TAGACCGTAG TGACTTTGAT TTTGGTGCTC ACCCGGATCC GTCTGGCCCA
TACCGCGTTT TGCTTAAACA GAAACAAACC GGTCTGTATG TTCAGGATCA GGCGCAGTGG
GATAAGGTGC TGGTGACTCT GGGTGGTCGC TATGACTGGG CGGAACAGTC ATCTTTCAAC
CGTGATTACG GTAATAAATC CGATCGTGAT GACAAACAGT TCACCTGGCG TGGTGGCGTA
AACTACCTGT TCGACAACGG GGTAACGCCT TACTTCAGCT ACAGTGAGTC GTTTGAGCCA
GCATCTTTGA CAGATGCAAA CGGTGATCTG TTTGCACCTT CGAAAGGCAA ACAGTATGAA
GTTGGTGTGA AATATGTGCC GGAAGATCGC CCAATTGTGC TGACGGGCGC ACTGTATCAG
CTTACCAAAA CCAACAACCT GATGGCGGAT CCGAATAATC CCAATTTCTC GATTGAAGGC
GGTGAGATTC GCGCTCGTGG TGTAGAACTG GAAGCAAAAG CGGCGCTGTC GGCGAGTGTT
AACGTAGTAG GTTCTTATAC TTACACCGAT GCGGAATACA CCACCGACAC TACCTATAAA
GGCAATACGC CTGCACAGGT GCCAAAACAC ATGGCTTCGC TGTGGGCTGA CTATACCTTC
TTTGACGGTC CGCTTTCAGG TCTGACGCTG GGCACCGGTG GTCGTTATAC TGGCTCCAGC
TATGGTGATC CGGCTAACTC CTTTAAAGTG GGAAGTTATA CGGTCGTGGA TGCGTTAGTG
CGTTATGATC TGGCGCGAGT CGGCATGGCT GGCTCCAACG TGGCGCTGCA TGTCAACAAC
CTGTTCGATC GTGAATACGT CGCCAGCTGC TTTAACACCT ATGGCTGCTT CTGGGGCGCA
GAACGTCAGG TCGTTGCAAC CGCAACCTTC CGTTTCTAA
 
Protein sequence
MARSKTAQPK HSLRKIAVVV ATAVSGMSVY AQAAVEPKED TITVTAAPAP QESAWGPAAT 
IAARQSATGT KTDTPIQKVP QSISVVTAEE MALHQPKSVK EALSYTPGVA VGTRGASNTY
DYLIIRGFAA DGQSQNNYLN GLKMQGNFYN DAVIDPYMLE RAEIMRGPVS VLYGKSSPGG
LLNMVSKRPT TEPLKEIQFK AGTDSLFQTG FDFSDALDDD GVYSYRLTGI ARSANAQQKG
AEEQRYAIAP AFTWRPDDKT NFTFLSYFQN EPETGYYGWL PKEGTVEPLP NGKRLPTDFN
EGAKNNTYSR NEKMVGYSFD HEFNDTFTVR QNLRFAQNKV SQKSVYGYGM CSDPLYTKDD
DALKASPCLS IPQSEWNHTL TRQYVIDNEK LENFSVDTQL QSKFATGSVE HTLLTGVDFM
RMRNDIDSWF GYAGSVAPSD IYNLDRSDFD FGAHPDPSGP YRVLLKQKQT GLYVQDQAQW
DKVLVTLGGR YDWAEQSSFN RDYGNKSDRD DKQFTWRGGV NYLFDNGVTP YFSYSESFEP
ASLTDANGDL FAPSKGKQYE VGVKYVPEDR PIVLTGALYQ LTKTNNLMAD PNNPNFSIEG
GEIRARGVEL EAKAALSASV NVVGSYTYTD AEYTTDTTYK GNTPAQVPKH MASLWADYTF
FDGPLSGLTL GTGGRYTGSS YGDPANSFKV GSYTVVDALV RYDLARVGMA GSNVALHVNN
LFDREYVASC FNTYGCFWGA ERQVVATATF RF