Gene EcHS_A0154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0154 
SymbolfhuA 
ID5595319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp166532 
End bp168721 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content45% 
IMG OID640919340 
Productferrichrome outer membrane transporter 
Protein accessionYP_001456935 
Protein GI157159617 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGTT CCAAAACTGC TCAGCCAAAA CACTCACTGC GTAAAATCGC AGTTGTAGTA 
GCCACAGCGG TAAGCGGCAT GTCTGTTTAT GCACAGGCAG CGGTTGAACC GAAAGAAGAC
ACTATCACCG TTACCGCTGC ACCTGCGCCG CAAGAAAGCG CATGGGGGCC TGCTGCAACT
ATTGCGGCGA AGCACTCTGC TACTGCGACT AAAACGGATA CACCAATTGA AAAAACGCCG
CAGTCTATTT CGGTTGTGAC AAATGAAGAG ATGCAGATGC ATCAATTTCA GTCTGTAAAA
GAAGCATTAG GTTATACACC TGGCGTTACG GTTAGCAGTC GTGGTGCTTC TAATACATAT
GATTTTGTTA TCATTCGTGG TTTCTCATCT GTTGGTCTGA ATCAAAATAA TTACCTTGAT
GGACTAAAAC TTCAGGGCAA CTTTTATAAC GATGCTGTGA TTGATCCTTA CATGCTCGAG
AGGGTTGAAC TGATGCGTGG GCCGACGTCT GTTCTCTATG GAAAAAGTAA TCCTGGTGGG
ATTATTTCTA TGGTGAGTAA GCGTCCGACC ACTGAACCCC TGAAAGAAAT TCAGTTTAAA
ATGGGGACGG ATAATCTGTT TCAGACTGGA TTCGATTTCA GTGATGCACT GGATGATAAC
GGTGAGTTCT CCTACCGTTT GACGGGCCTC GCTCGTTCAA CAAACGAACA GCAGAAAAAT
TCTGAATCCC AGCGTTATAC CATTGCTCCA TCATTCTCAT GGCGTCCAGA CGACAAAACC
AATTTTACTT TCCTGTCCTA TTTCCAGAAT GAACCAGAAA CGGGTTATTA CGGTTGGTTG
CCGAAAGAGG GGACCGTTGA GCCATTGCCT AATGGTAAAC GTCTACCGAC TGACTTCAAT
GAAGGTGCAT CGAATAATAC ATACTCCCGT AACCAGAAAA TGGTGGGATA TAGTTTTGAA
CATGGTTTTA ATGACACCTT CACCGTGCGT CAGAATCTGC GTTTCAGTGA AATGAAAACC
TCACAGAAAA GTGTTTATGG CACAGGGATT GCCAACGATG GTCATACTCT AAACCGCGGG
ACAGTGGTGG ATAATGAGCG TCTGCAAAAC TTTAGCGTTG ATACCCAACT TGAAAGTAAA
TTTGCTACAG GTGAAGTGGA GCATACTTTG CTGACAGGGG TAGACTTCAT GCGTATGCGC
AATGATATCA ATGCCAGCTT TGGATCTGCA CCATCAATCG ATCTTTATAA CAAATATCAT
CCTGAATACT TTGCATTTGG TAACGCAGAG CCATACCAAA TGAATGAAAG CAAACAAACA
GGTATTTATG TTCAGGATCA GGCGGAATGG AATAAATGGG TATTCACTCT GGGGGGACGC
TACGATTGGT CTAAGCAAGC GACCACTGTT CGTGAAAACT CTTATACGCC GACTGAAGGT
TATATTGAGC GTAATGATCA TCAGTTCACC TGGCGCGGTG GTGTAAATTA CTTATTCGAT
AATGGTATTT CACCTTACTT TAGCTATAGC CAGTCCTTTG AACCGAGTGC TTTCGATCTG
TGGAGCAACC CGCGCGTTTC CTATAAGCCA TCGAAAGGTG AACAGTATGA AGCTGGCGTA
AAATATGTTC CGAATGATAT GCCGGTCGTT GTTACGGGCG CAGTCTATCA ATTGACGAAA
ACAAATAACC TGACAGCAGA CCCAACAAAC CCGTTAGCGC AAGTCCCAGC AGGTGAGATT
CGCGCTCGTG GTGTGGAGCT TGAAGCAAAA GCAGCGTTAA ATGCCAATAT TAACTTGACG
GCTTCTTATA CCTACACGGA TGCGGAATAC ACCAAAGACA CCAATCTCAA AGGTAAAACT
CCAGAACAAG TACCGGAGCA TATGGCATCT CTCTGGGGGG ATTATACCTT CAATGAAGGG
CCGCTTTCTG GTTTAACATT GGGAACAGGT GGTCGTTTTA TTGGTTCCAG CTATGGTGAT
CCGGCAAACA CTTTTAAAGT GGGTAGCGCA GCTGTAATGG ATGCTGTTGT AAAATATGAT
CTGGCACGCT TTGGTATGGC GGGATCCAGC CTTGCTGTGA ACGTCAACAA TTTGCTCGAT
CGTGAGTATG TTGCCAGTTG CTTCCAGACT TATGGCTGCT TCTGGGGCGC AGAACGTCAG
GTCGTTGCAA CCGCAACCTT CCGTTTCTAA
 
Protein sequence
MARSKTAQPK HSLRKIAVVV ATAVSGMSVY AQAAVEPKED TITVTAAPAP QESAWGPAAT 
IAAKHSATAT KTDTPIEKTP QSISVVTNEE MQMHQFQSVK EALGYTPGVT VSSRGASNTY
DFVIIRGFSS VGLNQNNYLD GLKLQGNFYN DAVIDPYMLE RVELMRGPTS VLYGKSNPGG
IISMVSKRPT TEPLKEIQFK MGTDNLFQTG FDFSDALDDN GEFSYRLTGL ARSTNEQQKN
SESQRYTIAP SFSWRPDDKT NFTFLSYFQN EPETGYYGWL PKEGTVEPLP NGKRLPTDFN
EGASNNTYSR NQKMVGYSFE HGFNDTFTVR QNLRFSEMKT SQKSVYGTGI ANDGHTLNRG
TVVDNERLQN FSVDTQLESK FATGEVEHTL LTGVDFMRMR NDINASFGSA PSIDLYNKYH
PEYFAFGNAE PYQMNESKQT GIYVQDQAEW NKWVFTLGGR YDWSKQATTV RENSYTPTEG
YIERNDHQFT WRGGVNYLFD NGISPYFSYS QSFEPSAFDL WSNPRVSYKP SKGEQYEAGV
KYVPNDMPVV VTGAVYQLTK TNNLTADPTN PLAQVPAGEI RARGVELEAK AALNANINLT
ASYTYTDAEY TKDTNLKGKT PEQVPEHMAS LWGDYTFNEG PLSGLTLGTG GRFIGSSYGD
PANTFKVGSA AVMDAVVKYD LARFGMAGSS LAVNVNNLLD REYVASCFQT YGCFWGAERQ
VVATATFRF