Gene EcSMS35_0165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0165 
SymbolfhuB 
ID6143524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp181780 
End bp183762 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content60% 
IMG OID641615066 
Productiron-hydroxamate transporter permease subunit 
Protein accessionYP_001742282 
Protein GI170683245 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0609] ABC-type Fe3+-siderophore transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTAAAC GAATTGCGCT TTTTCCGGTG TTATTGCTGG TGCTGCTGGT GGTTGCTGCT 
GCGGCGTTGA CCTGGATGAA CTTCTCGCAG GCGCTGCCGC GCAGCCAGTG GGCGCAGGCC
GCCTGGTCGC CGGATATTGA CGTCATCGAG CAGATGATTT TTCACTACAG CTTGTTGCCG
CGTCTGGCGA TTTCGCTGCT GGTGGGCGCG GGCCTGGGGC TGGTGGGCGT GCTGTTTCAG
CAAGTGCTGC GTAACCCGCT GGCGGAACCG ACGACGTTGG GCGTTGCAAC AGGCGCGCAA
CTGGGGATTA CCGTCACCAC GCTCTGGGCG ATCCCAGGTG CTATGGCGAG CCAGTTTGCT
GCATTGGCAG GGGCTTGTGT GGTTGGCTTA ATCGTCTTTG GCGTCGCGTG GGGGAAACGG
CTTTCGCCGG TAACGCTGAT CCTCGCGGGG CTGGTAGTGA GCCTTTATTG CGGCGCAATC
AATCAGTTAC TGGTTATCTT CCATCATGAC CAACTGCAAA GCATGTTCCT GTGGAGCACC
GGAACGCTGA CGCAAACCGA CTGGGGCGGC GTTGAGCGTT TATGGCCGCA GCTGCTGGGC
GGCGTGATGC TGACGTTATT GCTACTTCGC CCGTTAACTC TGATGGGGCT TGATGATGGA
GTGGCGCGCA ATCTCGGGCT GGCCTTGTCG CTGGCGCGTC TGGCGGCGTT GTCGCTGGCG
ATTGTCATCA GTGCGCTGCT GGTGAACGCG GTGGGGATTA TCGGCTTTAT CGGTTTGTTC
GCGCCACTGC TGGCGAAAAT GCTGGGGGCG CGGCGTCTGC TACCAAGGTT GATGCTGGCG
TCGCTGATTG GTGCGCTGAT CCTCTGGCTT TCCGATCAAA TCATCCTCTG GCTGACTCGC
GTGTGGATGG AAGTGTCCAC CGGTTCGGTC ACTGCGTTGA TCGGTGCGCC GCTGCTACTG
TGGTTGCTGC CGCGTTTACG CAGCATTAGC GCGCCGGATA TGAAGGTCAA CGATCGTGTC
GAGACTGAAC GCCAACACGT GCTGGCGTTT GCCCTCGCGG GCGGCGTGCT GCTGTTGATG
GCTGTGGTGG TGGCGCTGTC GTTTGGTCGT GATGCGCACG GCTGGACGTG GGCGAGCGGG
GCGCTGCTCG ATGATTTAAT GCCCTGGCGC TGGCCGCGAA TTATGGCGGC ACTGTTTGCG
GGCATCATGC TGGCAGTGGC GGGCTGTATT ATTCAGCGGC TGACCGGAAA CCCGATGGCA
AGCCCGGAAG TGCTGGGGAT TAGTTCCGGC GCGGCGTTTG GTGTGGTGTT GATGCTGTTT
TTGGTGCCGG GTAATGCCTT TGGCTGGCTG TTACCCGCGG GCAGCCTCGG GGCGGCGGTG
ACGCTGTTGA TCATTATGAT CGCCGCCGGA CGCGGTGGAT TTTCCCCACA CCGCATGTTA
CTGGCGGGGA TGGCGTTAAG TACCGCGTTC ACCATGCTTT TGATGATGTT GCAGGCAAGT
GGTGACCCGC GAATGGCGCA AGTTCTGACC TGGATTTCCG GTTCGACCTA CAACGCGACC
GAGGCGCAGG TCTGGCGCAC GGGAATTGTG ATGGTAATTT TGCTGGCGAT TACCCCGCTG
TGCCGCCGCT GGCTGACCAT TTTACCGCTG GGCGGTGATA CCGCACGAGC CGTAGGAATG
GCGCTGACGC CGACGCGAAT TGCGCTGCTG CTGTTAGCGG CTTGCCTGAC AGCGACTGCG
ACAATGACCA TTGGACCGTT GAGTTTTGTT GGTTTAATGG CACCGCATAT TGCGCGGATG
ATGGGCTTTC GACGGACGAT GCCACACATC GTAATTTCGG CGCTGGTGGG TGGTTTACTG
CTGGTGTTCG CTGACTGGTG TGGGCGGATG GTATTGTTTC CATTCCAGAT CCCGGCGGGG
CTGCTGTCGA CCTTTATCGG CGCGCCATAT TTTATCTATT TGTTGAGAAA GCAGAGCCGT
TAA
 
Protein sequence
MSKRIALFPV LLLVLLVVAA AALTWMNFSQ ALPRSQWAQA AWSPDIDVIE QMIFHYSLLP 
RLAISLLVGA GLGLVGVLFQ QVLRNPLAEP TTLGVATGAQ LGITVTTLWA IPGAMASQFA
ALAGACVVGL IVFGVAWGKR LSPVTLILAG LVVSLYCGAI NQLLVIFHHD QLQSMFLWST
GTLTQTDWGG VERLWPQLLG GVMLTLLLLR PLTLMGLDDG VARNLGLALS LARLAALSLA
IVISALLVNA VGIIGFIGLF APLLAKMLGA RRLLPRLMLA SLIGALILWL SDQIILWLTR
VWMEVSTGSV TALIGAPLLL WLLPRLRSIS APDMKVNDRV ETERQHVLAF ALAGGVLLLM
AVVVALSFGR DAHGWTWASG ALLDDLMPWR WPRIMAALFA GIMLAVAGCI IQRLTGNPMA
SPEVLGISSG AAFGVVLMLF LVPGNAFGWL LPAGSLGAAV TLLIIMIAAG RGGFSPHRML
LAGMALSTAF TMLLMMLQAS GDPRMAQVLT WISGSTYNAT EAQVWRTGIV MVILLAITPL
CRRWLTILPL GGDTARAVGM ALTPTRIALL LLAACLTATA TMTIGPLSFV GLMAPHIARM
MGFRRTMPHI VISALVGGLL LVFADWCGRM VLFPFQIPAG LLSTFIGAPY FIYLLRKQSR