Gene ECH74115_4856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4856 
SymbolhmuS 
ID6967086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4491464 
End bp4492492 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content49% 
IMG OID643388546 
Producthemin transport protein HmuS 
Protein accessionYP_002272974 
Protein GI209396915 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3720] Putative heme degradation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCACT ACACACGCTG GCTTGAGTTA AAAGAACAAA ATCCCGGAAA GTACGCGCGT 
GACATCGCAG GGTTAATGAA TATTAGAGAA GCAGAACTGG CATTTGCACG CGTCACGCAC
GATGCGTGGC GGATGCACGG CGATATCCGT GAAATTCTGG CGGCGCTCGA AAGTGTTGGC
GAAACCAAAT GTATTTGTCG TAATGAATAT GCAGTCCATG AGCAAGTTGG TACGTTCACA
AACCAGCATT TGAACGGACA TGCCGGATTG ATCCTCAATC CGCGCGCGCT GGATTTACGT
CTGTTTCTCA ATCAATGGGC CAGTGTTTTC CACATCAAAG AAAACACGGC TCGTGGCGAA
CGCCAGAGTA TTCAGTTCTT TGATCATCAG GGCGATGCAT TACTAAAAGT TTATGCCACC
GACAATACCG ATATGGCGGC ATGGAGTGAG CTTCTGGCAC GGTTTATCAC CGATGAGAAT
ACGCCGCTTG AGTTAAAAGC CGTTGATGCG CCAGTTGTTC AAACGCGAGC CGATGCCACT
GTGGTCGAGC AAGAGTGGCG GGCGATGACC GACGTTCATC AGTTTTTTAC GTTGCTCAAG
CGCCACAACC TGACGCGCCA ACAGGCGTTC AATCTGGTGG CAGACGATTT GGCCTGCAAA
GTATCCAACA GTGCGTTGGC GCAAATTCTT GAATCTGCAC AGCAGGATGG TAATGAAATC
ATGGTGTTTG TTGGCAACCG TGGCTGCGTA CAGATTTTCA CCGGTGTGGT AGAAAAAGTG
GTGCCAATGA AAGGTTGGCT GAATATTTTC AACCCGACGT TTACTCTTCA TCTATTAGAA
GAGAGCATTG CTGAAGCCTG GGTTACCCGT AAACCGACCA GCGATGGCTA CGTAACCAGT
CTGGAATTGT TTGCCCATGA TGGTACGCAG ATAGCGCAAC TTTATGGTCA ACGTACAGAA
GGCGAACAGG AGCAAGCGCA ATGGCGTAAG CAAATTGCTT CGCTGATACC GGAAGGCGTT
GCTGCATAA
 
Protein sequence
MNHYTRWLEL KEQNPGKYAR DIAGLMNIRE AELAFARVTH DAWRMHGDIR EILAALESVG 
ETKCICRNEY AVHEQVGTFT NQHLNGHAGL ILNPRALDLR LFLNQWASVF HIKENTARGE
RQSIQFFDHQ GDALLKVYAT DNTDMAAWSE LLARFITDEN TPLELKAVDA PVVQTRADAT
VVEQEWRAMT DVHQFFTLLK RHNLTRQQAF NLVADDLACK VSNSALAQIL ESAQQDGNEI
MVFVGNRGCV QIFTGVVEKV VPMKGWLNIF NPTFTLHLLE ESIAEAWVTR KPTSDGYVTS
LELFAHDGTQ IAQLYGQRTE GEQEQAQWRK QIASLIPEGV AA