Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4856 |
Symbol | hmuS |
ID | 6967086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4491464 |
End bp | 4492492 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643388546 |
Product | hemin transport protein HmuS |
Protein accession | YP_002272974 |
Protein GI | 209396915 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3720] Putative heme degradation protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCACT ACACACGCTG GCTTGAGTTA AAAGAACAAA ATCCCGGAAA GTACGCGCGT GACATCGCAG GGTTAATGAA TATTAGAGAA GCAGAACTGG CATTTGCACG CGTCACGCAC GATGCGTGGC GGATGCACGG CGATATCCGT GAAATTCTGG CGGCGCTCGA AAGTGTTGGC GAAACCAAAT GTATTTGTCG TAATGAATAT GCAGTCCATG AGCAAGTTGG TACGTTCACA AACCAGCATT TGAACGGACA TGCCGGATTG ATCCTCAATC CGCGCGCGCT GGATTTACGT CTGTTTCTCA ATCAATGGGC CAGTGTTTTC CACATCAAAG AAAACACGGC TCGTGGCGAA CGCCAGAGTA TTCAGTTCTT TGATCATCAG GGCGATGCAT TACTAAAAGT TTATGCCACC GACAATACCG ATATGGCGGC ATGGAGTGAG CTTCTGGCAC GGTTTATCAC CGATGAGAAT ACGCCGCTTG AGTTAAAAGC CGTTGATGCG CCAGTTGTTC AAACGCGAGC CGATGCCACT GTGGTCGAGC AAGAGTGGCG GGCGATGACC GACGTTCATC AGTTTTTTAC GTTGCTCAAG CGCCACAACC TGACGCGCCA ACAGGCGTTC AATCTGGTGG CAGACGATTT GGCCTGCAAA GTATCCAACA GTGCGTTGGC GCAAATTCTT GAATCTGCAC AGCAGGATGG TAATGAAATC ATGGTGTTTG TTGGCAACCG TGGCTGCGTA CAGATTTTCA CCGGTGTGGT AGAAAAAGTG GTGCCAATGA AAGGTTGGCT GAATATTTTC AACCCGACGT TTACTCTTCA TCTATTAGAA GAGAGCATTG CTGAAGCCTG GGTTACCCGT AAACCGACCA GCGATGGCTA CGTAACCAGT CTGGAATTGT TTGCCCATGA TGGTACGCAG ATAGCGCAAC TTTATGGTCA ACGTACAGAA GGCGAACAGG AGCAAGCGCA ATGGCGTAAG CAAATTGCTT CGCTGATACC GGAAGGCGTT GCTGCATAA
|
Protein sequence | MNHYTRWLEL KEQNPGKYAR DIAGLMNIRE AELAFARVTH DAWRMHGDIR EILAALESVG ETKCICRNEY AVHEQVGTFT NQHLNGHAGL ILNPRALDLR LFLNQWASVF HIKENTARGE RQSIQFFDHQ GDALLKVYAT DNTDMAAWSE LLARFITDEN TPLELKAVDA PVVQTRADAT VVEQEWRAMT DVHQFFTLLK RHNLTRQQAF NLVADDLACK VSNSALAQIL ESAQQDGNEI MVFVGNRGCV QIFTGVVEKV VPMKGWLNIF NPTFTLHLLE ESIAEAWVTR KPTSDGYVTS LELFAHDGTQ IAQLYGQRTE GEQEQAQWRK QIASLIPEGV AA
|
| |