Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3801 |
Symbol | hmuS |
ID | 6145970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3866897 |
End bp | 3867925 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641618627 |
Product | hemin transport protein HmuS |
Protein accession | YP_001745767 |
Protein GI | 170682449 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3720] Putative heme degradation protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.189206 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCACT ACACACGCTG GCTTGAGTTA AAAGAACAAA ATCCCGGAAA GTACGCGCGT GACATCGCAG GTTTAATGAA TATCAGCGAA GCAGAACTGG CATTTGCGCG CGTCACGCAC GACGCGTGGC GGATGCGCGG CGATATCCGT GACATTCTGG CGGCGCTCGA AAGTGTTGGC GAGACCAAAT GTATTTGCCG TAATGAATAT GCAGTCCATG AGCAAGTTGG TGCGTTCACA AACCAGCATT TGAACGGACA TGCCGGATTG ATCCTCAATC CGCGCGCGCT GGATTTACGT CTGTTTCTCA ATCAATGGGC CAGTGTTTTC CACATCAAAG AAAACACGGC TCGTGGCGAA CGCCAGAGTA TTCAGTTCTT TGATCATCAG GGCGATGCAT TACTAAAAGT TTATGCCACC GACAATACCG ATATGGCGGC ATGGAGTGAG CTTCTGGCAC GGTTTATCAC CGATGAGAAT ACGCCGCTTG AGTTAAAAGC CGTTGATGCG CCAGTTGTTC AAACGCGAGC CGATGCCAGT GTGGTCGAGC AAGAGTGGCG GGCGATGACC GACGTTCATC AGTTTTTTAC GTTGCTCAAG CGCCACAACC TGACGCGCCA ACAGGCGTTC AATCTGGTGG CAGACGATTT GGCCTGCAAA GTATCCAACA GTGCGTTGGC GCAAATTCTT GAATCTGCAC AGCAGGATGG CAATGAAATC ATGGTGTTTG TTGGCAACCG TGGCTGCGTA CAGATTTTCA CTGGCGTGGT AGAAAAAGTG GTGCCAATGA AAGGTTGGCT GAATATTTTT AACCCGACGT TTACTCTTCA TCTATTAGAA GAGAGCATTG CTGAAACCTG GGTAACCCGT AAACCTGCTA GTGACGGTTA TGTGACCAGT CTGGAATTGT TTGCCCATGA TGGTACGCAG ATAGCGCAAC TTTATGGTCA ACGTACAGAA GGCGAACAGG AGCAAGCGCA ATGGCGTAAG CAAATTGCTT CGCTGATACC GGAAGGCGTT ACTGCATAA
|
Protein sequence | MNHYTRWLEL KEQNPGKYAR DIAGLMNISE AELAFARVTH DAWRMRGDIR DILAALESVG ETKCICRNEY AVHEQVGAFT NQHLNGHAGL ILNPRALDLR LFLNQWASVF HIKENTARGE RQSIQFFDHQ GDALLKVYAT DNTDMAAWSE LLARFITDEN TPLELKAVDA PVVQTRADAS VVEQEWRAMT DVHQFFTLLK RHNLTRQQAF NLVADDLACK VSNSALAQIL ESAQQDGNEI MVFVGNRGCV QIFTGVVEKV VPMKGWLNIF NPTFTLHLLE ESIAETWVTR KPASDGYVTS LELFAHDGTQ IAQLYGQRTE GEQEQAQWRK QIASLIPEGV TA
|
| |