Gene Dshi_0574 details

Gene Information

Locus tag: Dshi_0574
Symbol: hmuS
ID: 5712027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 563756
End bp: 564787
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 66%
IMG OID: 641266476
Product: hemin transport protein hmuS
Protein accession: YP_001531921
Protein GI: 159043127
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3720] Putative heme degradation protein
TIGRFAM ID:
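
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 563756
end_bp = 564787

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1032 343
```

Both results agree with the listed Gene Length (1032 bp) and Protein Length (343 aa).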


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.790836
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGATG CAAAAGAGAT CCGGGAGGCG CGCACCCACA AGGCCGGGCG CGCGCGCGAC 
ATAGCCCAGG CGCTCGGGCT GCCGGAGGCC GCGCTGGTCG CGGCGCAGGT CGGGCATGAT
GCCGTGGCGC TGCGCCCGCA TCCAAACGAC CTGATCCCGG CCCTCGGGGC CCTCGGCCCG
ATGATGGCGC TCACGCGCAA CGACGCCTGC GTCATCGAAA AGGATGGGGA ATATACCGAC
TACCACGGGG GCGATCACGC GACGATGACC CTCAACGAGG GGATCGATTT GCGGATGTTT
CCGCGCCACT GGGTGCATGC CTTCGCGGTC TCCGAGCAGG TCAAGAGCGG TCTGCGCCAC
AGCGTGCAGG TGTTCGACGC CGCAGGCGAT GCGGTGCACA AGGCCTATCT GCGCGACGGC
GCGGACATGG CGGCCTGGAC ACGGCTGCAA TCGGACCTCG CCCTGCCCGC GCAAACCGAC
ACCTTGGCCC TGAAGGATCG GGAGCCGCCC GAGGGCGCGC GGATCAATCT CGACAAGCGC
GACATCCTGC TCAAGGAGTG GGCACGGCTC ACCGACACTC ACCAATTCCT GCGCCTCTGC
GCCAAGCTGA AAATGAACCG GTTGGGCGCC TATCGGATTG CGGAACCCCC CTTCGTGCGG
CCGCTTGCGC CTTCGGCAGT GGACACGATG TTGCGCGCGA TACAGGTTGC GGGATTCGAG
ATCATGCTGT TCGTCGGCAA TCGCGGCTGC ATCGAAATCC ACACCGGCCC CCTTCGGCGG
ATAGAGCCGA TGGGCCCCTG GGTGAACGTG CTGGACCCGG ACTTCAACCT CCATCTGCGC
GGCGACAAGG TCGCGGAGGT CTGGCAGGTC GAAAAGCCGA CACAACGCGG CCCGGCTGTC
TCGGTCGAGG CGTTCGACGC CGACGGTGTC CTGATCCTTC AGGCGTTCGG CGTTCCGAAG
GAAGGCAAGG ATACCCGCAC CGCGTTCACC GAGATCGTCA ACGGCTTGCC GACACAGGAG
ACCACGGCAT GA
 
Protein sequence
MLDAKEIREA RTHKAGRARD IAQALGLPEA ALVAAQVGHD AVALRPHPND LIPALGALGP 
MMALTRNDAC VIEKDGEYTD YHGGDHATMT LNEGIDLRMF PRHWVHAFAV SEQVKSGLRH
SVQVFDAAGD AVHKAYLRDG ADMAAWTRLQ SDLALPAQTD TLALKDREPP EGARINLDKR
DILLKEWARL TDTHQFLRLC AKLKMNRLGA YRIAEPPFVR PLAPSAVDTM LRAIQVAGFE
IMLFVGNRGC IEIHTGPLRR IEPMGPWVNV LDPDFNLHLR GDKVAEVWQV EKPTQRGPAV
SVEAFDADGV LILQAFGVPK EGKDTRTAFT EIVNGLPTQE TTA
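
Both sequences above are printed in blocks of 10 residues, so the grouping whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names `clean_sequence`, `gc_percent` and `translate` are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code. It does not handle the alternative start codons that table 11 permits; this gene begins with ATG, so that case does not arise here.

```python
from itertools import product

# Amino acids for the 64 codons enumerated in TCAG order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code and differs only in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Drop the spaces and line breaks that group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from above.
first_line = clean_sequence(
    "ATGTTAGATG CAAAAGAGAT CCGGGAGGCG CGCACCCACA AGGCCGGGCG CGCGCGCGAC"
)
print(len(first_line), translate(first_line[:30]))  # 60 MLDAKEIREA
```

The first ten codons translate to MLDAKEIREA, matching the start of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the listed GC content and protein sequence to be checked the same way.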