Gene Hmuk_2307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2307 
Symbol 
ID8411848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2226397 
End bp2227584 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content72% 
IMG OID645020650 
Productthiamine biosynthesis protein 
Protein accessionYP_003178126 
Protein GI257388353 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0561858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCCGC CGGGAGCCGA CATCGTCCTC GTTCGCCACG GCGAGATCGG GACCAAGAGC 
GAGCAGGTCC GTCGATCCAT GGAGGAGCGG CTGGCGCGGA ACCTCTCGGC CCTGCTCGCG
GATCGCGGCG TCGACGGCTC CGTCGAGCGC GAACGGACGC GGCTGTTCGT CCACAGCGAC
GAGCCCAGCG CCGCCGTCGG GGCCGCGACA GACACCTTCG GCGTCGTCTC TGCCAGCGCT
GCCGTCCGCA CCGAGCCGAC GCTCGACGCC ATCTGTGAGG GGCTGGCAGC GATCGCGCGC
GAGCGATTCG ACGGTGGCAC CTTCGCCGTC GACGCCCGCC GGGCGGGCCA GCAGTCCGCC
CACGACTTCT CCAGCGAGGA CATCGAGTCC GACGGCGGCG CTGCCGTGTG GGCCGCGATC
GAAGCGGCGG GCGGCGATCC GGCGGTCGAT CTCGACGAGC CGGACCTGAC GTTCCACGTC
GAGTGTCGTC GGGAGGTCGC GTACCTGTTT CTCGACAAGC AGGCGGGTCC CGGCGGGCTT
CCCGTCGGGA CTCAAGAGCC CGTCGTCGTC TTGCTCTCCG GGGGAATCGA CTCTCCCGTC
GCGGCCTGGA AGCTGCTCAA GCGCGGCTGT CCCGTCGTGC CGGTGTACGT CGATCTCGGG
GCGTACGGCG GTCCGGACCA CCGCGCGCGA GCCCTCTCGA CCGCCGAGAC GCTCGCCAGC
TACGTCCCGA ACGTCGACCT CTCGGTGCGG GTCGCCGACG GAGGGGCCGT CGTGGAGCGG
CTCGCAGACG AGCTGGACGC ACGGCGGATG CTCGCCCTGC GGCGGTTCAT GCTCGCCGTC
GGGGCCCGGG TCGCCGAGCG TACCGACGCC GTCGGCGTGG CGACCGGCGA GGCGATCGGA
CAGAAGTCGA GCCAGACGAG CGCCAACCTC GCGGTGACGG ACGTGGCCGT CGACTGTCCC
GTCTTCCGGC CGAACCTGAC CGCCGACAAG GCCGACATCA CGCAGCTGGC CCGCGAGATC
GGGACCTTCG AGGCGTCGAC GATCCCGACC GGCTGTAACC GCGTCGCGCC GTCGCTGCCC
GAGACGAACG CCGACCTGCA CGCCGTCCGC GAACGCGAAC CGGACGACCT GTTCGAGCGT
GCGAGGGCCG TCGCTGACCG CGCCGAAGTC GTCGCCCTCG ACCGCTGA
 
Protein sequence
MHPPGADIVL VRHGEIGTKS EQVRRSMEER LARNLSALLA DRGVDGSVER ERTRLFVHSD 
EPSAAVGAAT DTFGVVSASA AVRTEPTLDA ICEGLAAIAR ERFDGGTFAV DARRAGQQSA
HDFSSEDIES DGGAAVWAAI EAAGGDPAVD LDEPDLTFHV ECRREVAYLF LDKQAGPGGL
PVGTQEPVVV LLSGGIDSPV AAWKLLKRGC PVVPVYVDLG AYGGPDHRAR ALSTAETLAS
YVPNVDLSVR VADGGAVVER LADELDARRM LALRRFMLAV GARVAERTDA VGVATGEAIG
QKSSQTSANL AVTDVAVDCP VFRPNLTADK ADITQLAREI GTFEASTIPT GCNRVAPSLP
ETNADLHAVR EREPDDLFER ARAVADRAEV VALDR