Gene Hmuk_2297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2297 
Symbol 
ID8411838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2217530 
End bp2218957 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content67% 
IMG OID645020640 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003178116 
Protein GI257388343 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCC AGCTCCAGCG TGCACGAGAC GGAGAGATCA CGTCGGCGAT GGAACGGATC 
GCAGAGCGCG AGACCGTCGA CGCCGAGTTC GTCCGCGAGC AGGTCGCGGA CGGACAGGCA
GTGATCCCGG CGAACGTCGG CCACGAGACG CTCGACCCGA TGATCATCGG CCGGGAGTTC
TCGACGAAGG TCAACGCCAA CATCGGCAAC AGCGAGGAGA CGAGCGACCT CGACGGCGAA
CTGGAGAAGC TCCACACCGC GGTCCACTAC GGCGCGGACA CGGTGATGGA CCTCTCGACG
GGCGCGAACT TAGACGAGAT CCGCGAGGCA AACGTCGAGC ATTCGCCGGT TCCCGTCGGG
ACGGTCCCGA TCTACGAGGC CGTCAAGCGA GCGGGCAGCC CCGAGGAGAT CACCCACGAA
CTCCTGCTGG ACGTGATCGA GAAACAGGCC GAGCAGGGCG TCGACTACAT GACGATCCAC
GCGGGCGTGC TGATGGAACA CCTCCCGCTG ACCGACGGCC GCAAGACCGG GATCGTCTCT
CGCGGCGGAT CGATCATGGC CAAGTGGATG GAGGAAAACG GGATGCAGAA CCCCCTCTAC
ACGAAATACG AGGAGATCTG CGAGATCTTC CGTGAACACG ACGTGACCTT CAGTCTCGGC
GACGGCCTGC GGCCCGGCTG TATCGCAGAC GCGAGCGACG AGGCGCAGTT CGCAGAGCTG
GACACCCTGG GTGAACTGAC CCGAAAAGCG TGGGACGAGG GCGTCCAGGT GATGGTCGAG
GGACCGGGCC ACGTGCCCAT GGACGAGGTC GCGGACAACG TCGAGCGCCA GCAGGAGGTC
TGTGACGGCG CGCCGTTCTA CGTCCTCGGC CCGCTGGTGA CCGACATCGC GCCCGGCTAC
GACCACATCA CCAGCGCGAT CGGCGCGACC GAGGCCGGAC GCGCGGGCGC GGCGATGCTG
TGTTACGTCA CGCCCAAAGA GCACCTCGGC CTGCCAGAAC GAGAGGACGT GCGCGAAGGG
CTCGCGGCCT ACCGGATCGC CGCACACGCC GCCGACGTTG CCAACGGGCG CGAGGGGGCC
AGCGACTGGG ACGACGCCCT CTCGGAGGCC CGCTACGCCT TCGACTGGTC GGAGCAATTC
GAGCTCGCGC TCGACCCCGA GCGCGCGAAG GCCTACCACG ACAAGACGCT GCCGGGTGAC
AACTACAAGG ACGCCCGCTT CTGTTCGATG TGTGGCGTCG AGTTCTGCTC GATGCGGATC
GATCAGGACG CGCGGGAGGG CGACGAGATG GCGTCGATCG CCGACGAGAC CGACCTCGAA
GGATCGGCCG CCGCGTCGGT GAACCGACCG CCCGTCGGCA CCCACGACAG CGACGCCGAG
TTGCACCACC ACGAGGGGCG ACCGACAGTC GTCGGCGACG ACGACTGA
 
Protein sequence
MTTQLQRARD GEITSAMERI AERETVDAEF VREQVADGQA VIPANVGHET LDPMIIGREF 
STKVNANIGN SEETSDLDGE LEKLHTAVHY GADTVMDLST GANLDEIREA NVEHSPVPVG
TVPIYEAVKR AGSPEEITHE LLLDVIEKQA EQGVDYMTIH AGVLMEHLPL TDGRKTGIVS
RGGSIMAKWM EENGMQNPLY TKYEEICEIF REHDVTFSLG DGLRPGCIAD ASDEAQFAEL
DTLGELTRKA WDEGVQVMVE GPGHVPMDEV ADNVERQQEV CDGAPFYVLG PLVTDIAPGY
DHITSAIGAT EAGRAGAAML CYVTPKEHLG LPEREDVREG LAAYRIAAHA ADVANGREGA
SDWDDALSEA RYAFDWSEQF ELALDPERAK AYHDKTLPGD NYKDARFCSM CGVEFCSMRI
DQDAREGDEM ASIADETDLE GSAAASVNRP PVGTHDSDAE LHHHEGRPTV VGDDD