Gene Huta_2472 details

Gene Information

Locus tag: Huta_2472
Symbol:
ID: 8384774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2546225
End bp: 2547670
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 66%
IMG OID: 644973546
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_003131369
Protein GI: 257053536
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
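
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon. Both are assumptions about this record, not something the page states, and the function names are hypothetical helpers.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2546225, 2547670)
print(length, protein_length_aa(length))  # 1446 481
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.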


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.709563
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGGAA CCCAACTGCA GCGCGCACGC AACGGTGAGA TCACGCCGGC GATGGAGCGC 
GTCGCCGAGC GGGAGAACCG TGATCCCGAG TACGTCCGCG AGAAGGTCGC CGCGGGCGAG
GCAGTGATCC CGAACAACCA CGAGCACGAC TCGCTGGATC CGATGATTAT CGGGAAGGAC
TTTTCGACGA AAGTCAACGC CAACATCGGC AACAGCGAAC CCGAAAGCGA TATCGAAACC
GAACGGGAGA AACTCCACAC GGCCGTCGAA TACGGCGCGG ACACGGTGAT GGATCTCTCG
ACGGGTGGTG ACATTCCGGA ACTTCGCGCC GGCCAGATCG AACACTCGCC GGTCCCGCTC
GGGACAGTGC CGATCTACGA GGCGCTAAAG CAGGCCGGAT CACCCGAGGA ACTCACGGCG
GACCTCCTCC TGGACGTGAT CGAGTCCCAG GCCGCCCAGG GCACGGACTA TCAGACGATT
CATGCGGGGG TTCTCCGGGA GCACTTGCCG CTGACCGACG GCCGGATCAC GGGCATCGTC
TCCCGCGGTG GGTCGATTCT CGCCGAATGG ATGGAAAATC ACGGCGAGCA GAACCCGCTG
TACACGCATT TCGACGAGAT CTGTGCGATC CTTGCGGAAT ACGACGTGAC GATCAGTCTC
GGCGACGGGC TCCGACCCGG GAGCCTGGCA GACGCCAACG ACGACGCCCA GCTGGCGGAA
CTGGAGACGC TGGGTGACCT CACCGAGCGC GCCCACGAGC ACGGCGTCCA GGTCATGGTC
GAGGGGCCGG GCCACGTCCC CCTAGACGAG ATCGGCGAAC AGGTCGAGTA TCAACAGGAG
GTCTGTGACG GCGCGCCGTT CTACCTGCTG GGCCCGCTGG TGACTGACGT TGCGCCGGGG
TACGACCATA TCACGAGTGC GATCGGCGCA ACCGAGGCCG CCCGTCACGG CGCGGCGATG
CTCTGCTACG TCACACCCAA AGAGCACCTC GGTTTGCCCG ACGCCGAGGA CGTCCGGGAC
GGGCTGGCGG CCTACCGGAT CGCCGCTCAT GCGGGCGACG TGGCCGCCGG GAAGCCAGGC
GCTCGCGACT GGGACGACGC GCTTTCCCAG GCGCGGTACA ACTTCGACTG GCGCGAGCAG
TTCAACCTCG CGCTGGACCC CGACCGAGCC AAGGAGTATC ACGACCAGAC GCTCCCGGAG
GACAACTACA AGGAAGCCCG CTACTGCTCG ATGTGTGGCG CGGAATTTTG CTCGATGCGG
ATCGACCAGG ACGCCCGTGA GGGGGAGGAG ATGGAGGGAC TGGACGCCGA CGTCGATCTC
GCGGACTCGC CCGCAGCCGA CGTCAATCTG CCGCCGACGG GCAAACACGA TACAAGTGGC
CTGCCCACGG TGCCCGAGGC GTTGTGTGAC CACGCCGGGG GCGAGGGTTT GCCGGGCGAC
GACTGA
 
Protein sequence
MMGTQLQRAR NGEITPAMER VAERENRDPE YVREKVAAGE AVIPNNHEHD SLDPMIIGKD 
FSTKVNANIG NSEPESDIET EREKLHTAVE YGADTVMDLS TGGDIPELRA GQIEHSPVPL
GTVPIYEALK QAGSPEELTA DLLLDVIESQ AAQGTDYQTI HAGVLREHLP LTDGRITGIV
SRGGSILAEW MENHGEQNPL YTHFDEICAI LAEYDVTISL GDGLRPGSLA DANDDAQLAE
LETLGDLTER AHEHGVQVMV EGPGHVPLDE IGEQVEYQQE VCDGAPFYLL GPLVTDVAPG
YDHITSAIGA TEAARHGAAM LCYVTPKEHL GLPDAEDVRD GLAAYRIAAH AGDVAAGKPG
ARDWDDALSQ ARYNFDWREQ FNLALDPDRA KEYHDQTLPE DNYKEARYCS MCGAEFCSMR
IDQDAREGEE MEGLDADVDL ADSPAADVNL PPTGKHDTSG LPTVPEALCD HAGGEGLPGD
D
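
The gene and protein sequences are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the amino acid assignments of translation table 11 (the same codon-to-residue mapping as the standard code) and ignores the table's alternative start codons, which does not matter here because the sequence begins with ATG. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in this record.
first_line = ("ATGATGGGAA CCCAACTGCA GCGCGCACGC "
              "AACGGTGAGA TCACGCCGGC GATGGAGCGC")
print(translate(first_line))  # MMGTQLQRARNGEITPAMER
```

The translated prefix matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the Protein Length and GC content fields.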