Gene Htur_1671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1671 
Symbol 
ID8742265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1734302 
End bp1735480 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content69% 
IMG OID646512249 
Productthiamine biosynthesis protein 
Protein accessionYP_003403229 
Protein GI284164950 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCCC CGGGAGCCGA TACCGTCCTC GTTCGTCACG GGGATCTCAA CACCAAGAGC 
AACACCGTCA AGCGGTACAT GGTGGACGTC CTCTGTGAGA ACCTCGAGGC CCTCCTCGCG
GACCGCTCGA TCCCAGGCGA CGTCGAGCGC AAGTGGAATC GACCGCTGAT CCACACGACC
GAGGACGCCG TCGAGGAGGC AACCGACGCG GCCACCGACG CCTTCGGCGT CGTCTCGGCC
AGCCCCGCCC TGACCGTCAG TACCGAGAAA GAACGGATCA TCGAGGCGCT GACCGAGGCC
GCCCGCGAGT GTTACGACGG CGGAGCGTTC GCGGTCGACG CCCGCCGGGC GAACAAGGAC
GTCCCCTACA GCAGCGAGGA TCTGGCCCGC GAGGGCGGCG ACGCCGTCTG GGCCGCCGTC
GAGGACGAGT TCGAGCCCGA AGTCGACCTC GACGATCCCG ACGTCACCTT CGGCGTCGAA
GTCCGCGACG AGTGCACCTA CGTCTACCTC GAGAAGCGCC CCGGACCGGG CGGACTACCG
CTCGGTTCCC AGGAGCCCGC GGTCGCGCTG GTCAGCGGCG GGATCGACTC GCCGGTCGCG
GCCTACGAGA TCATGAAGCG GGGGAGCCCG ATCGTGCCGG CCTACGTCGA CCTCGGCGAC
TACGGCGGGA TCGACCACGA AGCGCGCGCG ATGGAGACCG TCCGGCTCCT CTCCGAGTAC
GCGCCCAATT TCGACATGGA CGTCTACCGG ATTCCCGGGG GCGAGACGGT CGACCTGCTG
GTTCGAGAGA TGGACAAGGG GCGGATGCTC TCCCTGCGCC GCTTTTTCTA CCGGGCCGCC
GAGACGCTGG CCGAGCGCGT CGACGCCCAT GGGATCGTCA CCGGCGAGGC CGTCGGCCAG
AAGTCCAGCC AGACCCTCCA GAACCTCGGC GTCACCAGCC GCGCCGCCGA CCTCCCGATC
CACCGCCCGC TGCTCACCCG CGACAAGCAG GACATCGTCG CCCAGGCCCG CGAGATCGGC
ACGTTCACCG ACTCGACGAT CGACGCCGGC TGCAACCGCG TCACCCCCGA CCGCGTCGAG
ACCAACGCCC GCCTCGAGCC GCTGCTCGCA CACGAGCCCG ACGACCTCCT CGAGCGGGCC
GAGGAAGCGG CGAAGAACGC GACGCTGGTC GCGCCCTGA
 
Protein sequence
MSPPGADTVL VRHGDLNTKS NTVKRYMVDV LCENLEALLA DRSIPGDVER KWNRPLIHTT 
EDAVEEATDA ATDAFGVVSA SPALTVSTEK ERIIEALTEA ARECYDGGAF AVDARRANKD
VPYSSEDLAR EGGDAVWAAV EDEFEPEVDL DDPDVTFGVE VRDECTYVYL EKRPGPGGLP
LGSQEPAVAL VSGGIDSPVA AYEIMKRGSP IVPAYVDLGD YGGIDHEARA METVRLLSEY
APNFDMDVYR IPGGETVDLL VREMDKGRML SLRRFFYRAA ETLAERVDAH GIVTGEAVGQ
KSSQTLQNLG VTSRAADLPI HRPLLTRDKQ DIVAQAREIG TFTDSTIDAG CNRVTPDRVE
TNARLEPLLA HEPDDLLERA EEAAKNATLV AP