Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1671 |
Symbol | |
ID | 8742265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1734302 |
End bp | 1735480 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646512249 |
Product | thiamine biosynthesis protein |
Protein accession | YP_003403229 |
Protein GI | 284164950 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCCC CGGGAGCCGA TACCGTCCTC GTTCGTCACG GGGATCTCAA CACCAAGAGC AACACCGTCA AGCGGTACAT GGTGGACGTC CTCTGTGAGA ACCTCGAGGC CCTCCTCGCG GACCGCTCGA TCCCAGGCGA CGTCGAGCGC AAGTGGAATC GACCGCTGAT CCACACGACC GAGGACGCCG TCGAGGAGGC AACCGACGCG GCCACCGACG CCTTCGGCGT CGTCTCGGCC AGCCCCGCCC TGACCGTCAG TACCGAGAAA GAACGGATCA TCGAGGCGCT GACCGAGGCC GCCCGCGAGT GTTACGACGG CGGAGCGTTC GCGGTCGACG CCCGCCGGGC GAACAAGGAC GTCCCCTACA GCAGCGAGGA TCTGGCCCGC GAGGGCGGCG ACGCCGTCTG GGCCGCCGTC GAGGACGAGT TCGAGCCCGA AGTCGACCTC GACGATCCCG ACGTCACCTT CGGCGTCGAA GTCCGCGACG AGTGCACCTA CGTCTACCTC GAGAAGCGCC CCGGACCGGG CGGACTACCG CTCGGTTCCC AGGAGCCCGC GGTCGCGCTG GTCAGCGGCG GGATCGACTC GCCGGTCGCG GCCTACGAGA TCATGAAGCG GGGGAGCCCG ATCGTGCCGG CCTACGTCGA CCTCGGCGAC TACGGCGGGA TCGACCACGA AGCGCGCGCG ATGGAGACCG TCCGGCTCCT CTCCGAGTAC GCGCCCAATT TCGACATGGA CGTCTACCGG ATTCCCGGGG GCGAGACGGT CGACCTGCTG GTTCGAGAGA TGGACAAGGG GCGGATGCTC TCCCTGCGCC GCTTTTTCTA CCGGGCCGCC GAGACGCTGG CCGAGCGCGT CGACGCCCAT GGGATCGTCA CCGGCGAGGC CGTCGGCCAG AAGTCCAGCC AGACCCTCCA GAACCTCGGC GTCACCAGCC GCGCCGCCGA CCTCCCGATC CACCGCCCGC TGCTCACCCG CGACAAGCAG GACATCGTCG CCCAGGCCCG CGAGATCGGC ACGTTCACCG ACTCGACGAT CGACGCCGGC TGCAACCGCG TCACCCCCGA CCGCGTCGAG ACCAACGCCC GCCTCGAGCC GCTGCTCGCA CACGAGCCCG ACGACCTCCT CGAGCGGGCC GAGGAAGCGG CGAAGAACGC GACGCTGGTC GCGCCCTGA
|
Protein sequence | MSPPGADTVL VRHGDLNTKS NTVKRYMVDV LCENLEALLA DRSIPGDVER KWNRPLIHTT EDAVEEATDA ATDAFGVVSA SPALTVSTEK ERIIEALTEA ARECYDGGAF AVDARRANKD VPYSSEDLAR EGGDAVWAAV EDEFEPEVDL DDPDVTFGVE VRDECTYVYL EKRPGPGGLP LGSQEPAVAL VSGGIDSPVA AYEIMKRGSP IVPAYVDLGD YGGIDHEARA METVRLLSEY APNFDMDVYR IPGGETVDLL VREMDKGRML SLRRFFYRAA ETLAERVDAH GIVTGEAVGQ KSSQTLQNLG VTSRAADLPI HRPLLTRDKQ DIVAQAREIG TFTDSTIDAG CNRVTPDRVE TNARLEPLLA HEPDDLLERA EEAAKNATLV AP
|
| |