Gene Information |
Locus tag | Htur_1462 |
Symbol | |
ID | 8742053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1526616 |
End bp | 1527548 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646512038 |
Product | thiazole biosynthesis enzyme |
Protein accession | YP_003403021 |
Protein GI | 284164742 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1635] Flavoprotein involved in thiazole biosynthesis |
TIGRFAM ID | [TIGR00292] thiazole biosynthesis enzyme |
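
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Htur_1462.
length = gene_length_bp(1526616, 1527548)
print(length)                           # 933
print(expected_protein_length(length))  # 310
```

Under those assumptions, both results agree with the Gene Length (933 bp) and Protein Length (310 aa) fields.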
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAT TCGAGCAGTT CAGTCAGGTG GGAGAGGCCG ACGTTACCCG CGCGATCGGA CAGGAGTGGA CCGAGGAGTT CATGGACTTC TCGGACAGCG ACGTCATCAT CGTGGGCGGC GGGCCCTCGG GACTGACGGC CGCGAAGGAA CTCTCCGAGC GCGGCGTCAA GACGATGGTC GTCGAGAAGA ACAACTACCT CGGGGGCGGC TTCTGGCTCG GCGGCTTCCT GATGAACAAG GTCACCGTCC GCGACCCCGC CCAGCAGATC CTCGACGAGC TCGACGTCTC GCACAAACAG TCCGAGGACA GCGAGGGGCT CTACATCGCC AACGGCCCCG AGGCCTGTTC CGGCCTCATC AAGGCCGCCT GCGACGCCGG CGCGAAGATG CAGAACATGA CGGAGTTCAC GGACATCGTC ATCCGCGAGG ACCACAAGGT CTCGGGGATC GTCATGAACT GGACGCCGGT CCACGCGCTG CCCCGCGAGA TCACCTGCGT CGACCCGATC GCCGTCGAGG CCGACCTAGT CATCGACGCG ACGGGCCACG ACGCGATGGC CGTCAAGAAA CTCGACGAGC GCGGCGTCCT CGACGCCCCC GGTATCGCCG ACGCGAAGGA ATCGGCGACG GGCATGGACC AGACCGACGA CGACACGTAC GGTGCGCCCG GCCACGACTC GCCAGGACAC GACTCCATGT GGGTCGGCAA GTCCGAGGAC GCCGTCGTCG AGCACACCGG CCTCGTCCAC GACGGCCTGA TCGCGACGGG GATGGCCACC GCGACGACCT ACGGACTCCC GCGCATGGGC CCGACCTTCG GCGCCATGCT CGTCTCCGGC AAGCGCGCCG CCCAGGTCGC GCTCGACGAA CTCGAGGTCG ACGCCGAACC GGTCGACATC ACCTCGCGCG CGGCGACGCC GGCCGACGAC TAA
|
Protein sequence | MSEFEQFSQV GEADVTRAIG QEWTEEFMDF SDSDVIIVGG GPSGLTAAKE LSERGVKTMV VEKNNYLGGG FWLGGFLMNK VTVRDPAQQI LDELDVSHKQ SEDSEGLYIA NGPEACSGLI KAACDAGAKM QNMTEFTDIV IREDHKVSGI VMNWTPVHAL PREITCVDPI AVEADLVIDA TGHDAMAVKK LDERGVLDAP GIADAKESAT GMDQTDDDTY GAPGHDSPGH DSMWVGKSED AVVEHTGLVH DGLIATGMAT ATTYGLPRMG PTFGAMLVSG KRAAQVALDE LEVDAEPVDI TSRAATPADD
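
The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch, not a definitive implementation: it strips the whitespace, computes GC content for comparison with the GC content field, and runs a rough completeness check on a coding sequence. All function names are hypothetical. The start-codon check accepts only ATG, which is a simplification, since translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Rough check: length divisible by 3, ATG start (simplified), stop codon at the end."""
    seq = clean_sequence(raw)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy inputs only; paste the full gene sequence from the record to check it.
print(gc_percent("GC AT"))                    # 50.0
print(looks_like_complete_cds("ATG GCC TAA"))  # True
```

Passing the full gene sequence from the record to `gc_percent` gives a value that can be compared with the reported 68%, and `len(clean_sequence(...))` can be compared with the Gene Length field.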