Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1917 |
Symbol | |
ID | 6315297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 2012273 |
End bp | 2013376 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642644299 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_001918076 |
Protein GI | 188586531 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTTA GAGATTTTAT TAAAAAATAC CAACAGCTAG ACTTTCAGCA AACTTTTCAA GATATCACTC CCCAAAGGGT TGAAACCGCA ATTTATAAAG ACAATCCAAA CTTTAGAGAC TTTTTAGCCA TGCTATCGCC AGCCGCAGAA AATTACCTGG AAGAAATGGC GCAAAAAGCT AATCAACTCA CCACAAATTT TTTTGGAAAA GCTATCGTTT TGTATGCTCC CATTTACGTT AGTGATCATT GTGATAATAA CTGCCTTTAT TGTGCTTTTA AAGTAGATAA TCAGTTTCAA AGAACTACTT TAAGCCTTGA AGAAGTTGAA CAAGAAGCCC AAGCTATCAG CCATACTGGT CTTCGCCATA TATTGCTTTT AACTGGTGAA TCTAAACCAC ATGCACCCCT AGATTATATA GAAAAATGTA TTGATATTTT AAAAAAATAC TTTTCTTCTA TAGCTATAGA AATCTATCCA TTGACAGCCA AAGAATATAA ACAGTTAATT GACAATGAAG TTGATGGACT TACAATCTAC CAAGAGGTTT ACGACGAAGA TATTTATCAA CAAGTTCATA AAAGCGGCCC AAAACGCGAT TATGATTTTC GATTACTAGC CCCGGAACGC AGCCTACAAA TGGGCATGAG AAGGGTTAAC ATAGGTGCAT TATTTGGTTT GGGCCCCTGG AGGCAGGAAG CTTTTTTTAC TGGTCTACAT GCTTGGTACT TATTAAATCA TTATCCGGAA GCTGAAATTT CCATATCTTT TCCCCGCTTA AGACCTTTTG CTGATGAAAC CTTGGAATAT TATAGAGTTG CGGATAAAAA CTTGGTTCAA ATGATAGTAG CCACAAGAAT TTTTTTACAC AGTGTGGGGA TCAATATTTC TACTAGAGAA AGCCCTGATT TAAGAGAAAA TCTGCTTCCC CTAGGAGTAA CTAGAATGTC GGCAGGAGCT AAGACAGCTG TAGGAAGCTA TTCCGGTGTA GAAAATAGCG AGTCTCAGTT TCATACCGCC GATGAACGTT CAGTAGTAGA GATCAAAGGT ATGTTAATAA ATAATGGTTA TCAACCGGTA TTAAAAGACT GGGAATTAAT TTAG
|
Protein sequence | MSFRDFIKKY QQLDFQQTFQ DITPQRVETA IYKDNPNFRD FLAMLSPAAE NYLEEMAQKA NQLTTNFFGK AIVLYAPIYV SDHCDNNCLY CAFKVDNQFQ RTTLSLEEVE QEAQAISHTG LRHILLLTGE SKPHAPLDYI EKCIDILKKY FSSIAIEIYP LTAKEYKQLI DNEVDGLTIY QEVYDEDIYQ QVHKSGPKRD YDFRLLAPER SLQMGMRRVN IGALFGLGPW RQEAFFTGLH AWYLLNHYPE AEISISFPRL RPFADETLEY YRVADKNLVQ MIVATRIFLH SVGINISTRE SPDLRENLLP LGVTRMSAGA KTAVGSYSGV ENSESQFHTA DERSVVEIKG MLINNGYQPV LKDWELI
|
| |