Gene Nther_1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1917 
Symbol 
ID6315297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2012273 
End bp2013376 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content37% 
IMG OID642644299 
Productthiazole biosynthesis protein ThiH 
Protein accessionYP_001918076 
Protein GI188586531 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTA GAGATTTTAT TAAAAAATAC CAACAGCTAG ACTTTCAGCA AACTTTTCAA 
GATATCACTC CCCAAAGGGT TGAAACCGCA ATTTATAAAG ACAATCCAAA CTTTAGAGAC
TTTTTAGCCA TGCTATCGCC AGCCGCAGAA AATTACCTGG AAGAAATGGC GCAAAAAGCT
AATCAACTCA CCACAAATTT TTTTGGAAAA GCTATCGTTT TGTATGCTCC CATTTACGTT
AGTGATCATT GTGATAATAA CTGCCTTTAT TGTGCTTTTA AAGTAGATAA TCAGTTTCAA
AGAACTACTT TAAGCCTTGA AGAAGTTGAA CAAGAAGCCC AAGCTATCAG CCATACTGGT
CTTCGCCATA TATTGCTTTT AACTGGTGAA TCTAAACCAC ATGCACCCCT AGATTATATA
GAAAAATGTA TTGATATTTT AAAAAAATAC TTTTCTTCTA TAGCTATAGA AATCTATCCA
TTGACAGCCA AAGAATATAA ACAGTTAATT GACAATGAAG TTGATGGACT TACAATCTAC
CAAGAGGTTT ACGACGAAGA TATTTATCAA CAAGTTCATA AAAGCGGCCC AAAACGCGAT
TATGATTTTC GATTACTAGC CCCGGAACGC AGCCTACAAA TGGGCATGAG AAGGGTTAAC
ATAGGTGCAT TATTTGGTTT GGGCCCCTGG AGGCAGGAAG CTTTTTTTAC TGGTCTACAT
GCTTGGTACT TATTAAATCA TTATCCGGAA GCTGAAATTT CCATATCTTT TCCCCGCTTA
AGACCTTTTG CTGATGAAAC CTTGGAATAT TATAGAGTTG CGGATAAAAA CTTGGTTCAA
ATGATAGTAG CCACAAGAAT TTTTTTACAC AGTGTGGGGA TCAATATTTC TACTAGAGAA
AGCCCTGATT TAAGAGAAAA TCTGCTTCCC CTAGGAGTAA CTAGAATGTC GGCAGGAGCT
AAGACAGCTG TAGGAAGCTA TTCCGGTGTA GAAAATAGCG AGTCTCAGTT TCATACCGCC
GATGAACGTT CAGTAGTAGA GATCAAAGGT ATGTTAATAA ATAATGGTTA TCAACCGGTA
TTAAAAGACT GGGAATTAAT TTAG
 
Protein sequence
MSFRDFIKKY QQLDFQQTFQ DITPQRVETA IYKDNPNFRD FLAMLSPAAE NYLEEMAQKA 
NQLTTNFFGK AIVLYAPIYV SDHCDNNCLY CAFKVDNQFQ RTTLSLEEVE QEAQAISHTG
LRHILLLTGE SKPHAPLDYI EKCIDILKKY FSSIAIEIYP LTAKEYKQLI DNEVDGLTIY
QEVYDEDIYQ QVHKSGPKRD YDFRLLAPER SLQMGMRRVN IGALFGLGPW RQEAFFTGLH
AWYLLNHYPE AEISISFPRL RPFADETLEY YRVADKNLVQ MIVATRIFLH SVGINISTRE
SPDLRENLLP LGVTRMSAGA KTAVGSYSGV ENSESQFHTA DERSVVEIKG MLINNGYQPV
LKDWELI