Gene Nther_1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1687 
Symbol 
ID6315551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1761827 
End bp1762975 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content34% 
IMG OID642644063 
ProductThiamin pyrophosphokinase catalytic region 
Protein accessionYP_001917849 
Protein GI188586304 
COG category[S] Function unknown 
COG ID[COG4825] Uncharacterized membrane-anchored protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.677284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00191184 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGATCAGTC CAATCAAGCA AGGGAAAATA AAAGGAAGAG GGAAATTAGG ATTTAAAACT 
AAAGATTTAA CACCACTGCT TGATTCCGGT GATATAGCTA TTATAAGACA TGAAGATATT
GATGAAATAG CCGCCAGATC ATTGTGTGAA GCAAATATAA AGGCTGTTAT TAATCTCAGT
GACAGCATGA CTGGATATTA TCCTAATCAA GGTCCGAGGG TATTTATTGA ACACAGTGTT
CCATTGATTG ATCAAGTTGA TGAAAATATT ATAGATAAAT TAAAACCAGA AAGAGATATA
TTCATAGATG GTGAGTGTAT ATATCAAGAT GACGAATTAA TAGGTAGAGG TCGAATTGTA
AATTTACAAG TTATTAGCCA GTTAGAAAAA ATCGCTGAAA ACAATTTTGA AACTCGCCTG
AGAGAGTTCG TAGAAAACAC TTTTCAATAT GCAATTAAAG AAAAAGATTT AATGTTAAAA
GATCTGTATA CCGATGAGTT ATCATCTGAT CTTGTTAATT TATTTTATAA TAGACATTGT
GTGATTGTTG TTAGAGGTAA AGATTATCGT AAAGATTTAG ATGCCATCTC AGAATATATA
AATGAAGAAA ATCCTGTACT TATTGGGGTT GATGGAGGAG CAGATGCATT AATAGAATAT
GGCTTCTCAC CTGATATTGT AATAGGAGAT ATGGATAGTA TTTCAGATTA TGCTCTTAAT
AAAATCAAAA ATCGAATTGT GCATGCTTAC CCCAATGGAA GTGCGCCTGG TGCGAAAAGA
TTAAAAAAAC TAGGTTTAGA TTATAATGAT ATCCCTGCAC CTGGGACCAG TGAGGATCTA
GCTTTATTAC TGGCTTACCA AATGGAAGCT CGTCTATTAG TAGCTTTGGG GACTCATACT
CATGTAATTG ACTTTTTAGA AAAGGGCCGT CAGGGAATGG CAAGTACTTT TTTAGCAAGA
CTAAAGGTTG GTGAAAAATT AGTTGATGCC AAGGGAGTTA GTCAACTTTA TCGGCCGAGA
GTTAAGTTAC AATCTTTAGG TTTAATTGTA GTTTCAGCCA TATTACCTGT AGTGATATTA
GCTATGCTTT CCCCAATAGT GGCGCATTTT TTTAGGTTAT TGTATCTATA TGTACGTTTG
TTTTTCTGA
 
Protein sequence
MISPIKQGKI KGRGKLGFKT KDLTPLLDSG DIAIIRHEDI DEIAARSLCE ANIKAVINLS 
DSMTGYYPNQ GPRVFIEHSV PLIDQVDENI IDKLKPERDI FIDGECIYQD DELIGRGRIV
NLQVISQLEK IAENNFETRL REFVENTFQY AIKEKDLMLK DLYTDELSSD LVNLFYNRHC
VIVVRGKDYR KDLDAISEYI NEENPVLIGV DGGADALIEY GFSPDIVIGD MDSISDYALN
KIKNRIVHAY PNGSAPGAKR LKKLGLDYND IPAPGTSEDL ALLLAYQMEA RLLVALGTHT
HVIDFLEKGR QGMASTFLAR LKVGEKLVDA KGVSQLYRPR VKLQSLGLIV VSAILPVVIL
AMLSPIVAHF FRLLYLYVRL FF