Gene Nther_0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0304 
Symbol 
ID6316137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp321285 
End bp322886 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content40% 
IMG OID642642690 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001916490 
Protein GI188584945 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.734599 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAAGG TTGTATCTCC AGAGGAAATG GCTCAAATTG ATAAAATGGC CATAGATGAG 
GGTAGAATTC CCGGGATAGT TCTGATGGAA AATGCCGCTA CTGCCGTTAC TAATGTAGTC
CTAAATTTCT TAAACAACAT GAAGGCCGAC TATAATCAAA CTCAGGTCAC TGTTTTAGCC
GGGATTGGAA ATAACGGTGG GGATGGATTT GCAGTAGCTC GACAGCTGGC CATGAAAGAA
ATTAATGTTA GCTTGGTTTT AATCGGTAAG GCCGATAAAC TTTCAGGAGA TGCCCTGACA
AATTGGGAAA TTATCAAGCA TAGAGATGAC ATTAAAATAC ATACAATAGA CCAAAGAAGT
ACGGATAATT TAACTTCTCT GAACAATCTA ATTATTGAAT CAGATATTAT CCTTGATGCA
TTACTCGGTA CAGGTTTAGC AGGTGCACCT AAAGAACCAT TTAATACCTG TATTCAAATT
GCCAACCAAA GTAGAAGAAA AGATTGTCTA ACTATCAGTG TAGATATTCC TTCAGGGGTT
TCTGGCTCAG GAGGAGAAGT TGATGGTAAT GCCGTCATGG CAGATATAAC AGTAACCTTC
GCTCAACCAA AAACAGGTTT ATTATTTTAT CCAGGAGCCC ATTTTACTGG AGAATTACTG
ACTGTGCCTA TAGGGATTCC CAATTGGATT GTAACGAAAA ACGAATCACA AAATTATCTA
GTCACCGAAG GTAGTGCGGC TAACCTACTT CCACAGCGCT TGCCTGATAC CCATAAAGGT
CATTATGGTA GAGTGACTAT TATAGCAGGT TCTAGCCATA TGTTGGGTGC TGGGGTATTA
GTATCAATGG CCGCTGTAAA AAGTGGATCT GGACTCGTTA CATGGGCTGT ACCTGAATCA
GAAGAAAGAA CGGCTCAATC AAAAGTAGGT TCAGAAGTTA TGACTTGGTG TTTACCATCA
AAGGACGGTG TCTTAGCACA ACAAGTCCAC GAAAAATTAG AAAAAACTGT CACTGGTTTA
GAACACGGTC CAGATGTTAT GGCTGTGGGT CCAGGCCTGG GGACGGAAAA AGGAACTGAA
GAAGTTGTAA AAAAAGCTCT GTTAGATTTT AACACTTCCC TTGTATTGGA TGCAGATGGC
TTAAATGTTT TAGTGGGGAA TACTAATTGG CTTAGTAGTT CTGCTACACC TAAAGTTCTA
ACACCACACA TTGGAGAATA TAGTCGGTTA ATAGATAAAT CAATAGATGA AATTAAATCT
AATCCAATAA ATCTGGTGAC ACAAAGTGCT CAGGAATGGA ATTCTGTTAT CGTACTTAAA
GGGACACCGA CTATAATTGC TTCTCCCCAT GGTAGTTCTT ATATTGTATC CACCAGTAAT
TCCGGGATGG CAACGGGAGG TAGTGGTGAC GTATTAACAG GTATCATTAC TTCTTTAATT
GGTCAAGGTT TGACTGTTGA AGAAGGTGCC ATTCTTGGTG TATATCTCCA TAAAATTGCT
GGTGAAAAAG CTAGATCCCA AAGTGGAGAA GCAAGTATGA CAGCAACTGA TCTGTGGAAT
TCATTACCAG ATGCATTTCA ATATCTTAAC GATAAAAGTT AA
 
Protein sequence
MLKVVSPEEM AQIDKMAIDE GRIPGIVLME NAATAVTNVV LNFLNNMKAD YNQTQVTVLA 
GIGNNGGDGF AVARQLAMKE INVSLVLIGK ADKLSGDALT NWEIIKHRDD IKIHTIDQRS
TDNLTSLNNL IIESDIILDA LLGTGLAGAP KEPFNTCIQI ANQSRRKDCL TISVDIPSGV
SGSGGEVDGN AVMADITVTF AQPKTGLLFY PGAHFTGELL TVPIGIPNWI VTKNESQNYL
VTEGSAANLL PQRLPDTHKG HYGRVTIIAG SSHMLGAGVL VSMAAVKSGS GLVTWAVPES
EERTAQSKVG SEVMTWCLPS KDGVLAQQVH EKLEKTVTGL EHGPDVMAVG PGLGTEKGTE
EVVKKALLDF NTSLVLDADG LNVLVGNTNW LSSSATPKVL TPHIGEYSRL IDKSIDEIKS
NPINLVTQSA QEWNSVIVLK GTPTIIASPH GSSYIVSTSN SGMATGGSGD VLTGIITSLI
GQGLTVEEGA ILGVYLHKIA GEKARSQSGE ASMTATDLWN SLPDAFQYLN DKS