Gene Nther_0431 details

Gene Information

Locus tag: Nther_0431
Symbol:
ID: 6314418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 450413
End bp: 451663
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 36%
IMG OID: 642642815
Product: oxidoreductase domain protein
Protein accession: YP_001916615
Protein GI: 188585070
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
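
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the coordinates are inclusive and the CDS ends with a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 450413
end_bp = 451663

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")     # 1251 bp
print(protein_length, "aa")  # 416 aa
```

A non-zero remainder would suggest a partial or frameshifted CDS. Here the remainder is 0, and both results agree with the listed Gene Length and Protein Length.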


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTACA GCATAGCCTT AATTGGAGCA GGACACCGAG GAAAAGATAT CTACGGCAAG 
TTTATAAAAA ACCACTGTCC CCAATTAAAA ATTACTGCAG TGGCTGAACC CAATTTATAT
AAAAGAAAAC TAGTGGCTAG AGAACATCAA GTACCTTCAC AAAATCAATT TCACTCCTAT
GATAAGCTAC TATCTCAAAA TAAACTTGCC GATGCTTTAC TTATTACTAC AATGGATCAC
GATCATTATA AGCCGGTTAT GAAAGGTATT GACAGTGGAT ATAAAATTCT GGTGGAAAAG
CCTATAGCAC CCAATTTTCA TGAATTTACA GATATTGTTA ATAAAGCTTT TGCCACCAAG
GCAGATATCT TAGTTGCCCA TGTACTGCGA TATACTCCTT TTTATCAAAA ACTAAAGCAA
TTGCTGACAG ATGGTACAAT TGGAAAAATA AAAAGTATTC AACATATAGA AAATGTGGGT
TATTTCCATT TCGCACATAG TTATGTAAGA GGTAATTGGC GCAATGAACA AGTTTCGGCA
CCACTGTTTT TAGCAAAAAG CTCCCACGAC TTCGATTTGT TTTCATGGCT TTTAGATAGA
AAAAGCTTAA CGGTTTATGC TCAAGGAGAA CAACAATATT TTACTAGGGA AAATTCGCCT
GAGAAGGCAG GAAAGCGCTG TTTCAAATGT CAAGCTGAAA GTAACTGTCC TTATTCCGCC
AGACATATCT ATTTAGATAA ACATCTACCC TGGCCTCAAG AAATAATTGA TTCCATGCCA
CCTAGATATC AGCGCTATTT AGGAGTCCGG TTTACCAATC TGGGGAAATG CGTGTATCAA
TGTGATAATA CTATGCCCGA AATTTTAACT GCATCTTTAA ATTATGAAGA CAATATCCAG
GTCAGTTTTA CCCTCACAGG TCTATCAAAA GAAATGAATA GAACAACCAC TATCTTTGGA
ACTAAGGGAG AGCTAAGAGC AGATTTTGCT CATAGTCAAA TACGACTAAT GCCTTTCAGA
GGTCAAGAAA AGACTTTTGA AGTAACTAAA AAAGCTGGAG CCCACGGAGG AGGAGATTTG
GGGTTAATGG AACATTTTGA TGGCTTTATT AAGGGAGGAA ATCTATCGAA AGATGCTTCC
ACTCTAGAAG AATCCATAGA AAGCCATATT ACAGCTTTTG CGGCAGAACA ATCAAGAATT
ACTGGAAAAA CTGTTGAAAT AAATAACTTA AGGAGACAGT TAAAATATTA A
 
Protein sequence
MSYSIALIGA GHRGKDIYGK FIKNHCPQLK ITAVAEPNLY KRKLVAREHQ VPSQNQFHSY 
DKLLSQNKLA DALLITTMDH DHYKPVMKGI DSGYKILVEK PIAPNFHEFT DIVNKAFATK
ADILVAHVLR YTPFYQKLKQ LLTDGTIGKI KSIQHIENVG YFHFAHSYVR GNWRNEQVSA
PLFLAKSSHD FDLFSWLLDR KSLTVYAQGE QQYFTRENSP EKAGKRCFKC QAESNCPYSA
RHIYLDKHLP WPQEIIDSMP PRYQRYLGVR FTNLGKCVYQ CDNTMPEILT ASLNYEDNIQ
VSFTLTGLSK EMNRTTTIFG TKGELRADFA HSQIRLMPFR GQEKTFEVTK KAGAHGGGDL
GLMEHFDGFI KGGNLSKDAS TLEESIESHI TAFAAEQSRI TGKTVEINNL RRQLKY
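
The sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence into protein. It is a minimal illustration, not the database's own tooling, and the helper names `clean`, `gc_percent` and `translate` are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for amino acids (the tables differ in permitted start codons). Only the first 30 bases of the gene are used as sample input.

```python
# Build the codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence.
sample = clean("ATGTCTTACA GCATAGCCTT AATTGGAGCA")
print(translate(sample))  # MSYSIALIGA
```

The printed peptide matches the first ten residues of the protein sequence. Pasting the full gene sequence into `clean` lets the same functions be used to check the listed GC content and protein length.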