Gene Nther_1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1850 
Symbol 
ID6315286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1930031 
End bp1931674 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content42% 
IMG OID642644228 
ProductDihydroxy-acid dehydratase 
Protein accessionYP_001918010 
Protein GI188586465 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0203802 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAA AGTTTGTCGG TACTGATGCG ATTATGAGAC GAGCTATGCT CAAAGGTTGT 
GGATTCGGTG ATGATGATAT CAAAACCAAA CCGCATATTG GGATAGTAAA CACTTACAAC
GAAGGTGCTC CTGGACACGC CCATCTCAAA CAATTATCCG AAGTAATCAA ACAAGGAGTT
TGGGCTGCAG GGGGCGTTCC CTTTGAGTTT GGAGCTCCCT CTACATGTGG AGATATGATC
GTAGGTGAAG AAGAATTAAA ATTTGAACTT GCAGGCAGAG ATGTAGTCGC CCAGGCTGTC
GAGTATGTTT CAACTGTACA TCAGTTTGAT GGGCTTATAT TACTGGCAAG CTGTGATAAT
ATTATCCCCG GTGTTGCTCT AGGAGCTATT AGAATGAACA TCCCCTCTAT CATTTTAACC
GGAGGTTCTA TGCTGGTCGG TGAATACCAA GGAGAGGAAA TTCTGCCCTG CGATGTAGGT
GTTATGACTA TGGGTAAAGA TGCGGAAAGT GAACGGGTCA AAGAGATTGA AAATGTTGCT
TGTATGTGCC CTGGAGCTTG TTCCACCATG GGAACGGCAA ATAGTATGCA AATTATGATG
GAAGTACTAG GTTTAACATT ACCCGGTGTC TCAACAATCC CTGCTGTTTA TGCTGACAAA
CAGAGGGCTT CACGCCTAGC AGGAAAAAGA ATAGTAGATA TGGTCAAGGA AGATTTAAAG
CCAAGTAATG TTTTAACAAG AGAAACCTTT TTAAATGCAG TAACAACAGA TATTGCCATG
GGTGGTTCTA CTAATGTCAT CCTGCATTTA ATTGCCCTTG CAAGAGAAGC AGGAGTAGAA
TTAACGGTTG ATGATTTTGA TAGAATTGGC CGAAATGTAC CTTGTGTTTG TGGTGTAAAA
CCATCAGGCG ATTATACAAT AGTAGATTTT CATAACGCCG GTGGTGTGCC CGCTATGTTA
AAAGAACTAC AATCTTTACT CTATTTAGAT TCAAAAGCTA TTACCGGAGA AACATTACAA
GAAATCATCA ATAAAGCAAG CAATAAAAAT CCCGATGTCA TTAGATCTAT GGACAATCCT
ATCACTTCAG ATGGTGGTCT AACTATCTTA AGAGGTAATC TAGCACCTAA CAGTGCTATT
ATCAGATCTT CTTCTGTCCC TGAAAGCATG AAGAAGTTTT CGGGCCGAGC CAGGGTATTT
CATAGAGATC AGGACGGTGC TAAGGCCATT AAAGAAGGCA AAATTCAACC AGGAGATGTT
ATGGTTATTC GGTACGAGGG ACCGAAAGGT GCCCCAGGAA TGAAAGAAAT AATGTTGAGT
ACCGATGCTC TAGTAGCTCA TGGGCTCGAC GATAGTGTCG GACTTGTGAC AGACGGTAGA
TTTTCCGGAT TTAACCGCGG CCCCATAGTA GGCCATATAA CCCCTGAAGC TTTTGAAGGC
GGCCCTCTGG CCTTGGTAGA AGATGGTGAT ATCATCTCAG TAGATATTAA AGAAGCTACC
CTTACTATTG ACATCAGTGA AGAAGAAATG AAACGACGAG GAGCTAACTG GCAACAACCA
GAACCGAAAG TAAAACAAGG AATGATGCGA CTATACTCCA AGATGTGCAG ATCTGCTGAA
GAAGGAGCAG GTATGACATT ATAA
 
Protein sequence
MNEKFVGTDA IMRRAMLKGC GFGDDDIKTK PHIGIVNTYN EGAPGHAHLK QLSEVIKQGV 
WAAGGVPFEF GAPSTCGDMI VGEEELKFEL AGRDVVAQAV EYVSTVHQFD GLILLASCDN
IIPGVALGAI RMNIPSIILT GGSMLVGEYQ GEEILPCDVG VMTMGKDAES ERVKEIENVA
CMCPGACSTM GTANSMQIMM EVLGLTLPGV STIPAVYADK QRASRLAGKR IVDMVKEDLK
PSNVLTRETF LNAVTTDIAM GGSTNVILHL IALAREAGVE LTVDDFDRIG RNVPCVCGVK
PSGDYTIVDF HNAGGVPAML KELQSLLYLD SKAITGETLQ EIINKASNKN PDVIRSMDNP
ITSDGGLTIL RGNLAPNSAI IRSSSVPESM KKFSGRARVF HRDQDGAKAI KEGKIQPGDV
MVIRYEGPKG APGMKEIMLS TDALVAHGLD DSVGLVTDGR FSGFNRGPIV GHITPEAFEG
GPLALVEDGD IISVDIKEAT LTIDISEEEM KRRGANWQQP EPKVKQGMMR LYSKMCRSAE
EGAGMTL