Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1850 |
Symbol | |
ID | 6315286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1930031 |
End bp | 1931674 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 642644228 |
Product | Dihydroxy-acid dehydratase |
Protein accession | YP_001918010 |
Protein GI | 188586465 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0203802 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAA AGTTTGTCGG TACTGATGCG ATTATGAGAC GAGCTATGCT CAAAGGTTGT GGATTCGGTG ATGATGATAT CAAAACCAAA CCGCATATTG GGATAGTAAA CACTTACAAC GAAGGTGCTC CTGGACACGC CCATCTCAAA CAATTATCCG AAGTAATCAA ACAAGGAGTT TGGGCTGCAG GGGGCGTTCC CTTTGAGTTT GGAGCTCCCT CTACATGTGG AGATATGATC GTAGGTGAAG AAGAATTAAA ATTTGAACTT GCAGGCAGAG ATGTAGTCGC CCAGGCTGTC GAGTATGTTT CAACTGTACA TCAGTTTGAT GGGCTTATAT TACTGGCAAG CTGTGATAAT ATTATCCCCG GTGTTGCTCT AGGAGCTATT AGAATGAACA TCCCCTCTAT CATTTTAACC GGAGGTTCTA TGCTGGTCGG TGAATACCAA GGAGAGGAAA TTCTGCCCTG CGATGTAGGT GTTATGACTA TGGGTAAAGA TGCGGAAAGT GAACGGGTCA AAGAGATTGA AAATGTTGCT TGTATGTGCC CTGGAGCTTG TTCCACCATG GGAACGGCAA ATAGTATGCA AATTATGATG GAAGTACTAG GTTTAACATT ACCCGGTGTC TCAACAATCC CTGCTGTTTA TGCTGACAAA CAGAGGGCTT CACGCCTAGC AGGAAAAAGA ATAGTAGATA TGGTCAAGGA AGATTTAAAG CCAAGTAATG TTTTAACAAG AGAAACCTTT TTAAATGCAG TAACAACAGA TATTGCCATG GGTGGTTCTA CTAATGTCAT CCTGCATTTA ATTGCCCTTG CAAGAGAAGC AGGAGTAGAA TTAACGGTTG ATGATTTTGA TAGAATTGGC CGAAATGTAC CTTGTGTTTG TGGTGTAAAA CCATCAGGCG ATTATACAAT AGTAGATTTT CATAACGCCG GTGGTGTGCC CGCTATGTTA AAAGAACTAC AATCTTTACT CTATTTAGAT TCAAAAGCTA TTACCGGAGA AACATTACAA GAAATCATCA ATAAAGCAAG CAATAAAAAT CCCGATGTCA TTAGATCTAT GGACAATCCT ATCACTTCAG ATGGTGGTCT AACTATCTTA AGAGGTAATC TAGCACCTAA CAGTGCTATT ATCAGATCTT CTTCTGTCCC TGAAAGCATG AAGAAGTTTT CGGGCCGAGC CAGGGTATTT CATAGAGATC AGGACGGTGC TAAGGCCATT AAAGAAGGCA AAATTCAACC AGGAGATGTT ATGGTTATTC GGTACGAGGG ACCGAAAGGT GCCCCAGGAA TGAAAGAAAT AATGTTGAGT ACCGATGCTC TAGTAGCTCA TGGGCTCGAC GATAGTGTCG GACTTGTGAC AGACGGTAGA TTTTCCGGAT TTAACCGCGG CCCCATAGTA GGCCATATAA CCCCTGAAGC TTTTGAAGGC GGCCCTCTGG CCTTGGTAGA AGATGGTGAT ATCATCTCAG TAGATATTAA AGAAGCTACC CTTACTATTG ACATCAGTGA AGAAGAAATG AAACGACGAG GAGCTAACTG GCAACAACCA GAACCGAAAG TAAAACAAGG AATGATGCGA CTATACTCCA AGATGTGCAG ATCTGCTGAA GAAGGAGCAG GTATGACATT ATAA
|
Protein sequence | MNEKFVGTDA IMRRAMLKGC GFGDDDIKTK PHIGIVNTYN EGAPGHAHLK QLSEVIKQGV WAAGGVPFEF GAPSTCGDMI VGEEELKFEL AGRDVVAQAV EYVSTVHQFD GLILLASCDN IIPGVALGAI RMNIPSIILT GGSMLVGEYQ GEEILPCDVG VMTMGKDAES ERVKEIENVA CMCPGACSTM GTANSMQIMM EVLGLTLPGV STIPAVYADK QRASRLAGKR IVDMVKEDLK PSNVLTRETF LNAVTTDIAM GGSTNVILHL IALAREAGVE LTVDDFDRIG RNVPCVCGVK PSGDYTIVDF HNAGGVPAML KELQSLLYLD SKAITGETLQ EIINKASNKN PDVIRSMDNP ITSDGGLTIL RGNLAPNSAI IRSSSVPESM KKFSGRARVF HRDQDGAKAI KEGKIQPGDV MVIRYEGPKG APGMKEIMLS TDALVAHGLD DSVGLVTDGR FSGFNRGPIV GHITPEAFEG GPLALVEDGD IISVDIKEAT LTIDISEEEM KRRGANWQQP EPKVKQGMMR LYSKMCRSAE EGAGMTL
|
| |