Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2050 |
Symbol | |
ID | 6315568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 2166408 |
End bp | 2167490 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642644438 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_001918205 |
Protein GI | 188586660 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.701038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00000000020842 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCATTG TAATGAATAG ACAAACTCAA GAAGAAAAAA TAAATAAAGT GATAAATAGA CTCTCTGAAT TGGAATTAGA ATGTCATATA TCTAAAGGCA AGAATAAAAT TGTTATCGGA GTAATTGGAG AAAATAAACG CCATGCTTTA GAAGGTTTAG AAGCTCTGCC ATACGTTGAA CAAATTGTAC CCATAAGTAA ACCATTTAAG CTTGTATCGC GAGAGTTTAA GGAAGATGAT ACTCAAATAG CTTTTGGTGG AAAAAGTTCT CATGTAGTTA AAGATTCATC CCTATCAGGA CCAGTAGTTG GTGGAAATAA CTTCTCATTA ATGGCTGGAC CGTGTGCAAT CGAAAATAAA GAAAACACTT TAGAAATAGC TCAACAAGTC AAGCAAACTG GAGCCCAGTT CCTCAGAGGT GGCGCTTTTA AACCTAGAAC TTCACCGTAT AGTTTTCAAG GATTAGGAGC GGATGGCCTG AAAATTATGT GGGAAGCCAG CCAGAAAACG GGTTTAAAAA TAATCACAGA AGTTATGGAC CCTCGACAGA TTGAATTAGT TAGTGATTAT GCCCATGTTT TACAAATAGG CGCTAGAAAT ATGCAAAATT TTGAGTTATT AAAAGAGGCT GGAAACAGTA ATCATCCTGT ATTATTGAAG CGGGGTATGT CAGCTACAAT AGAAGAGTGG TTGATGGCAG CCGAATACAT TTTATCTAAA GGAAACTATA AAGTAATGCT CTGTGAACGG GGTATAAGAA CCTTTGAAAC AGCTACTAGA AACACTCTAG ATTTATCTGC AGTAGCACTT GTCAAACAAT TAAGCCATCT TCCTGTAATA GTTGACCCTA GTCACGGTAC TGGAAAATGG AAGCTAGTGC CAAGTATGAG TAAAGCAGCT TTAGCTGCAG GGGCAGATGG TTTAATTATT GAAGTTCATT CAACTCCTGA AACAGCCTTG TCCGATGGTT CTCAGTCTCT TACTCCAGCT AATCTCGAAA AACTAACCAA TCAACTAACA GAATTAGCAC CTCACTTTGA TAAAAGTTTT ACTATTGGTT CCAAAACCGA GGAGCATATA TAA
|
Protein sequence | MIIVMNRQTQ EEKINKVINR LSELELECHI SKGKNKIVIG VIGENKRHAL EGLEALPYVE QIVPISKPFK LVSREFKEDD TQIAFGGKSS HVVKDSSLSG PVVGGNNFSL MAGPCAIENK ENTLEIAQQV KQTGAQFLRG GAFKPRTSPY SFQGLGADGL KIMWEASQKT GLKIITEVMD PRQIELVSDY AHVLQIGARN MQNFELLKEA GNSNHPVLLK RGMSATIEEW LMAAEYILSK GNYKVMLCER GIRTFETATR NTLDLSAVAL VKQLSHLPVI VDPSHGTGKW KLVPSMSKAA LAAGADGLII EVHSTPETAL SDGSQSLTPA NLEKLTNQLT ELAPHFDKSF TIGSKTEEHI
|
| |