Gene Nther_2050 details


Gene Information

Locus tag: Nther_2050
Symbol:
ID: 6315568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 2166408
End bp: 2167490
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 37%
IMG OID: 642644438
Product: 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase
Protein accession: YP_001918205
Protein GI: 188586660
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
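
The coordinates and lengths in this record agree with each other if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the check follows; the helper names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2166408, 2167490)
print(length, protein_length_aa(length))  # prints "1083 360"
```

Both numbers match the Gene Length and Protein Length fields, which is what one would expect for a gene marked as neither spliced nor a pseudogene.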


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.701038
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00000000020842
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCATTG TAATGAATAG ACAAACTCAA GAAGAAAAAA TAAATAAAGT GATAAATAGA 
CTCTCTGAAT TGGAATTAGA ATGTCATATA TCTAAAGGCA AGAATAAAAT TGTTATCGGA
GTAATTGGAG AAAATAAACG CCATGCTTTA GAAGGTTTAG AAGCTCTGCC ATACGTTGAA
CAAATTGTAC CCATAAGTAA ACCATTTAAG CTTGTATCGC GAGAGTTTAA GGAAGATGAT
ACTCAAATAG CTTTTGGTGG AAAAAGTTCT CATGTAGTTA AAGATTCATC CCTATCAGGA
CCAGTAGTTG GTGGAAATAA CTTCTCATTA ATGGCTGGAC CGTGTGCAAT CGAAAATAAA
GAAAACACTT TAGAAATAGC TCAACAAGTC AAGCAAACTG GAGCCCAGTT CCTCAGAGGT
GGCGCTTTTA AACCTAGAAC TTCACCGTAT AGTTTTCAAG GATTAGGAGC GGATGGCCTG
AAAATTATGT GGGAAGCCAG CCAGAAAACG GGTTTAAAAA TAATCACAGA AGTTATGGAC
CCTCGACAGA TTGAATTAGT TAGTGATTAT GCCCATGTTT TACAAATAGG CGCTAGAAAT
ATGCAAAATT TTGAGTTATT AAAAGAGGCT GGAAACAGTA ATCATCCTGT ATTATTGAAG
CGGGGTATGT CAGCTACAAT AGAAGAGTGG TTGATGGCAG CCGAATACAT TTTATCTAAA
GGAAACTATA AAGTAATGCT CTGTGAACGG GGTATAAGAA CCTTTGAAAC AGCTACTAGA
AACACTCTAG ATTTATCTGC AGTAGCACTT GTCAAACAAT TAAGCCATCT TCCTGTAATA
GTTGACCCTA GTCACGGTAC TGGAAAATGG AAGCTAGTGC CAAGTATGAG TAAAGCAGCT
TTAGCTGCAG GGGCAGATGG TTTAATTATT GAAGTTCATT CAACTCCTGA AACAGCCTTG
TCCGATGGTT CTCAGTCTCT TACTCCAGCT AATCTCGAAA AACTAACCAA TCAACTAACA
GAATTAGCAC CTCACTTTGA TAAAAGTTTT ACTATTGGTT CCAAAACCGA GGAGCATATA
TAA
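
The gene sequence is printed in blocks of 10 bases, so the spacing has to be stripped before any statistic is computed from it. The sketch below shows one way to do that and to compute a GC fraction. It assumes the GC content field is the plain count of G and C over total length; the record does not say how that field was derived. Only the first row of the sequence is used as sample input, so the result is not expected to reproduce the whole-gene figure. The function names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence listing only, used as a small sample.
first_row = "ATGATCATTG TAATGAATAG ACAAACTCAA GAAGAAAAAA TAAATAAAGT GATAAATAGA"
seq = clean_sequence(first_row)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the full listing through `clean_sequence` first would give the value to compare against the GC content field.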
 
Protein sequence
MIIVMNRQTQ EEKINKVINR LSELELECHI SKGKNKIVIG VIGENKRHAL EGLEALPYVE 
QIVPISKPFK LVSREFKEDD TQIAFGGKSS HVVKDSSLSG PVVGGNNFSL MAGPCAIENK
ENTLEIAQQV KQTGAQFLRG GAFKPRTSPY SFQGLGADGL KIMWEASQKT GLKIITEVMD
PRQIELVSDY AHVLQIGARN MQNFELLKEA GNSNHPVLLK RGMSATIEEW LMAAEYILSK
GNYKVMLCER GIRTFETATR NTLDLSAVAL VKQLSHLPVI VDPSHGTGKW KLVPSMSKAA
LAAGADGLII EVHSTPETAL SDGSQSLTPA NLEKLTNQLT ELAPHFDKSF TIGSKTEEHI
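
The protein sequence can be related back to the gene sequence through the translation table. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs from it in which start codons are permitted. The sketch below builds that codon table with the standard library only and translates the first 60 bases of the gene. The names `CODON_TABLE` and `translate` are illustrative, and returning `X` for unrecognised codons is a choice made here, not something the record specifies.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listing.
first_row = "ATGATCATTG TAATGAATAG ACAAACTCAA GAAGAAAAAA TAAATAAAGT GATAAATAGA"
print(translate(first_row))  # prints "MIIVMNRQTQEEKINKVINR"
```

The output matches the first 20 residues of the protein sequence listed above.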