Gene Moth_1511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1511 
Symbol 
ID3831976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1555787 
End bp1557709 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content61% 
IMG OID637829443 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_430363 
Protein GI83590354 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGC TAGAAAAAAT CAACGAACCG GCAGCCATAA AAAAATTTAC CCTTGCCGAG 
CTTGATATAC TCGCCAGGGA AATCCGCCAG GAACTGGTCC AGACGGTCGC CCGCACGGGC
GGTCACCTGG CCCCCAACCT TGGGGTTGTG GAGCTGACCC TGGCCCTGCA CAGCGTCTTT
GATCTACCCC GGGATAAAAT CATCTGGGAC GTCGGCCACC AGTGCTACGT TCATAAGATC
CTTACCGGAC GGCGCCAGGA AATGACCAGC CTGCGTCAGT TCGGGGGCCT GAGCGGTTTT
CCCAAGCGGG CCGAAAGCCC CTACGACGCC TTTGATACCG GGCATAGCAG CACCTCGATC
TCGGCTGCCC TGGGGATGGC CCTGGCCCGG GACTTAAAGG GAGAAGACTA CCAGGTGGTG
GCTGTTATCG GCGATGGTGC CCTGACGGGC GGGATGGCTT TCGAAGCCAT GAACCATGCC
GGCCACCTGC AGGCCAACTT GATTGTTGTC TTAAATGATA ATGAGATGTC TATCGCCCCT
CCGGTTGGTG GCCTGGCGGC CTATCTTTCC CGCCTGCGGA CGGACCCCAT GTATTCCCGA
GGTAAGGAAG AGCTGGAGAA TCTTCTCAAC CGGCTCCCCC ATTTAGGTCC CCGGGTGCTC
AAGGTAATTG ATCGCCTCAA GGACAGCTTT AAATATCTGG TCGTTCCAGG CATGTTTTTC
GAAGAGATTG GTTTTACCTA CCTGGGGCCC ATTGAAGGTC ACAATATTGC CCGGTTAAAA
GAGGTCCTCC AGCATGCCCG GAATACCAGA GGCCCGGTCC TGGTACATGT AATTACCACC
AAGGGGAAGG GTTACCAGCC GGCCGAGGAC CATCCCGACC GCTTCCACGG CATAGGCCCC
TTTGATCCGG CAACAGGGGA ACCCCTGGCC GGAGGAGGGC CGCCGACCTA CACCTCTGTT
TTTGGTGCCG AACTGGTGCG CCAGGGGGAA AAGAACAACC GCCTGGTGGC CATAACGGCT
GCCATGCCCG ATGGCACCGG CCTGACGCCC TTTGCCCGGC GCTTCCCCAA ACGCTTTTTT
GATGTCGGCA TCGCCGAGCA GCACGCCCTG ACCCTGGCCG CCGGCCTGGC CGCTGCCGGG
ATGCACCCTG TAGTAGCCAT CTATTCTACT TTTTTACAGC GGGCCATTGA CCAGGTAATC
CACGATATCG CCTTAATGGA GCTGCCGGTG GTCCTGGCCA TTGACCGGGC CGGCCTGGTA
GGTGAAGACG GTGAAACCCA CCAGGGTCTC TTTGATGTGT CCCTGTTGCG TTGTGTTCCC
GGCCTGGTCC TCATGGCACC CAAGGATGAA CAGGAACTGC GCCACATGCT GGTAACCGCC
CTCCAGTACC AAGGACCGGC GGCGCTGCGC TACCCCCGGG GCGCCGGTAT GGGTGTGCCC
CTGACGGGAA CGGCCCAGCC TTTGCCCATT GGCAAGGGTG AAGTCCTGCG TCGTGGCCGG
GATGTCACCA TCCTGGCTCT AGGCCCCCTG GCGTATGCAG CCCTGGAAGC GGCCGGGGAC
CTGGCAGCCC GGGGTATCGA AGCCACCGTC ATTAATCCCC GGTTTATTAA GCCCCTGGAT
GAAGACCTGA TCCTCACCTG GGCGGATCGC ACCGGCCATC TGGTGACCGT GGAAGAACAC
GTCCTGGCCG GGGGCTTTGG CAGCGCCGTT CTGGAACTCC TGGCACGGAA CGGGCGCAAG
GGTATCCGGG TGCGGTGCCT GGGGGTGAAG GACGAGTTTG TCCACCAGGG TAAACCAGCC
ATTTTACGGG AACACTTAGG CTTGACTCCG GCCGGGATCA GGGCTGCCGT CCAGGCGCTG
CTGGCGGAGA CCCCGGTCCT GCACCGGCGG CGCAACCAGA CAAAGGGGAT TTCCGGTGGC
TAA
 
Protein sequence
MSLLEKINEP AAIKKFTLAE LDILAREIRQ ELVQTVARTG GHLAPNLGVV ELTLALHSVF 
DLPRDKIIWD VGHQCYVHKI LTGRRQEMTS LRQFGGLSGF PKRAESPYDA FDTGHSSTSI
SAALGMALAR DLKGEDYQVV AVIGDGALTG GMAFEAMNHA GHLQANLIVV LNDNEMSIAP
PVGGLAAYLS RLRTDPMYSR GKEELENLLN RLPHLGPRVL KVIDRLKDSF KYLVVPGMFF
EEIGFTYLGP IEGHNIARLK EVLQHARNTR GPVLVHVITT KGKGYQPAED HPDRFHGIGP
FDPATGEPLA GGGPPTYTSV FGAELVRQGE KNNRLVAITA AMPDGTGLTP FARRFPKRFF
DVGIAEQHAL TLAAGLAAAG MHPVVAIYST FLQRAIDQVI HDIALMELPV VLAIDRAGLV
GEDGETHQGL FDVSLLRCVP GLVLMAPKDE QELRHMLVTA LQYQGPAALR YPRGAGMGVP
LTGTAQPLPI GKGEVLRRGR DVTILALGPL AYAALEAAGD LAARGIEATV INPRFIKPLD
EDLILTWADR TGHLVTVEEH VLAGGFGSAV LELLARNGRK GIRVRCLGVK DEFVHQGKPA
ILREHLGLTP AGIRAAVQAL LAETPVLHRR RNQTKGISGG