Gene Moth_1335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1335 
Symbol 
ID3831045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1381337 
End bp1382365 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content62% 
IMG OID637829271 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_430191 
Protein GI83590182 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0331189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.222136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATAG TCATGCAACC CGGAGCCACC CGCGAAGAAG AGCAGAGAGT TATCAGTCAC 
CTGGAACGGG AGGGTTTCAA GGTGCACCTG TCACGGGGTA TAGAACGGAC TATAATCGGC
GTAATAGGCG ATAAAACCCG GTTAAAGGCC GAAACTGTAA CGGCCCTGGC CGGAGTGGAG
AAGGTTGTCC CCATTTTGCA ACCCTATAAG CTGGCCAGCC GGGATTTCCA TCCCGAGGAT
ACGGTGGTAG AAACAGGCAA TCGGACCATA GGCGGCGCGG CGGTGCAGAT AATCGCCGGG
CCATGTGCCG TGGAAAGCCG TGACCAGCTC CTGGAGGCTG CCGACGCTGT CCGGGAGGCG
GGGGCCACCA TGCTCCGCGG CGGCGCCTAT AAACCCCGCA GCTCACCTTA CTCCTTCCAG
GGGCTGGCTG CCAAGGGTCT GGAAATCCTG GCTGAGGCCC GGGAACGTAC CGGCCTGCCG
GTGGTCACCG AGGTTATGGA CCAGAACCTG GTAGAAGATG TAGCCGCCGT TGCCGACGTC
CTGCAGATCG GCTCCCGCAA TATGCAGAAC TTTGCCCTCC TGCAGGCCGT AGGCCAGACA
AACAAACCCG TCCTCCTAAA GCGCGGTCTG GCAGCCACCA TTGAAGAATG GCTTCTGGCT
GCCGAGTACA TCCTCAACGC AGGCAATAGC CGGGTGATTT TATGCGAGAG AGGTATCCGC
ACCTTTGAGA CCTATACCCG CAATACCCTG GACCTCAGCG CGGTACCGGC GGTAAAGCAT
CTCTCCCACC TGCCTGTAAT TGTCGATCCC AGCCACGGGA CCGGGCGCCG GTTTATGGTG
GCCCCTATGG CCCGGGCCGC CCTGGCAGCC GGGGCCGATG GGATCATGGT TGAGGTACAC
CCCCGACCCC AGGAGGCCCT CTCCGACGGC TCCCAGTCCC TGGACCCGGA ACAGTTCGCC
GCCCTGGTCA GGGAGATCCG GCCCATCATC ACCGCCAGCG GGCGTGAACT GGAGCGGCAG
GCCGTATGA
 
Protein sequence
MIIVMQPGAT REEEQRVISH LEREGFKVHL SRGIERTIIG VIGDKTRLKA ETVTALAGVE 
KVVPILQPYK LASRDFHPED TVVETGNRTI GGAAVQIIAG PCAVESRDQL LEAADAVREA
GATMLRGGAY KPRSSPYSFQ GLAAKGLEIL AEARERTGLP VVTEVMDQNL VEDVAAVADV
LQIGSRNMQN FALLQAVGQT NKPVLLKRGL AATIEEWLLA AEYILNAGNS RVILCERGIR
TFETYTRNTL DLSAVPAVKH LSHLPVIVDP SHGTGRRFMV APMARAALAA GADGIMVEVH
PRPQEALSDG SQSLDPEQFA ALVREIRPII TASGRELERQ AV