Gene Moth_0631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0631 
Symbol 
ID3832529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp654714 
End bp655658 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content42% 
IMG OID637828573 
Productdihydrodipicolinate synthetase 
Protein accessionYP_429503 
Protein GI83589494 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000301226 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAAA AATGGAGTGG AATTTTTCCT GCGATCATGG TCCCGCTTAA AGAAGACTAT 
ACCATTAATG AAAAAGAGTT TAGAAATTAT ATTGATTGGT TGCTTACCTT TACTGATCAG
GGAATAACAG GTCTTGTAAC TAACGGGCAT ACGGGAGAAA TATCGGGTTT CAATCCGGAA
GAAAGAAAGC GCATCACAAA GATTGCTGCC GAACAAGTTG CGGGCAGATG CTTGGTTGTA
TCAGGGGTTT CGGCGGAAGG AACCTTTGAA GCAATAGAAC AGGCCAAAGC AGCGCAGGAG
GCAGGTGCTG ATGGAATTTT GCTAATGCCT CCGCATATTT GGCTGAGATT TGGTATGAAG
CCGGAGTCTG CTCTCAAGTT TGTGCAGGAT GTTGCCTCAG CCATCGATAT TAAAATTATC
ATTCATCTTT ATCCCGCTTC GACGAAGGCA TTTTATCCCA TTGAACTTCT ATTAGAAATG
GTAAAAATTC CTAACGTAGT AGCTGTCAAG ATGGGAACTC GTGATATGCC AATGTATGAA
AGGGATGTTC GGATTTTACG CCAAAAGGCA CCTGAAATAG CGTTACTAAC TTGTCACGAT
GAAAACTTGC TATCTACAAT GATACAAGGT GTTGATGGTG CACTGGTCGG CTTTGCCGGC
TGCGTTCCGG AATTGGTTAC TGCTTTGTTC CAAGCTGTTC AAAAGGAAGA TTTGAAAGAA
GCGAAAAAGA TTAATGAAAG ATTATTTGGG GTTTCAAGTG CTATTTACCA AATTGGTCAA
CCTAGCGGGG AAGCTCATGC CCGTATGAAA GAATTCCTGT GTCAGCGGAA AGTATTTTCA
CTGCCATTAA TGAGGCCACC CATTGTTCCT CTCGATCAAA AAGAAAAAGA TAAGGTGGCA
AAGGCAGTGG CTGATTATGG AATAAGTATA GTTAATTTAG TTTAA
 
Protein sequence
MREKWSGIFP AIMVPLKEDY TINEKEFRNY IDWLLTFTDQ GITGLVTNGH TGEISGFNPE 
ERKRITKIAA EQVAGRCLVV SGVSAEGTFE AIEQAKAAQE AGADGILLMP PHIWLRFGMK
PESALKFVQD VASAIDIKII IHLYPASTKA FYPIELLLEM VKIPNVVAVK MGTRDMPMYE
RDVRILRQKA PEIALLTCHD ENLLSTMIQG VDGALVGFAG CVPELVTALF QAVQKEDLKE
AKKINERLFG VSSAIYQIGQ PSGEAHARMK EFLCQRKVFS LPLMRPPIVP LDQKEKDKVA
KAVADYGISI VNLV