Gene Moth_1068 details


Gene Information

Locus tag: Moth_1068
Symbol:
ID: 3833333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1098227
End bp: 1099105
Gene Length: 879 bp
Protein Length: 292 aa
Translation table: 11
GC content: 58%
IMG OID: 637828996
Product: dihydrodipicolinate synthase
Protein accession: YP_429925
Protein GI: 83589916
COG category: [E] Amino acid transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR00674] dihydrodipicolinate synthase
            [TIGR00683] N-acetylneuraminate lyase
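
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive positions and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Moth_1068.
length = gene_length(1098227, 1099105)
print(length, protein_length(length))  # prints "879 292"
```

Both results agree with the Gene Length (879 bp) and Protein Length (292 aa) fields listed in the record.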


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.126267
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.042063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTGGG GTAGGATCCT CACAGCAATG GTGACTCCCT TTACAGCGGA CGGGAAATTA 
GATTTAGACG GTGCCCGCAG GCTGGCCGCC TACCTGGTAG ACCACGGCAG CGACGGGTTG
GTGGTTGCCG GTACTACCGG GGAATCGCCG ACCCTGACCC ACGAGGAAAA AATAGCCCTT
TTCCGGGAGG TTAAAAAAGC AGTAGGCGAC CGGGCGGCAG TCATCGCCGG TACAGGTACT
AATTCCACCG CCGCCAGTAT TGAACTCTCC CGGGAAGCCG AGGCCCTGGG GGTAGACGGC
TTGATGCTCG TAGTACCCTA TTACAACCGG CCATCCCAGG AGGGCCTTTA CCAGCATTTT
AAAGCTATAG CAGCGGCCAC CACCCTGCCT ATTATCCTAT ATAATATTCC TTCCCGTACC
GGGCGCAATA TGGATGCGGC TACAACCCTA CGTCTGGCTG AGATCAAGAA TATCAAGGCC
GTAAAAGAGG CCAGCGGCGA TCTGGACCAG GCAACGGCTA TCCTGCGGCA GGCGCCGGCC
GATTTTCTGG TATATAGCGG CGACGACTCC CTGACCCTGC CCCTGATGGC TGTGGGTGGT
TACGGCATAA TCAGTGTCGT CGCCCACGTG GCCGGCGAAA AGATGCAGGC CATGGTCAGG
GCCTTTACTG CCGGGGATGT CCAGGGGGCG GCAGCTCTTC ACCGGGAACT CTTTCCCCTC
TTTAAAGCCC TCTTTATAAC CAGTAACCCG GTGCCGGTAA AGGAAGCCTT GCAGATGTTG
GGACTGCCGG CCGGCCCGGT GCGTTTGCCC CTGGTGGGGG CCACCCGGGA GGAGAAGGAG
AAAATCGCTG CTGCATTGAA GGAAACAGGC CTGTTATAG
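
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be stripped of whitespace before any analysis. The following is a minimal sketch, assuming the sequence is pasted in as a plain string; `clean_sequence`, `gc_fraction`, and `looks_like_complete_cds` are hypothetical helper names. The start-codon check accepts only ATG, which is a simplification, since translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start, recognised stop codon at the end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same grouped layout as the record (not the real gene).
toy = clean_sequence("ATGGCC TAG\n")
print(toy, looks_like_complete_cds(toy))
```

Applied to the full gene sequence above, `gc_fraction` could be compared against the GC content field, and `len(clean_sequence(...))` against the Gene Length field.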
 
Protein sequence
MQWGRILTAM VTPFTADGKL DLDGARRLAA YLVDHGSDGL VVAGTTGESP TLTHEEKIAL 
FREVKKAVGD RAAVIAGTGT NSTAASIELS REAEALGVDG LMLVVPYYNR PSQEGLYQHF
KAIAAATTLP IILYNIPSRT GRNMDAATTL RLAEIKNIKA VKEASGDLDQ ATAILRQAPA
DFLVYSGDDS LTLPLMAVGG YGIISVVAHV AGEKMQAMVR AFTAGDVQGA AALHRELFPL
FKALFITSNP VPVKEALQML GLPAGPVRLP LVGATREEKE KIAAALKETG LL