Gene Information |
Locus tag | Moth_1068 |
Symbol | |
ID | 3833333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1098227 |
End bp | 1099105 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828996 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_429925 |
Protein GI | 83589916 |
COG category | [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase; [TIGR00683] N-acetylneuraminate lyase |
Plasmid Coverage Information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.126267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.042063 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGTGGG GTAGGATCCT CACAGCAATG GTGACTCCCT TTACAGCGGA CGGGAAATTA GATTTAGACG GTGCCCGCAG GCTGGCCGCC TACCTGGTAG ACCACGGCAG CGACGGGTTG GTGGTTGCCG GTACTACCGG GGAATCGCCG ACCCTGACCC ACGAGGAAAA AATAGCCCTT TTCCGGGAGG TTAAAAAAGC AGTAGGCGAC CGGGCGGCAG TCATCGCCGG TACAGGTACT AATTCCACCG CCGCCAGTAT TGAACTCTCC CGGGAAGCCG AGGCCCTGGG GGTAGACGGC TTGATGCTCG TAGTACCCTA TTACAACCGG CCATCCCAGG AGGGCCTTTA CCAGCATTTT AAAGCTATAG CAGCGGCCAC CACCCTGCCT ATTATCCTAT ATAATATTCC TTCCCGTACC GGGCGCAATA TGGATGCGGC TACAACCCTA CGTCTGGCTG AGATCAAGAA TATCAAGGCC GTAAAAGAGG CCAGCGGCGA TCTGGACCAG GCAACGGCTA TCCTGCGGCA GGCGCCGGCC GATTTTCTGG TATATAGCGG CGACGACTCC CTGACCCTGC CCCTGATGGC TGTGGGTGGT TACGGCATAA TCAGTGTCGT CGCCCACGTG GCCGGCGAAA AGATGCAGGC CATGGTCAGG GCCTTTACTG CCGGGGATGT CCAGGGGGCG GCAGCTCTTC ACCGGGAACT CTTTCCCCTC TTTAAAGCCC TCTTTATAAC CAGTAACCCG GTGCCGGTAA AGGAAGCCTT GCAGATGTTG GGACTGCCGG CCGGCCCGGT GCGTTTGCCC CTGGTGGGGG CCACCCGGGA GGAGAAGGAG AAAATCGCTG CTGCATTGAA GGAAACAGGC CTGTTATAG
Protein sequence | MQWGRILTAM VTPFTADGKL DLDGARRLAA YLVDHGSDGL VVAGTTGESP TLTHEEKIAL FREVKKAVGD RAAVIAGTGT NSTAASIELS REAEALGVDG LMLVVPYYNR PSQEGLYQHF KAIAAATTLP IILYNIPSRT GRNMDAATTL RLAEIKNIKA VKEASGDLDQ ATAILRQAPA DFLVYSGDDS LTLPLMAVGG YGIISVVAHV AGEKMQAMVR AFTAGDVQGA AALHRELFPL FKALFITSNP VPVKEALQML GLPAGPVRLP LVGATREEKE KIAAALKETG LL
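The length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the start codons it allows). The names `span_length` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
START_BP = 1098227
END_BP = 1099105
GENE_LENGTH_BP = 879
PROTEIN_LENGTH_AA = 292

# Standard codon assignments listed in TCAG order (assumption: sufficient for
# internal codons under translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start_bp, end_bp):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
first_codons = "ATGCAGTGGG GTAGGATCCT CACAGCAATG"

print(span_length(START_BP, END_BP))  # 879, matches Gene Length
print(GENE_LENGTH_BP // 3 - 1)        # 292, codons minus the stop codon
print(translate(first_codons))        # MQWGRILTAM, first 10 residues
```

The same `translate` call can be applied to the full gene sequence to compare the result with the listed protein sequence.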