Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0631 |
Symbol | |
ID | 3832529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 654714 |
End bp | 655658 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637828573 |
Product | dihydrodipicolinate synthetase |
Protein accession | YP_429503 |
Protein GI | 83589494 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000301226 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAAA AATGGAGTGG AATTTTTCCT GCGATCATGG TCCCGCTTAA AGAAGACTAT ACCATTAATG AAAAAGAGTT TAGAAATTAT ATTGATTGGT TGCTTACCTT TACTGATCAG GGAATAACAG GTCTTGTAAC TAACGGGCAT ACGGGAGAAA TATCGGGTTT CAATCCGGAA GAAAGAAAGC GCATCACAAA GATTGCTGCC GAACAAGTTG CGGGCAGATG CTTGGTTGTA TCAGGGGTTT CGGCGGAAGG AACCTTTGAA GCAATAGAAC AGGCCAAAGC AGCGCAGGAG GCAGGTGCTG ATGGAATTTT GCTAATGCCT CCGCATATTT GGCTGAGATT TGGTATGAAG CCGGAGTCTG CTCTCAAGTT TGTGCAGGAT GTTGCCTCAG CCATCGATAT TAAAATTATC ATTCATCTTT ATCCCGCTTC GACGAAGGCA TTTTATCCCA TTGAACTTCT ATTAGAAATG GTAAAAATTC CTAACGTAGT AGCTGTCAAG ATGGGAACTC GTGATATGCC AATGTATGAA AGGGATGTTC GGATTTTACG CCAAAAGGCA CCTGAAATAG CGTTACTAAC TTGTCACGAT GAAAACTTGC TATCTACAAT GATACAAGGT GTTGATGGTG CACTGGTCGG CTTTGCCGGC TGCGTTCCGG AATTGGTTAC TGCTTTGTTC CAAGCTGTTC AAAAGGAAGA TTTGAAAGAA GCGAAAAAGA TTAATGAAAG ATTATTTGGG GTTTCAAGTG CTATTTACCA AATTGGTCAA CCTAGCGGGG AAGCTCATGC CCGTATGAAA GAATTCCTGT GTCAGCGGAA AGTATTTTCA CTGCCATTAA TGAGGCCACC CATTGTTCCT CTCGATCAAA AAGAAAAAGA TAAGGTGGCA AAGGCAGTGG CTGATTATGG AATAAGTATA GTTAATTTAG TTTAA
|
Protein sequence | MREKWSGIFP AIMVPLKEDY TINEKEFRNY IDWLLTFTDQ GITGLVTNGH TGEISGFNPE ERKRITKIAA EQVAGRCLVV SGVSAEGTFE AIEQAKAAQE AGADGILLMP PHIWLRFGMK PESALKFVQD VASAIDIKII IHLYPASTKA FYPIELLLEM VKIPNVVAVK MGTRDMPMYE RDVRILRQKA PEIALLTCHD ENLLSTMIQG VDGALVGFAG CVPELVTALF QAVQKEDLKE AKKINERLFG VSSAIYQIGQ PSGEAHARMK EFLCQRKVFS LPLMRPPIVP LDQKEKDKVA KAVADYGISI VNLV
|
| |