Gene Information |
Locus tag | Moth_1066 |
Symbol | |
ID | 3833331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1095907 |
End bp | 1096914 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828994 |
Product | aspartate semialdehyde dehydrogenase |
Protein accession | YP_429923 |
Protein GI | 83589914 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0136] Aspartate-semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms) |
|
|
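The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1095907
end_bp = 1096914


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1008
print(protein_length(length_bp))  # 335
```

Both results agree with the Gene Length (1008 bp) and Protein Length (335 aa) fields listed in the record.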
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.058115 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00506315 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAATT TGAATGTTGC CGTAGTAGGT ACCGGAGCTG TAGGCCAGAC CATGCTTAAA GTTCTGGAGG AAAGGAATTT TCCCGTCGGC AGGTTAAAGG TCCTGGCGAC CAGTCGCTCC GCAGGAAAGA AAGTCACCTT CAAGGGCGAG GAGTACCGCG TTGAGGAAAC CACTCCGGAA TCCTTCGCCG GGGTTAACGT AGCCCTCTTT GCCGGGGGTG AAGCCAGTAA AATCTTTGGC CGGGCGGCGG TGGCCGCCGG AGCAGTGGTA ATTGATAATA GCAATAACTT CCGTATGGAT CCGGAGGTAC CCTTGGTGGT ACCGGAGGTT AATCCCCAGG ATGTACGCTG GCATAAAGGG CTGATTGCCA ACCCCAACTG CTCCACCATT CAGATGGTGG TTGCCCTGAA GCCCCTGTAT GACGCCGCCG GCATCAAGCG GGTAGTAGTC TCGACCTACC AGGCTGTTTC TGGCGCCGGC CAGGAAGCCA TCGATGAGCT GCGGAAACAG AGCCAGCAGG TCTTGGAGGG CAGGGAAGTG AGCGGCAGGG TCTTCCCCTG GCAGATTGCC TTCAACTGCC TGCCCCATAT CGATATCTTC CTGGAGAACG GTTATAGCAA GGAAGAAATG AAGATGGTCA ACGAGACCAA GAAAATTATG GGAGATAATG ATATCCGGGT GACGGCCACC ACGGTACGGG TTCCGGTCTT TAACGGCCAT TCGGAAGCAA TTAATGTAGA GACAAGGGAA AAGCTGACGG CCTCCCAGGC CAGGGAACTC TTGAGCCGGG CCCCCGGGGT GGTGGTAGTC GACGATCTTG ATAATAAGGC CTATCCCCTG GCCATCCAGG CCGACGGCCG GGACGAGGTA TTCGTCGGGC GTATCCGGGA GGATTTCTCC ATTGCCAACG GCCTGAACCT GTGGGTGGTT GCCGATAACC TGCGCAAGGG TGCAGCGACC AATGCCGTGC AGATTGCCGA ATTACTGCTG CAGGAAGGCC TTCTTTAG
|
Protein sequence | MANLNVAVVG TGAVGQTMLK VLEERNFPVG RLKVLATSRS AGKKVTFKGE EYRVEETTPE SFAGVNVALF AGGEASKIFG RAAVAAGAVV IDNSNNFRMD PEVPLVVPEV NPQDVRWHKG LIANPNCSTI QMVVALKPLY DAAGIKRVVV STYQAVSGAG QEAIDELRKQ SQQVLEGREV SGRVFPWQIA FNCLPHIDIF LENGYSKEEM KMVNETKKIM GDNDIRVTAT TVRVPVFNGH SEAINVETRE KLTASQAREL LSRAPGVVVV DDLDNKAYPL AIQADGRDEV FVGRIREDFS IANGLNLWVV ADNLRKGAAT NAVQIAELLL QEGLL
|
| |
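The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence above, used as a short demo input.
demo = clean_sequence("ATGGCGAATT TGAATGTTGC")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.4
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` would give a value that can be compared against the GC content field reported in the record.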