Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1307 |
Symbol | |
ID | 3831793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1350196 |
End bp | 1351494 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829243 |
Product | homoserine dehydrogenase |
Protein accession | YP_430163 |
Protein GI | 83590154 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.336564 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGGGAC CAATCAATCT TGGCCTTCTG GGCCTGGGTA CTGTTGGCAG CGGTGTGGTC CGGTTGCTGG AGCAGAACAA AGCCATCATT ACCCAGAAAT TGGGCCAGCC TTTAAATATT AAACGCATCC TGGTCCGGGA TTTAAACCGC CCTCGTCAGG TGGCAGTCGA TCCAGCCTTG CTGACAACCG ACCCGGATAC CATCCTGGGT GATCCAGATA TCCCTATTAT CGTCGAGGTC ATGGGGGGTA CCGGGACGGC CAGAGAGTAC ATCCTCCAGG CTCTAAGCCG GGGTAAAAGT GTGGTTACGG CCAACAAGGA TCTCCTCGCC CTTTATGGCA AGGAGCTTTT TGATGCCGCC GACGCCCATG GGGCCGACCT CCTCTTTGAA GCCAGCGTAG GAGGGGGGAT ACCCATTATT CGCCCCCTGA AGGAATGCCT GGCGGGTAAC CGGATTCGTC AGGTCATGGG CATCATTAAT GGTACCACCA ACTATATCTT GACCAAGATG AGCCGCGAAG GCCGCGACTT TAACGACGTT CTAAAGGAAG CCCAGTCCTT GGGTTACGCC GAAGCCGATC CTACGTCCGA TATTGAAGGC GATGATGCCG CACGTAAAAT GGCCATCCTC GCTTCCATAG CCTTCGGTAC CCGGATTACT TACCCGGAGG TTTACCGGGA GGGTATAGGC CGCCTGTCGT CCCATGACAT CAACTACGCC AGGGATATGG GCTATGCCGT CAAGCTCCTG GGCATCGCCC GGGAAGACGA GGACGGGATC GAGGTGCGGG TCCACCCGGC TCTGGTACCC CTGAATCACC CCCTGGCCTC GGTTAGCGAT GTTTTTAACG CCATCTTCGT GGAAGGCGAC GCCGTGGGCG AGACGATGTT TTACGGCCGC GGAGCCGGTT CCCTGCCGAC TGCCAGCGCC GTTGTCGGGG ACATTATTGA AGGGGCCCGT AACCTCCAGC ATCACGACCG GGGCCGGATA TCCTGCACTT GTTTTTATGA TAAACCCCTA AAACCGATAG GAGCAATTAT TACTAAATAT TACCTCCGCC TGGTAGTCGT CGACCGACCG GGAGTCCTGG CTACCATTGC CGGGATTTTC GGCGAGCGTG AAGTCAGCCT GGCCTCGGTC ATCCAGGAAC GGATGCTTGG CGACCTGGCG GAACTGGTGC TTATTACCCA CCGCGTCCGG GAAAAGAATG TCCGGGAAGC CCTGGAGGTT TTAGGCAGCC TGCCGGTGGT CAAAGAGATA GCCAGCGTAA TAAGGGTAGA AGGAGGAGAA GCCAGGTGA
|
Protein sequence | MLGPINLGLL GLGTVGSGVV RLLEQNKAII TQKLGQPLNI KRILVRDLNR PRQVAVDPAL LTTDPDTILG DPDIPIIVEV MGGTGTAREY ILQALSRGKS VVTANKDLLA LYGKELFDAA DAHGADLLFE ASVGGGIPII RPLKECLAGN RIRQVMGIIN GTTNYILTKM SREGRDFNDV LKEAQSLGYA EADPTSDIEG DDAARKMAIL ASIAFGTRIT YPEVYREGIG RLSSHDINYA RDMGYAVKLL GIAREDEDGI EVRVHPALVP LNHPLASVSD VFNAIFVEGD AVGETMFYGR GAGSLPTASA VVGDIIEGAR NLQHHDRGRI SCTCFYDKPL KPIGAIITKY YLRLVVVDRP GVLATIAGIF GEREVSLASV IQERMLGDLA ELVLITHRVR EKNVREALEV LGSLPVVKEI ASVIRVEGGE AR
|
| |