Gene Moth_1307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1307 
Symbol 
ID3831793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1350196 
End bp1351494 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content57% 
IMG OID637829243 
Producthomoserine dehydrogenase 
Protein accessionYP_430163 
Protein GI83590154 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.336564 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGGGAC CAATCAATCT TGGCCTTCTG GGCCTGGGTA CTGTTGGCAG CGGTGTGGTC 
CGGTTGCTGG AGCAGAACAA AGCCATCATT ACCCAGAAAT TGGGCCAGCC TTTAAATATT
AAACGCATCC TGGTCCGGGA TTTAAACCGC CCTCGTCAGG TGGCAGTCGA TCCAGCCTTG
CTGACAACCG ACCCGGATAC CATCCTGGGT GATCCAGATA TCCCTATTAT CGTCGAGGTC
ATGGGGGGTA CCGGGACGGC CAGAGAGTAC ATCCTCCAGG CTCTAAGCCG GGGTAAAAGT
GTGGTTACGG CCAACAAGGA TCTCCTCGCC CTTTATGGCA AGGAGCTTTT TGATGCCGCC
GACGCCCATG GGGCCGACCT CCTCTTTGAA GCCAGCGTAG GAGGGGGGAT ACCCATTATT
CGCCCCCTGA AGGAATGCCT GGCGGGTAAC CGGATTCGTC AGGTCATGGG CATCATTAAT
GGTACCACCA ACTATATCTT GACCAAGATG AGCCGCGAAG GCCGCGACTT TAACGACGTT
CTAAAGGAAG CCCAGTCCTT GGGTTACGCC GAAGCCGATC CTACGTCCGA TATTGAAGGC
GATGATGCCG CACGTAAAAT GGCCATCCTC GCTTCCATAG CCTTCGGTAC CCGGATTACT
TACCCGGAGG TTTACCGGGA GGGTATAGGC CGCCTGTCGT CCCATGACAT CAACTACGCC
AGGGATATGG GCTATGCCGT CAAGCTCCTG GGCATCGCCC GGGAAGACGA GGACGGGATC
GAGGTGCGGG TCCACCCGGC TCTGGTACCC CTGAATCACC CCCTGGCCTC GGTTAGCGAT
GTTTTTAACG CCATCTTCGT GGAAGGCGAC GCCGTGGGCG AGACGATGTT TTACGGCCGC
GGAGCCGGTT CCCTGCCGAC TGCCAGCGCC GTTGTCGGGG ACATTATTGA AGGGGCCCGT
AACCTCCAGC ATCACGACCG GGGCCGGATA TCCTGCACTT GTTTTTATGA TAAACCCCTA
AAACCGATAG GAGCAATTAT TACTAAATAT TACCTCCGCC TGGTAGTCGT CGACCGACCG
GGAGTCCTGG CTACCATTGC CGGGATTTTC GGCGAGCGTG AAGTCAGCCT GGCCTCGGTC
ATCCAGGAAC GGATGCTTGG CGACCTGGCG GAACTGGTGC TTATTACCCA CCGCGTCCGG
GAAAAGAATG TCCGGGAAGC CCTGGAGGTT TTAGGCAGCC TGCCGGTGGT CAAAGAGATA
GCCAGCGTAA TAAGGGTAGA AGGAGGAGAA GCCAGGTGA
 
Protein sequence
MLGPINLGLL GLGTVGSGVV RLLEQNKAII TQKLGQPLNI KRILVRDLNR PRQVAVDPAL 
LTTDPDTILG DPDIPIIVEV MGGTGTAREY ILQALSRGKS VVTANKDLLA LYGKELFDAA
DAHGADLLFE ASVGGGIPII RPLKECLAGN RIRQVMGIIN GTTNYILTKM SREGRDFNDV
LKEAQSLGYA EADPTSDIEG DDAARKMAIL ASIAFGTRIT YPEVYREGIG RLSSHDINYA
RDMGYAVKLL GIAREDEDGI EVRVHPALVP LNHPLASVSD VFNAIFVEGD AVGETMFYGR
GAGSLPTASA VVGDIIEGAR NLQHHDRGRI SCTCFYDKPL KPIGAIITKY YLRLVVVDRP
GVLATIAGIF GEREVSLASV IQERMLGDLA ELVLITHRVR EKNVREALEV LGSLPVVKEI
ASVIRVEGGE AR