Gene Moth_2268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2268 
Symbol 
ID3831379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2375283 
End bp2376359 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content57% 
IMG OID637830188 
Productalcohol dehydrogenase 
Protein accessionYP_431098 
Protein GI83591089 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.396502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0088894 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTTATA TGATCCCTGA AAAAATGAAA GCCCTGGTTC TCTTCGGCCC TAACGACGTG 
CGCCTGGTAG AAAAGCCGGT GCCCAAACCC GGCCCCGGCG AGGTGCTGGT CAGGGTGGCG
GCCTGCGGCA TCTGCGGCAC CGATGTAAAG ATTATCACCA AGGGCATGCC AAAGATGCCT
CCCTACGGTG AATTTACCTT CGGCCATGAA TGGGCCGGGA CCATTGTCGC CCTGGGAGAA
ACAGTGGACG AATTCCAGGT CGGCGACCGG GTAGCCATCG AGGCTCACAA GGGTTGCGGC
CGCTGTGAAA ACTGCATCGA CGGCAAGTAC ACTGCCTGCC TAAACTACGG CCGCCTGGAC
AAGGGCCACC GGGCCGCGGG CATGACGGTA GACGGTGGCT TTGCCGAGTA TGCCGTCCAG
CATGTAAATT CAGTCTACAA GATTCCCGAC AATATTACTT TCAACGAAGC CACCTATGTG
ACTACGGCCG GCTGTGCCCT CTACGCCATC GACAAGAGCG GCGGTTATAT TGCCGGGGAT
ACGGTCCTGG TCATCGGCCC CGGCCCTATT GGTCTCTCTG TGGTCCAGGG AGCCCGGTCC
CTGGGGGCCG AAAAGATCAT CCTCATGGGA ACCCGAGAAG ACCGCCTGGT CAAGGGTCGT
GAGCTCGGCG CCACCCATAC CATTAATATT CGTGAGGTGG CCGATCCCGT TGCCGAAGTA
ATGGCCATCA CCGGTGGTAA GGGCTGCGAG CGGGTTTTTG AGTGCGCCGG CAACTCTCAG
TCCTTCGAGT ACGGCATCAA AGCGGCTAAA AAGGGCGGTG TCATGGTCCT GGTTTCCTTC
TATAAGGAAC CGGTAATGGC TAACCTGGAT TATGTCGTCT TAAACCAGAT CAGCCTGCTC
ACCGTACGCG GTGAGGGCAA CCAGAACTGT AAGCGCGCCC TGTCACTGAT GGCCCAGGGT
AAGATTGACG CCAAACCAAT TATGACCCAC GCCTTCCCGT TAGAAGAGTT CCAGAAGGGC
CTGGATTACT TCGTCAACCG CAAAGACGGG GCCATGAAGG TGGTTATCAA CCCCTAA
 
Protein sequence
MTYMIPEKMK ALVLFGPNDV RLVEKPVPKP GPGEVLVRVA ACGICGTDVK IITKGMPKMP 
PYGEFTFGHE WAGTIVALGE TVDEFQVGDR VAIEAHKGCG RCENCIDGKY TACLNYGRLD
KGHRAAGMTV DGGFAEYAVQ HVNSVYKIPD NITFNEATYV TTAGCALYAI DKSGGYIAGD
TVLVIGPGPI GLSVVQGARS LGAEKIILMG TREDRLVKGR ELGATHTINI REVADPVAEV
MAITGGKGCE RVFECAGNSQ SFEYGIKAAK KGGVMVLVSF YKEPVMANLD YVVLNQISLL
TVRGEGNQNC KRALSLMAQG KIDAKPIMTH AFPLEEFQKG LDYFVNRKDG AMKVVINP