Gene Moth_1589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1589 
Symbol 
ID3832735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1623596 
End bp1624537 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content58% 
IMG OID637829518 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_430438 
Protein GI83590429 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000469702 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.717874 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTTC TGATAACTGG CGGGTCTGGT GACGTCGGAC GGTACCTGGT CCGGGATTTA 
GCCGGCCGCG GCCATCGGGT ACGGGTCCTG GACCGGGCTC TACCTAACGG TGATGGCCTC
CCTGTCAGCC AAGAGACTCT TTTTAAAGGC CAGCTGGAGG ATAAGGAACT AGTAGTCAGG
GCTGTTAAAG GGGTTGAAGC GGTAATTCAC CTGGCCTGGA GCTTCAGCGA CGACCCCCTG
GAGGTTTTCG GCGGCGACCT GATTGGGCAT ATAAATCTCT TAACAGCGGC TACCAGGGCT
GGGGTGAAGC ACTTTATTTA TGCCAGTACC GCTACAGTTT ACGGCCGGGC TGCCGGGCAT
CCAGTTGTAG AAGAACATCC CTGCCTGGTG GGAGAAGCGC GTAAACCCCT TTATGCCCTG
GGCAAGTTTG CCGCCGAGGA GCTGTGCCGT CAGTACTGCC GTGAACAGGG ATTGCCGGTG
ACCATTTTCC GTTTCTGGTG GGCCTTTGGC GATGAGATTG GCGGTCGCCA TTTGCGCAAC
CTCATACGGG CGGCCCTCAA TGAGGAACCC ATCAAGGTAC CAGTCGCTGC CGGGGGCACC
TTTGTCAGTA TGGCTGACCT GGCTGCCGCC TGCCGGCTGG TCCTGGCGGG GGAAGGGGCT
TGCGGCCAGG TCTATAACCT GGGCAGCCTG TATTTGACCT GGGAAGAGAT CGCCAGCAAG
ATAATTGAAC TTACCGGTTC CGCGGGGGAG CTACAACTGG TACCCCAGAA TGAATGGACA
GGACCGGCTT TCTTAAACGA AGTCTGGGAT CTCAGCTGGG AAAAGGCGGC CCGGGAATTG
GGCTATCGAC CTACCCTCAC CGTCGATGAG GGTCGGTTGG CCTTTACCAG GGCATTGCTT
CGCTGTGTGG ATAAAGTCCG AACGGAAATG GGGAAAAACT AG
 
Protein sequence
MELLITGGSG DVGRYLVRDL AGRGHRVRVL DRALPNGDGL PVSQETLFKG QLEDKELVVR 
AVKGVEAVIH LAWSFSDDPL EVFGGDLIGH INLLTAATRA GVKHFIYAST ATVYGRAAGH
PVVEEHPCLV GEARKPLYAL GKFAAEELCR QYCREQGLPV TIFRFWWAFG DEIGGRHLRN
LIRAALNEEP IKVPVAAGGT FVSMADLAAA CRLVLAGEGA CGQVYNLGSL YLTWEEIASK
IIELTGSAGE LQLVPQNEWT GPAFLNEVWD LSWEKAAREL GYRPTLTVDE GRLAFTRALL
RCVDKVRTEM GKN