Gene Moth_2007 details

Gene Information

Locus tag: Moth_2007
Symbol:
ID: 3831961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2092594
End bp: 2093262
Gene Length: 669 bp
Protein Length: 222 aa
Translation table: 11
GC content: 61%
IMG OID: 637829936
Product: HAD family hydrolase
Protein accession: YP_430846
Protein GI: 83590837
COG category: [R] General function prediction only
COG ID: [COG0546] Predicted phosphatases
TIGRFAM ID: [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
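
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 669 bp) and that the final codon is a stop codon:

```python
# Values copied from the record above.
start_bp = 2092594
end_bp = 2093262

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is the stop codon
# and contributes no amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 669 0 222
```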


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTTGCCG GCGTACTTTT TGATTTTGAC GGCACCCTGG TGGATACTTC CGAGCTGGTT 
ATCAAGTCCT TCCAGCATAC CCTGGCGCCC TACCTGGGCC GGGTGGTGGC CCCGAAGGAG
GTTTACCCTT ACTTTGGTGT TACCTTGAAG GAGGGCCTGG CCGCTTTCAT GCCCGACCAC
CTGGACGAGA TGCTCCATGA GTATCGTCGT TATAGTGCCG CGCACTTTGA TGACCTGGTC
CGGCCCTGTC CTGGGGTCCG TGAGGGTTTG CAACGATTGC AGCAGGCCGG TATTAAACTT
GGAGTAGTCA CTTCTCGATT ACGTGATACT ACCCTCTACG GTCTGGAACT CTGTGGCTTG
ACTTCTTTTT TCCCGGTGAT CGTTGCCGCC GAGGACGTAA CCAGCCACAA GCCGGGCCCG
GAGCCTGTCC GTTACGGCCT GGAACTCCTG GGAGTGGAGG CGGGGGCGGC AGCCATGATC
GGCGACAGTC CTCACGATAT CCAGGCGGCC CGGGCAGCGG GCGTCACCAG CGTGGCCGCC
GGCTGGAGCA GGGTGCCCCG GGACCAGATC CTGGCCGCCG GCCCGGAGGT ACTGGTGGCC
AGCATGACTG AGTTTGTCGA TTTCTGCCTG GACGGTCCTA AGGGGGGGAG GCTGGCTCAG
AATGGGTAA
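
The sequence above is printed in blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to print the sequence in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in an upper-case sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, as printed.
first_line = "TTGTTTGCCG GCGTACTTTT TGATTTTGAC GGCACCCTGG TGGATACTTC CGAGCTGGTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two helpers gives the value to compare against the GC content listed in the record.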
 
Protein sequence
MFAGVLFDFD GTLVDTSELV IKSFQHTLAP YLGRVVAPKE VYPYFGVTLK EGLAAFMPDH 
LDEMLHEYRR YSAAHFDDLV RPCPGVREGL QRLQQAGIKL GVVTSRLRDT TLYGLELCGL
TSFFPVIVAA EDVTSHKPGP EPVRYGLELL GVEAGAAAMI GDSPHDIQAA RAAGVTSVAA
GWSRVPRDQI LAAGPEVLVA SMTEFVDFCL DGPKGGRLAQ NG
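
The protein sequence begins with M although the gene sequence begins with TTG. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG can serve as an alternative start codon, and an initiating codon is read as methionine. The sketch below illustrates this. The `translate` helper is hypothetical, and the set of start codons is written from memory of the NCBI table 11 definition, so treat it as an assumption to verify.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 assigns the same
# amino acids as the standard code and differs only in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("TTGTTTGCCG GCGTACTTTT TGATTTTGAC"))  # prints: MFAGVLFDFD
```

The output matches the first ten residues of the protein sequence above.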