Gene Moth_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1081 
Symbol 
ID3833194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1111782 
End bp1113296 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content55% 
IMG OID637829009 
Productmetal dependent phosphohydrolase 
Protein accessionYP_429938 
Protein GI83589929 
COG category[R] General function prediction only 
COG ID[COG1418] Predicted HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR03319] conserved hypothetical protein YmdA/YtgF 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.075195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAATAG CTTTCCTGGT CGGCGGGGCG GGTGGTTATG CCATCCGCAA ATACCTGGCT 
GAGGCCAAGA TAGCTTCAGC CGAAAAGGCT GCCGCCACCA TTATCGAAGA GGCCAAAAAA
GAAGCTGAAG CCAGGAAAAG GGAAGCGGTT CTGGAGGCCA AGGATGAAGT CCACCGCATG
CGTAATGAGG TGGAACGGGA GAGCAGGGAA CGACGCAATG AACTCCAGCG TTTAGAGCGG
CGCTTGCTGC AAAAAGAGGA AACCCTGGAA CGCAAATCTG AAACCCTGGA ACGCAAAGAG
GCCAGCCTGC ACCGCCAGGA AGAAGCGATT CAGCGTACCA GGGAAGAGGT AGAGAAAATT
CGCCAGCAGC AAGTGAGCGA ACTGGAGCGG ATTTCCGGCT TAACTACCGA GGCTGCCAGG
AATATCCTTT TGAAAAACGT GGAGGAAGAA ATCAGGCATG AAACAGCAAT GCTCATCAAG
CAGGTAGAGG CTGAAGCGAA GGAAGAGGCC GAGAAAAGGG CCCGGGAAAT TATCACCTAC
GCTATCCAGC ACTGTGCTGC CGACTATGTA GCTGAAGCTA CAGTATCGGT AGTTAACCTG
CCCAATGACG AGATGAAGGG GCGGATAATC GGCCGTGAGG GCCGGAATAT CCGGGCCCTG
GAAACCCTGA CTGGCGTAGA CCTCATTATA GATGATACGC CGGAAGCGGT TATCCTGTCC
TGCTTTGATC CTATTCGCCG GGAGATTGCC CGGATAGCCT TGGAAAAACT TATAGCCGAT
GGGCGCATCC ATCCGGCGCG GATTGAAGAA ATGGTGGAAA AGGCCCGGCG GGAACTGGAT
ACGAAGATCC GGGAGGAAGG CGAGCAGGCC ACCTTCGAGG TGGGTATCCA CGGCCTGCAC
CCGGAATTGG TGCGCCTGCT GGGTAAATTG AAATATCGTA CCAGCTACGG CCAGAATGTC
CTGAAACACT CCCTGGAAGT CGCCTTCCTG GCGGGGGCTA TGGCCGCCGA ACTGGGTGTA
GATGTACTGG TGGCTAAACG GGCTGGCCTG CTTCATGACA TAGGCAAGGC GGTGGACTTT
GAAGTTGAAG GCCCCCACGT CAACCTGGGG GTTGAACTGG CCAAAAAGTA CCGGGAGTCA
CCGGAGGTCA TTCATGCCAT TGAAGCCCAT CACGGCGATG TAGAGCCTAA AAGTATTGAA
GCTGGACTGG TCCAGGCTGC TGATGCCATT TCCGCCGCCC GTCCCGGAGC CAGGCGTGAG
ACCCTGGAAG CCTATATTAA GCGCTTAGAA AAACTGGAAG AGATTGCCAA TTCCTTTAGC
GGCGTAGAAA AATCCTATGC CATCCAGGCC GGACGTGAAG TTCGTATCCT GGTTAAACCG
GATAAGATTG ACGATGCCAT GGCCGTTCGC TTGGCCCGGG ATATCGTCAA AACCATCGAG
CAGACAATGG AGTATCCAGG CCAGATCAAG GTAGTGGTCA TCCGGGAAAC CCGGGCTGTA
GATTACGCCA AATAG
 
Protein sequence
MLIAFLVGGA GGYAIRKYLA EAKIASAEKA AATIIEEAKK EAEARKREAV LEAKDEVHRM 
RNEVERESRE RRNELQRLER RLLQKEETLE RKSETLERKE ASLHRQEEAI QRTREEVEKI
RQQQVSELER ISGLTTEAAR NILLKNVEEE IRHETAMLIK QVEAEAKEEA EKRAREIITY
AIQHCAADYV AEATVSVVNL PNDEMKGRII GREGRNIRAL ETLTGVDLII DDTPEAVILS
CFDPIRREIA RIALEKLIAD GRIHPARIEE MVEKARRELD TKIREEGEQA TFEVGIHGLH
PELVRLLGKL KYRTSYGQNV LKHSLEVAFL AGAMAAELGV DVLVAKRAGL LHDIGKAVDF
EVEGPHVNLG VELAKKYRES PEVIHAIEAH HGDVEPKSIE AGLVQAADAI SAARPGARRE
TLEAYIKRLE KLEEIANSFS GVEKSYAIQA GREVRILVKP DKIDDAMAVR LARDIVKTIE
QTMEYPGQIK VVVIRETRAV DYAK