Gene Moth_2495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2495 
Symbol 
ID3831598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2600794 
End bp2601996 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content57% 
IMG OID637830417 
Productmetal dependent phosphohydrolase 
Protein accessionYP_431320 
Protein GI83591311 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.211386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTGCC CGCCGGATCG ATCCTTTTGT GAGCTGGTGG TAGCACTATC AACAATCCTT 
GATATCGAGG AAGAAACCAA ACTCTATCAT GCCTGGCGGG TAGCCCTGGT AGCCCAGGAA
CTGGCCCGGA GGGTCATCCC CGACGAGGCG ACCCTGGTGT TTTATGGTGG GCTGCTCCAC
GATATAGGCG CCATGGGGCT GGATGACCAC TTGGTTCATC TGGCCCTCCA GCGTGGTAGC
AGAAATAATC CCGAGGTCGT AAACCACCCC CTGCGGGGGG CGGATATGGT GGCAGCCATT
CCGGGTCTTG GGGAAAAGGT TGCGGCCATG ATCCGGGACC ATCATGAGCG CTGGAATGGC
TCCGGGTACC CCCGGGGTAT TGCCGGGAAT CATATCGTCA CGGGAGCTAT GCTCCTGGGA
CTGGCCGATG AACTGGACCT GGTCCTGCGG GTCCATCCGG GTACTTCTTG GGCCAGGCTG
AGGGAGACCC TGAACCGCAG GGTTCAGGGT GGGTTTCCGC CGGAACTACT GGCTATCCTG
GATAAAATGA TGAATGGTCC CTTGTACGCC GAAATTGCCA CCAATACGGC CCTGGAATTA
AAGATGTTTA AGGTGATTTT GGACCTGCCG CCTATTAATT TCCAGGTACC CGATCCGATG
AAGATCACTA TCGATCTCTT CGCCCGGATA ATTGACGCCA AACATGCCTA TACTGCCGGC
CATTCCCACC GGGTAGCTGC TTATGCTTTA AACCTGGCCC GCTGCCTCGG ATATAATGAT
GCTAAATTGC GGCGTCTGGA GATCGCCGGC CTCCTCCACG ATTTCGGCAA AATCGCCGTT
CCCCGCGCTA TTCTGGATAA ACGGGGCCGG CTCAACAGCG AAGAATTAAA AGTGGTACGC
CGCCACCCGG CCTGGACGAT AGAACTCCTG GAGGGGGTGA CTAGCCTCAA GGATATTGCC
CGGGACGCCG GCCTGCATCA CGAACGTTAT GATGGTAAAG GATATCCCTA TGGCCTCCAT
GATGGTGAAA TTCCCTTGGG GGCCAGGATC ATCGCCGTGG CCGATGCCTT TGACGCCATG
ACTTCTAACC GGCCCTATCA GCCTACCCGT ACGCCGGAAG AGGCTTTAAA AATCCTGGCA
GGGGGGGCCG GCACTCAGTT TGACCCGGAG GTGACAGCAG TTGCCTCCTG CCTGCTAGCC
TGA
 
Protein sequence
MGCPPDRSFC ELVVALSTIL DIEEETKLYH AWRVALVAQE LARRVIPDEA TLVFYGGLLH 
DIGAMGLDDH LVHLALQRGS RNNPEVVNHP LRGADMVAAI PGLGEKVAAM IRDHHERWNG
SGYPRGIAGN HIVTGAMLLG LADELDLVLR VHPGTSWARL RETLNRRVQG GFPPELLAIL
DKMMNGPLYA EIATNTALEL KMFKVILDLP PINFQVPDPM KITIDLFARI IDAKHAYTAG
HSHRVAAYAL NLARCLGYND AKLRRLEIAG LLHDFGKIAV PRAILDKRGR LNSEELKVVR
RHPAWTIELL EGVTSLKDIA RDAGLHHERY DGKGYPYGLH DGEIPLGARI IAVADAFDAM
TSNRPYQPTR TPEEALKILA GGAGTQFDPE VTAVASCLLA