Gene Moth_2131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2131 
Symbol 
ID3833131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2229345 
End bp2230340 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content56% 
IMG OID637830056 
Productmetal dependent phosphohydrolase 
Protein accessionYP_430966 
Protein GI83590957 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000504483 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.911258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGAA GTGCAGGCCT GTACAGCGAT GGTATCATTG ATAGGTTGGC TATCAATCGC 
CTGAATTTAG CCACCCCGGC AGTAAAGACC CTTTTCCGGT GGGCGGCGGT TTTATCTCTA
GCTATTATTA CGATTTTAAA CCATTATGCC CGGGGCCTGC CTTTCCACTT TCTCCTTGAC
TTCCTCTACC TCGTCCCGGT AACCGTCGCC GCCCTCAATT CTTTACTCGA AGGCCTGGCG
GTTGCCCTGG TGGCTGGTAG CCTGCGTATG CTTACGAACC CTTTAGTTTT TTCTATTATT
AAACCCAGCG ATTATGTTGA TTTTCTAGTG GTAACCGGCT TTTACCTGGT CGACCCGGTG
ACTATAGAGC TATTGAGACG CCTGGCATGG CAGCGGCAGC AGCTCCAGCG TAATCTGCAA
CTGACGACGG CTGCCTTGCT CGAGGCCCTG CAGATGCGCG ACCAGTATAC CGGTTGGCAC
TCACGCCAGG TAGCCATTTA TGCGCGCCGG ATAGCCGCCA GCTTAGGTTT ATCGCCGTAC
CACCAGGAGT GCCTTTACCT GGCCGGGCTG CTCCATGACA TCGGGAAAAT CGGCGTTGAT
GACGCCTGCC TGAACAAACC CGGCCTGCTG ACGCCGGAAG AATGGCAAAA CGTCCGCCGC
CACCCCGGAT TGGGATACAA GATAATAAGA AAAGTAACCA GTCGAGAAGA AGTTATTGCC
CGGGCAGTCC TGTATCACCA CGAGCGCTAT GACGGCCGCG GTTATCCCAG AGGGCTGAAA
GGTACAAGCA TCCCCTTGGA AGCCCGCATT TTAAGTGTGG CCGACTGCTT TGACGCCATG
ACCACGGACC GGGTTTACCG GCCGGCCCTG TCCCCGGCTG AGGCTGTCAA GGAACTAATG
CGCTGCGCCG GCAGCCAGTT TGACCCTGGC ATAGTCGAGG TCTTCTACCG CATCCTGGCC
GCGGATGGCC TGATACAGAA CCCGGAGGAG GGGTAA
 
Protein sequence
MGRSAGLYSD GIIDRLAINR LNLATPAVKT LFRWAAVLSL AIITILNHYA RGLPFHFLLD 
FLYLVPVTVA ALNSLLEGLA VALVAGSLRM LTNPLVFSII KPSDYVDFLV VTGFYLVDPV
TIELLRRLAW QRQQLQRNLQ LTTAALLEAL QMRDQYTGWH SRQVAIYARR IAASLGLSPY
HQECLYLAGL LHDIGKIGVD DACLNKPGLL TPEEWQNVRR HPGLGYKIIR KVTSREEVIA
RAVLYHHERY DGRGYPRGLK GTSIPLEARI LSVADCFDAM TTDRVYRPAL SPAEAVKELM
RCAGSQFDPG IVEVFYRILA ADGLIQNPEE G