Gene GWCH70_3358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3358 
Symbol 
ID7977116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3386136 
End bp3387437 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content43% 
IMG OID644800125 
Productmetal dependent phosphohydrolase 
Protein accessionYP_002951264 
Protein GI239828640 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000060416 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATATC CCGGCGGGCA ACTGAGTGAG GAAAAAGTAT TTAAAGATCC AGTTCATCGA 
TATATACACG TTCGCGACAA AGTCATTTGG GATTTAATCG GAACGAAGGA ATTTCAGCGA
TTGCGCCGCA TTAAACAGCT CGGTACGACG TATTTGACGT TCCACGGCGC CGAGCATAGC
CGCTTTAACC ATTCGCTTGG AGTCTATGAA ATTATTCGCC GCATTGTTGA TGATGTGTTC
GTCGGCCGCG AACATTGGGA TCATAGCGAA CGGCTCCTAT GTTTATGTGC GGCCCTGCTG
CACGATTTAG GCCACGGTCC GTTTTCCCAT TCATTTGAAA AAGTATTTCA TCTTGATCAT
GAAGATTTTA CCCAGGCGAT TATTTTAGGG GATACGGAAG TAAATGAAGT GCTTCGGCGC
GTAGGAGAGG ATTTTCCAAA AAAAGTGGCC GAAGTCATTG CGAAAACATA CCCGAATAAA
CTGGTTGTCA GCTTAATTTC GGGTCAAATT GACGCCGACC GAATGGATTA TTTGTTAAGA
GATGCGTATT ATACCGGCGT CAGTTATGGC AATTTTGATA TGGAACGGAT TTTGCGCGTC
ATGCGTCCGC GCGAAGATCA AGTGGTCATT AAGCGAAGCG GCATGCATGC GGTGGAAGAC
TATATTATGA GCCGCTATCA AATGTATTGG CAAGTATATT TTCATCCAGT GACGCGCAGT
GCCGAAGTCA TTTTAACGAA AATTTTGCAT CGGGCGAAAA AGTTGCATGA AGAAGGATAC
AATTTTCAAA CGAAACCGGT TCATTTTTAT TCTCTCTTTA CTGGAAAGGT AGATTTACAA
GATTATTTAA AGCTTGATGA AGCGGTCATA TTATTTTATT TCCAGCAATG GCAAGACGAG
CGTGACGCGA TTTTAAGCGA CCTATGTCGT CGTTTTGTAA ATCGTCATTT ATTCAAATAT
GTCGAATTTA ATCCAACAAA TGAACAAATG ACAAAGCTGA TCGAACTGAC AAACTTATTT
AAAAAAGCGG GCATTGATCC GGAATATTAC TTAGTTGTCG ATTCTTCTTC CGATTTGCCG
TACGATTTTT ATCGTCCGGG CGAAGAAGGG GAGCGGCTGC CGATTTATTT GTTAATGCCG
AACGGAGAGC TCCGCGAACT GTCACGGGAA TCGGTGCTTG TCGATGCGAT TTCCGGAAAG
CGGAGAACCG ACCATAAGCT ATATTTTCCA GCTGATTTCA TTTATGATTT TTCAACGAAG
CGGACAACCA AGAAAAAAAT TATCGAAATT TTAGAGGGTT AG
 
Protein sequence
MVYPGGQLSE EKVFKDPVHR YIHVRDKVIW DLIGTKEFQR LRRIKQLGTT YLTFHGAEHS 
RFNHSLGVYE IIRRIVDDVF VGREHWDHSE RLLCLCAALL HDLGHGPFSH SFEKVFHLDH
EDFTQAIILG DTEVNEVLRR VGEDFPKKVA EVIAKTYPNK LVVSLISGQI DADRMDYLLR
DAYYTGVSYG NFDMERILRV MRPREDQVVI KRSGMHAVED YIMSRYQMYW QVYFHPVTRS
AEVILTKILH RAKKLHEEGY NFQTKPVHFY SLFTGKVDLQ DYLKLDEAVI LFYFQQWQDE
RDAILSDLCR RFVNRHLFKY VEFNPTNEQM TKLIELTNLF KKAGIDPEYY LVVDSSSDLP
YDFYRPGEEG ERLPIYLLMP NGELRELSRE SVLVDAISGK RRTDHKLYFP ADFIYDFSTK
RTTKKKIIEI LEG