Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3358 |
Symbol | |
ID | 7977116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3386136 |
End bp | 3387437 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644800125 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_002951264 |
Protein GI | 239828640 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000060416 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTATATC CCGGCGGGCA ACTGAGTGAG GAAAAAGTAT TTAAAGATCC AGTTCATCGA TATATACACG TTCGCGACAA AGTCATTTGG GATTTAATCG GAACGAAGGA ATTTCAGCGA TTGCGCCGCA TTAAACAGCT CGGTACGACG TATTTGACGT TCCACGGCGC CGAGCATAGC CGCTTTAACC ATTCGCTTGG AGTCTATGAA ATTATTCGCC GCATTGTTGA TGATGTGTTC GTCGGCCGCG AACATTGGGA TCATAGCGAA CGGCTCCTAT GTTTATGTGC GGCCCTGCTG CACGATTTAG GCCACGGTCC GTTTTCCCAT TCATTTGAAA AAGTATTTCA TCTTGATCAT GAAGATTTTA CCCAGGCGAT TATTTTAGGG GATACGGAAG TAAATGAAGT GCTTCGGCGC GTAGGAGAGG ATTTTCCAAA AAAAGTGGCC GAAGTCATTG CGAAAACATA CCCGAATAAA CTGGTTGTCA GCTTAATTTC GGGTCAAATT GACGCCGACC GAATGGATTA TTTGTTAAGA GATGCGTATT ATACCGGCGT CAGTTATGGC AATTTTGATA TGGAACGGAT TTTGCGCGTC ATGCGTCCGC GCGAAGATCA AGTGGTCATT AAGCGAAGCG GCATGCATGC GGTGGAAGAC TATATTATGA GCCGCTATCA AATGTATTGG CAAGTATATT TTCATCCAGT GACGCGCAGT GCCGAAGTCA TTTTAACGAA AATTTTGCAT CGGGCGAAAA AGTTGCATGA AGAAGGATAC AATTTTCAAA CGAAACCGGT TCATTTTTAT TCTCTCTTTA CTGGAAAGGT AGATTTACAA GATTATTTAA AGCTTGATGA AGCGGTCATA TTATTTTATT TCCAGCAATG GCAAGACGAG CGTGACGCGA TTTTAAGCGA CCTATGTCGT CGTTTTGTAA ATCGTCATTT ATTCAAATAT GTCGAATTTA ATCCAACAAA TGAACAAATG ACAAAGCTGA TCGAACTGAC AAACTTATTT AAAAAAGCGG GCATTGATCC GGAATATTAC TTAGTTGTCG ATTCTTCTTC CGATTTGCCG TACGATTTTT ATCGTCCGGG CGAAGAAGGG GAGCGGCTGC CGATTTATTT GTTAATGCCG AACGGAGAGC TCCGCGAACT GTCACGGGAA TCGGTGCTTG TCGATGCGAT TTCCGGAAAG CGGAGAACCG ACCATAAGCT ATATTTTCCA GCTGATTTCA TTTATGATTT TTCAACGAAG CGGACAACCA AGAAAAAAAT TATCGAAATT TTAGAGGGTT AG
|
Protein sequence | MVYPGGQLSE EKVFKDPVHR YIHVRDKVIW DLIGTKEFQR LRRIKQLGTT YLTFHGAEHS RFNHSLGVYE IIRRIVDDVF VGREHWDHSE RLLCLCAALL HDLGHGPFSH SFEKVFHLDH EDFTQAIILG DTEVNEVLRR VGEDFPKKVA EVIAKTYPNK LVVSLISGQI DADRMDYLLR DAYYTGVSYG NFDMERILRV MRPREDQVVI KRSGMHAVED YIMSRYQMYW QVYFHPVTRS AEVILTKILH RAKKLHEEGY NFQTKPVHFY SLFTGKVDLQ DYLKLDEAVI LFYFQQWQDE RDAILSDLCR RFVNRHLFKY VEFNPTNEQM TKLIELTNLF KKAGIDPEYY LVVDSSSDLP YDFYRPGEEG ERLPIYLLMP NGELRELSRE SVLVDAISGK RRTDHKLYFP ADFIYDFSTK RTTKKKIIEI LEG
|
| |