Gene Information |
Locus tag | GWCH70_2615 |
Symbol | |
ID | 7978280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2649855 |
End bp | 2651354 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644799416 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_002950575 |
Protein GI | 239827951 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
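The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions, and that the reported protein length excludes the terminal stop codon (the listed gene sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table for GWCH70_2615.
START_BP = 2649855
END_BP = 2651354
REPORTED_GENE_LENGTH = 1500      # bp
REPORTED_PROTEIN_LENGTH = 499    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final
    codon is a stop codon that is not counted as a residue."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length:", computed_gene_length, "bp")
print("protein length:", computed_protein_length, "aa")
print("matches record:",
      computed_gene_length == REPORTED_GENE_LENGTH
      and computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the computed values agree with the reported 1500 bp and 499 aa. The strand (`-`) does not affect the length calculation, only the orientation in which the sequence is read from the replicon.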
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000242867 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAG CTTCAAGATT GGAAAAAGTA GCGGCGCGAT GTTGGAATTT GCTGAACGAG GGAAAACCGT TTACGCCGAT TTTCGTTATC GGGACGATGG CGATCTACCA TCTTGCCGAT TTCGGCACCA TCGAGCATAT GAAGCACTGG CTCTTGGGTT TTCTCGCGGT GCTTCCACTT TTTATCATCT ATTATATGTA CGACTATCCG TTATTTTTGC GAAATTATTT ATGGATTCCA TACGTTGTGT TTTTAATCGT TTGGCAGTTT GCCGACCTTA AGCTGTTAGG CCTGGCGCTG GGTCTATATT TTTTCTTTAC CGTCTTTTTT TGGGGAACGC TTTACTATCA TTTGCGCATT GGGACATCAT GGTGGAATTT TACCCGTTTT TGGAAGTTAG TGCTGAAAAA TAGTGATTCG ACAAGCGGAA ACGCGCAAGA ACAGCTGCCG AAATTTTTGC TGCTTTTGTC GATTTGGCAA TATGTGTACA TACAGCTGGA AGGGGAGGCA GGCGATCTTT CTCTTGCCGG CTTTGCGTTC TATTATGCGG GTGTTTTTTT GTTTTCTTTC CTCTTGCACA GCCAATTATT TGACTGGAAG CCAAAAATCA TTCCGACATA CACCAATAAT GCCAGCGTTC CAAAAGAGCC AATCAATGAG AAAGTCATTG TGATTGTCAT CGACGGTATG CGGAAAGATC GGTTTGAGCA AGCCAATGCG CCATTTTTAA AATGGCTGCG GCAACATGGG ACGGAGTTTG CCCAAATGGA GACGGTCTAT CCGGCGCGGA CGGTCGTTTG TTTTACGTCG ATGTTTACCG GAACCTATCC GTTTGAACAC GGCATCCGCT CTAACATGGT ATGGAAGCTT GGCATCAAAG TAGAAAGCAT TTTTGATTCA CTAAGGAAAG TAGGGAAAAC GGGACGGCTG CTTGGCATTG CCCATCTCGT CGATTCGTTT GGAGATGATG TCGAAACGGT AACAGCGGTC ATGCATAATG ACGTAGCTGA CCGTAACATC ATTGAACGCG CCAAACGGAT TATGGAAGAA CAAGATCCTG ATTTGCTCAT CGTTCAATTA ATTGCCACAG ACCAAACAGG GCACAGCCGC GGTGTATTAT ACGAGGAATA TCTTCAAAAA ATAGAAGAAG CGGACGCGCT TATCAAGGAG TACGTTGAGT GGCTCGAACA GAAAGGAAAA TTAAAAAACG CAACGTTAAT CATTTGCGCC GATCACGGAC AAGCGGACGG CATCGGAGGA CACGGGCATT TGGATGAAGG AGAACGATTT GTGCCATTCT TCTTATATGG CCCGGCCATT GAGCAAGGAA AGCGGATCGA TGAAAAGAAA AGTTTGGTGT CCGTGGCGCC GACGATTGCT TATTTGCTGG GCACTCCGTA TCCTAGCCAT AGCCGCGGGC CCGTACTAAC GGAAGCGATT CGAAAGAGGG AAGCAGAGGA TGAAGAAGCA AAAAGTGATC GTCTTTTTAC CAGCGTATAA
|
Protein sequence | MKEASRLEKV AARCWNLLNE GKPFTPIFVI GTMAIYHLAD FGTIEHMKHW LLGFLAVLPL FIIYYMYDYP LFLRNYLWIP YVVFLIVWQF ADLKLLGLAL GLYFFFTVFF WGTLYYHLRI GTSWWNFTRF WKLVLKNSDS TSGNAQEQLP KFLLLLSIWQ YVYIQLEGEA GDLSLAGFAF YYAGVFLFSF LLHSQLFDWK PKIIPTYTNN ASVPKEPINE KVIVIVIDGM RKDRFEQANA PFLKWLRQHG TEFAQMETVY PARTVVCFTS MFTGTYPFEH GIRSNMVWKL GIKVESIFDS LRKVGKTGRL LGIAHLVDSF GDDVETVTAV MHNDVADRNI IERAKRIMEE QDPDLLIVQL IATDQTGHSR GVLYEEYLQK IEEADALIKE YVEWLEQKGK LKNATLIICA DHGQADGIGG HGHLDEGERF VPFFLYGPAI EQGKRIDEKK SLVSVAPTIA YLLGTPYPSH SRGPVLTEAI RKREAEDEEA KSDRLFTSV