Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0182 |
Symbol | |
ID | 7977928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 198584 |
End bp | 200545 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644797162 |
Product | sulfatase |
Protein accession | YP_002948381 |
Protein GI | 239825757 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAGGA TTGGTGGAAA GCTATTAGAA AAATGCCGTT CTCTCCCTAA CCAATATATT GGTTTTTTTA TTTTTGCCAT ATTTTTATTT TGGATGAAAA CATATGCAGC TTATCAAGCT GAATTTAATT TAGGCATTAG CAATTCCATG CAGGAATTTC TATTATTTAT CAACCCGATT AGTTCGGCGA TTTTCTTTCT CGGGTTAGCG TTACTAGCGA AAGGAAAGCG GACATATACT TGGCTCATTA TCATTAATTT CATTTTATCG TTTATTTTAT ACGCCAATAT CGTATATTAT CGCTTCTTTA GCGATTTTAT TACGTTGCCA ACGCTCAAGC AAACGAAAAA CTTTGGCGAT TTGGGCGGAA GCATTTTGGA ATTAATCGAA TGGTACGATA TTTTTTACTT CTTAGATACC ATTATTTTAA CGGTTATTGT CGCTTCCAAA CGTTTTTCGT TGCCGCCAGT TCAAGCCGGA CGTTATAAAA AAAGCATTGT GTTCGCCATT TCTGTTCTCA TTTTCAGTGT AAACCTTGCA CTAGCGGAAG CAGACCGTCC ACAATTGTTA ACAAGAACGT TTGATCGCAA CTATATCGTG AAATATTTAG GTGTTTACAA CTATTTGATT TATGATGCTG TGCAAAGCAT GAAATCGTCG ACACAGCGTG CTTTTGCGAA TAAAAGCGAT ATCACAACCG TTCTGAACCA TGTACAAGCA ACATATGCAA AACCAAATCC AAAGTATTTC GGTGTAGCAA AAGGAATGAA TGTGATTTAC ATTCATTTGG AATCAGTCCA AAGCTTTTTG ATTAACTATA AATTACATGG TGAAGAAGTA ACGCCATTTT TGAACTCGCT CGTTCGTGAT CCGAATACGT TTTACTTTGA TAACTTTTTC CATCAAACAG GTCAAGGGAA AACATCCGAT GCTGAGTTTA TGCTAGAAAA CTCGTTGTTT GGATTGCCGC AAGGAGCCGT GTTTACGACA AAAGGACAAA ACACGTACCA CGCAGCTCCG GCTATTCTTT CCCAGCAAGG GTATACGACA GCGGTATTCC ACGGCAACTA TAAAACGTTC TGGAACCGCG ATGAAATTTA TAAATCGTTT GGTTTCGACC ATTTCTTTGA CGCAAGCTAC TATGACATGA GCGATGAAAA TGTGCTTAAC TACGGGTTAA AGGATAAACC GTTCTTTAGA GAGTCGATTC CGTTGTTAGA ATCATTAAAA GAGCCGTTCT ATGTAAAATT TATCACGTTA TCGAACCACT TCCCTTATCC AATCAGCAAA GAAGAGGCAA CAATTGAACC AGCTGAAACG GGAGATGGCT CGGTAGACCG TTATTTCCAA ACGGCTCGCT ATTTGGATGA GGCATTAAAA GAGTTCTTTG ATTATCTCAA AAAATCCGGT TTGTACGACC GTTCCGTCAT CATTTTATAT GGTGACCATT ATGGAATTTC CGAAAATCAT AACAAAGCAA TGTCACAAGT GTTAGGAAAA GAAATTACAC CATTTGAACA TGCACAATTG CAGCGAGTGC CTTTATTTAT CCGTGTTCCG GGCGTCAAAG GCGGAATTAT GCACCAATAC GGCGGACAAA TTGATTTATT GCCAACGGTT CTTCACCTAT TAGGAATCGA TACGAAAAAC TATGTTCATT TCGGTACAGA CTTGTTGTCA CCGGAACATC AGGAAATCGT ACCATTCCGT AATGGTAACT TTGTCACTCC GACAGTGACA GCAGTAAATG GAAAATATTA TGATTCGAAA ACGGGCGAAC CAATTAAAGA AACACCGGAA ATTAAACAGC TTGAGCAAAT CGCCCGTACA AAACTCGATC TCTCGGACAA AGTCGTATAC GGAGATTTAC TGCGGTTCTA CACACCAAAA GGATTCAAAC CGGTCGATCC TACAAAATAC GATTATAATA ATCGTGAAGA TAAAGAAAAG GGAAGTAAAT AA
|
Protein sequence | MKRIGGKLLE KCRSLPNQYI GFFIFAIFLF WMKTYAAYQA EFNLGISNSM QEFLLFINPI SSAIFFLGLA LLAKGKRTYT WLIIINFILS FILYANIVYY RFFSDFITLP TLKQTKNFGD LGGSILELIE WYDIFYFLDT IILTVIVASK RFSLPPVQAG RYKKSIVFAI SVLIFSVNLA LAEADRPQLL TRTFDRNYIV KYLGVYNYLI YDAVQSMKSS TQRAFANKSD ITTVLNHVQA TYAKPNPKYF GVAKGMNVIY IHLESVQSFL INYKLHGEEV TPFLNSLVRD PNTFYFDNFF HQTGQGKTSD AEFMLENSLF GLPQGAVFTT KGQNTYHAAP AILSQQGYTT AVFHGNYKTF WNRDEIYKSF GFDHFFDASY YDMSDENVLN YGLKDKPFFR ESIPLLESLK EPFYVKFITL SNHFPYPISK EEATIEPAET GDGSVDRYFQ TARYLDEALK EFFDYLKKSG LYDRSVIILY GDHYGISENH NKAMSQVLGK EITPFEHAQL QRVPLFIRVP GVKGGIMHQY GGQIDLLPTV LHLLGIDTKN YVHFGTDLLS PEHQEIVPFR NGNFVTPTVT AVNGKYYDSK TGEPIKETPE IKQLEQIART KLDLSDKVVY GDLLRFYTPK GFKPVDPTKY DYNNREDKEK GSK
|
| |