Gene Information |
Locus tag | GWCH70_1466 |
Symbol | |
ID | 7976912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1538784 |
End bp | 1541291 |
Gene Length | 2508 bp |
Protein Length | 835 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644798370 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002949543 |
Protein GI | 239826919 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
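The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1538784
end_bp = 1541291
gene_length_bp = 2508
protein_length_aa = 835


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the span is 2508 bp, which is 836 codons, or 835 residues plus one stop codon, in agreement with the table.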
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.13243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAATCA ATAAAAATTG GAAGATACAG CATTTTGATG TAGGACAGGT AAGGGATTTA ACGATTGCAG ATCCAAACTA TATTGATCAC TTTTGGATAC CAGCAAAGGT GCCAGGAGAT GTACATTCGA TTTTACGGGA AAAGAAATTG ATTGATGATC CCTTTTTCGG CTACAATGAT TTGAAATCGA AATGGGTGGA AGAAAAGGTT TGGTGGTATC GTACGGAATT TACATTTGAT AAAAACAATC TAGACAAAGA TGAACGGTTA GAGTTGATTT TTGAAGGCTT AGATACCTTT GCGACTGTTT ATTTAAACGG AGTGGAATTA GGGACTACGG AAAATATGTT CATTTCTCAT ACTTTTGATG TTACTAGAGA AATAGTGGAT GGAAGAAATG TTTTGGCAGT GAAATTTAAT CCCGTTTCCT ATCAGTTAAA GGAGAAAGAG AAAAATTATT GGGCTGGATT TGGAAAAGAT CGCATATGGG CGCGCAAGGC TCAATATCAT TTTGGCTGGG ACTGGGGTCC GCAAATTCTA ACGGTAGGAA TTTGGAAAGA GGTGCGTTTA GAAAAAAGAA AAATTGCCAA AATCGAAAGT GTTTACGCAA GAACACTTGA CTTGAAAGAT TCTCGGGCCG TTGTTCAAAT CGATATTTAC ACCAAAAACT TTGTTAAGGG AAAAAGTTTA CAGGCGGAAG TGACGTTAAA AGATCGAGAA CAACAATTTT TCCAAACCGT AAATATTGAT CAAAATCGAG CAACACTCAC TTTCAACATC GACAATCCAA ACCTTTGGTG GACGCATGAT TTAGGAGAAC CGAATCTTTA TCAGCTATCT GTCGTTTTAA AATGGGAAGG GGAAGTTCTA GACACATATC AAACAGAAAT TGGGATTCGT ACGGTGGAAG TAATGAAAAG AGATCGCAAA GGGAATCCGC GGTTTACCTT TGTTTTAAAT GGTGTAGAGA TATTTGCGAA AGGAGCGAAC TGGATTCCTA TTGACAGCTT TTTAGGATCT GTACCGGAAT CACGCTATCG TCATCTTATT CAGCTTGCGA AAGAAGCGAA TATGAATATG CTGCGTGTAT GGGGCGGTGG GATTTATGAA AAAGACATTT TCTATCAAGA ATGCAATCGC CAAGGTATTT TAGTTTGGCA GGACTTTATG TTTGCCTGTG CGTTATACCC GGATTACAAC CGCGATTATA TGGAAAACGT CCGCGAAGAA GTGATTTCGG TGATTAAGCG GCTTCGTAAT CATCCTTGCA TCGCTTTATG GTGCGGAAAT AACGAAAACG ATTGGTTGTA TGAAGTGGAG CATGCCGCTG GAAAAATTCA CACTCCTTTT TATGGAGAAA AAATATATCA TGAGTTAATT CCTGAACTAC TGGAAGAATT AGATCCTTCC CGTCCATATT GGCCAAGTTC GCCATATGGC GGCAATGATC ACAACTCACA AGAGGAAGGC GACCGGCATA ATTGGCAAGT TTGGCACGGG AATGTGGAAC CTCGAAAATT CGGTCAGAAT TTAGGACAAA ACATCAGTGT GGAGGGGGTT TCGTTTCGAA ATTATAAAAA AGATCGCACC CGGTTTTGCA GCGAGTTTGG CATGCATGCT TCTGCCAATC GTTATACGCT GGAAAAAAAT CTGCCTGACG GAACTTTTTA TTGGGGCAGC GATGAATTGG CATATCGCAA TAAAGATTTT CATCATGAAA AAGGGCTCTT ATTAATGGAA GGATATACCG GCATTCCAAA AAATATTGAG GAATACATGA ATTATTCGAT GCTCACACAG 
GCTGAAGGTT TAAAGTATGG AATGGAACAT TACCGCCGCA ATAAACCGCA AACAAGCGGG GCTTTAATCT GGCAATTGAA TGATTGCTGG CCTGGCACGA GTTGGTCGAT GATCGATTAT TACTTGTTGC CAAAGGCTTC ATATTATTAT AGTAAAAAAT TTAATGCCCC TCTTTTATAT ACGCTTGAAC ATGACCCTGG CGATGATTTA CATCTATGGG TTGTGAATGA CCGATTAGAA GATGTGAAAG ATACGCTGGT ATTCGAAGTG TTCCGATTTA ATGGCGAGTT AGTGTATTCG AAAGAATTTT TGATCCATGT GAAAGGAAAT GCTTCGGTTC AAATTGCTTC CTTAACAGAA GCGGAGGTTT TACAAGGCAA TCCTGCTGAA CAAGTTGTCG TCCGCTTAAA ATCGTTGAAT AAAAAAGCAG AGGAAAACTA TTATTACCTA AGAAATCATA AAGATCTTCA GCTGCCTAAG GCGAAATTGC AAGTAAAAGT GATGCCGGAA AAACAAGAAG TAGAAATTCG GACAGATTGT TTTGCGCGTT TTGTTAAACT GGAACTTCCG GCAGAAAAGA TTATTTTTTC GGATAACTTC TTCGACCTAC TTCCATCGGA GCGAAAGATC ATCAAGATTA GACATTTAGA TGGCAAGACT ATTTCTTTAG ACGGTTTAAG CGTATCAGCC ATCAACGGCA GTGCTTGA
|
Protein sequence | MLINKNWKIQ HFDVGQVRDL TIADPNYIDH FWIPAKVPGD VHSILREKKL IDDPFFGYND LKSKWVEEKV WWYRTEFTFD KNNLDKDERL ELIFEGLDTF ATVYLNGVEL GTTENMFISH TFDVTREIVD GRNVLAVKFN PVSYQLKEKE KNYWAGFGKD RIWARKAQYH FGWDWGPQIL TVGIWKEVRL EKRKIAKIES VYARTLDLKD SRAVVQIDIY TKNFVKGKSL QAEVTLKDRE QQFFQTVNID QNRATLTFNI DNPNLWWTHD LGEPNLYQLS VVLKWEGEVL DTYQTEIGIR TVEVMKRDRK GNPRFTFVLN GVEIFAKGAN WIPIDSFLGS VPESRYRHLI QLAKEANMNM LRVWGGGIYE KDIFYQECNR QGILVWQDFM FACALYPDYN RDYMENVREE VISVIKRLRN HPCIALWCGN NENDWLYEVE HAAGKIHTPF YGEKIYHELI PELLEELDPS RPYWPSSPYG GNDHNSQEEG DRHNWQVWHG NVEPRKFGQN LGQNISVEGV SFRNYKKDRT RFCSEFGMHA SANRYTLEKN LPDGTFYWGS DELAYRNKDF HHEKGLLLME GYTGIPKNIE EYMNYSMLTQ AEGLKYGMEH YRRNKPQTSG ALIWQLNDCW PGTSWSMIDY YLLPKASYYY SKKFNAPLLY TLEHDPGDDL HLWVVNDRLE DVKDTLVFEV FRFNGELVYS KEFLIHVKGN ASVQIASLTE AEVLQGNPAE QVVVRLKSLN KKAEENYYYL RNHKDLQLPK AKLQVKVMPE KQEVEIRTDC FARFVKLELP AEKIIFSDNF FDLLPSERKI IKIRHLDGKT ISLDGLSVSA INGSA
|
| |
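The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how such a field could be cleaned and checked, the example below strips the spacing from the first three blocks of the gene sequence, computes their GC fraction, and translates them. The function names and the partial codon map are hypothetical and cover only the codons in this sample. A full translation would need the complete table 11 codon set.

```python
# First three blocks of the gene sequence field, copied from the record.
SAMPLE = "ATGTTAATCA ATAAAAATTG GAAGATACAG"

# Partial codon map: only the codons that occur in SAMPLE.
# These assignments are the same in translation table 11 and the standard code.
PARTIAL_CODONS = {
    "ATG": "M", "TTA": "L", "ATC": "I", "AAT": "N", "AAA": "K",
    "TGG": "W", "AAG": "K", "ATA": "I", "CAG": "Q",
}


def clean_sequence(text: str) -> str:
    """Remove the display spacing and line breaks from a sequence field."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, table: dict) -> str:
    """Translate complete codons in frame 1; raises KeyError on unmapped codons."""
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


dna = clean_sequence(SAMPLE)
peptide = translate(dna, PARTIAL_CODONS)
print(len(dna), round(gc_fraction(dna), 3), peptide)
```

The translated sample matches the first ten residues of the protein sequence field (MLINKNWKIQ). Applying `gc_fraction` to the full cleaned gene sequence would give a value that can be compared with the GC content field. The fraction for this 30-base sample alone is not expected to match it.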