Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2030 |
Symbol | |
ID | 7978983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2088120 |
End bp | 2089439 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644798852 |
Product | peptidase M42 family protein |
Protein accession | YP_002950022 |
Protein GI | 239827398 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAAT GGACAAAACT CATGATTCGG CACGGATTTC ATCCGGAAGA TGAGGAAACA GGAATTACAT CACAATGGCT TGCGCAAACA TTCGCAGCGC TTATCCAACG GCACCAGCAC CTTGACACCA TTTCCGAAAC GGACTGGATG GAAGCGCTCC AACAAACCGC CAAGCAAATT GTTTTTTACA ACGATGACAT TCCAGGAAGA GAAACGCTGG TCGACCCGAC GAAAAACGAA CTTCCTCTCA TTCAAATCGA TCCGTACGTC CGCGGCATCG TCCGCTGGCT GAACATGATG CATATTTACA CCGTTTACAG CTGCGACGGA GAGGGAGTTC GCCCGGCAAC GATTTATTTT CTCGAAGACT TATCCGCCCA GCAGCTCGCC ATCATCCGGG CTTGCACTCC GCCACACGTT CGAATTAGAG CGAAAAAAAG AAAAGTAACA TTATTTTATC AGCGCGGACA CATCGATGAC CTTTTAACGA TGGCCGAACG GTTATACAAC GTCTGGCGAA ATCCAGAGCT GCTAACGACG TACCGTTTAG AAACATTCAA ACACCGCCTA TATTCCCTTC TTTCCATCAA CGGAAGAAGC GGCAGGGAAA CGATGATTCG GCAAATGCTC TATCGAAAGC TCCAACAAAA AACCGATTGG TGCCAAATCG ATGCTTACGG AAACTTGCTT GCCGCGGTTT ATTGCGGAAA CGGCCCGACG ATTCTGCTTT CCGCCCATAT GGATACGGTT CGCCCGTTTT CACCGAAACG TACGATTATC GAAAGCGGAA CCGTACTAAG CAGCTCGCGC GGCATCTTAG GCGCCGACGA CCGCGCGGGA ATCGCGGTCA TCTTAGAAAT ACTTGATTTC ATTCGCCATT CCCGCTTCCA AGGAACGCTG AAAATCGCCT TTACCGTCGA GGAAGAAATC GGCTGCCTCG GCTCGCGTAA CATCGACCCA ACATTTTTGC AAGACGTCGA CGCCGCGATT GTTGTAGACC GCCGCGGAAC GCGCGATATC GTCACTTCTT ACGCCGGCAT CGTGCCGTTT TGCACCGATG AATACGGCCG CATTTTCGAA ACAGCCGGAG CGCTCGCCGG CATGCCCGAC TGGAAAATGA CCCATGGCGG ACTAAGCGAC GCCAAAGTCT TCGCCGAATT CGGCATTCCA TCTGTCAACT TATCCGTCGG CTACGAGCAC GAACATACCG AATTCGAAAC GCTCGACTAC AAAGCAACTC TTGAAACGGT GATGTTACTT GAAACGGCAT TTGAAAACAA TATGATTACA GAAGAACTAG TCGTCACGTA TAAGTGTTAG
|
Protein sequence | MEKWTKLMIR HGFHPEDEET GITSQWLAQT FAALIQRHQH LDTISETDWM EALQQTAKQI VFYNDDIPGR ETLVDPTKNE LPLIQIDPYV RGIVRWLNMM HIYTVYSCDG EGVRPATIYF LEDLSAQQLA IIRACTPPHV RIRAKKRKVT LFYQRGHIDD LLTMAERLYN VWRNPELLTT YRLETFKHRL YSLLSINGRS GRETMIRQML YRKLQQKTDW CQIDAYGNLL AAVYCGNGPT ILLSAHMDTV RPFSPKRTII ESGTVLSSSR GILGADDRAG IAVILEILDF IRHSRFQGTL KIAFTVEEEI GCLGSRNIDP TFLQDVDAAI VVDRRGTRDI VTSYAGIVPF CTDEYGRIFE TAGALAGMPD WKMTHGGLSD AKVFAEFGIP SVNLSVGYEH EHTEFETLDY KATLETVMLL ETAFENNMIT EELVVTYKC
|
| |