Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2444 |
Symbol | |
ID | 7979003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2477769 |
End bp | 2478887 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644799246 |
Product | germination protease |
Protein accession | YP_002950406 |
Protein GI | 239827782 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01441] GPR endopeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000364283 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGCT CAATCGATTT AAGCATGTAT TCGGTACGAA CCGATTTAGC GATCGAAGCT CATGAAATAG CTGTGGAAGA GCGTCTCCAA CAAAAAAGGG AAAGCGCTTC CCCTATTGAA GGAGTCATTA TTCACGATCG CGAAATAGAC GGGATTAAAT TGTCGCATGT AGAAGTAACG GAAGAAGGAG CAAAATCGAT TGGTAAAAAA CCGGGAAATT ATTTAACCAT TGAAGCACAA GGAATTCGTG AACATAATAC GGAGTTGCAA CAGAAAGTAC AAGATATATT TGCAAAAGAG TTCAACGCAT TTTTACGAAA ATTGGACATT AGAAAAGAAT CAAGCTGCCT TGTTGTAGGA TTAGGTAATT CAAATGTGAC ACCGGATGCG TTAGGTCCGC TAACGGTGGA AAATCTACTT ATTACAAGAC ATTTATTTCA TCTTCAGCCG GAGAGCGTAG AAGAAGGATT TCGTCCAGTA AGCGCGATTG CTCCAGGTGT GATGGGGACG ACAGGGATTG AAACAAGCGA TATTATTCAC GGAATTGTGG AAAAAACAAA ACCAGATTTT GTGATCGTCA TTGATGCGTT AGCGGCACGA TCGATTGAAC GGGTCAACGC AACGATTCAA ATTTCTGATA CGGGAATTCA TCCAGGTTCA GGTGTAGGGA ATAAACGAAA AGAATTAAGT AAAGAAACGT TAGGTATTCC TGTTATTTCT ATCGGTGTTC CAACGGTTGT CGATGCTGTA TCGATTACAA GTGATACGAT CGATTTTATT TTAAAACATT TTGGAAGGGA AATGCGTGAA GGGAAGCGGC CGTCTAGTGC TCTCGCTCCA GCGGGATGGA CGTTTGGCAA GAAAAAGAGG CTTACAGAGG AAGATATGCC ATCAACGGAG CAACGTTCGA CATTTCTTGG CATAATCGGC ACATTAGAGG AAGAAGAAAA ACGCAGACTC ATTTATGAAG TGCTTTCTCC ACTCGGTCAT AATTTAATGG TTACTCCGAA AGAAGTCGAT ATGTTTATTG AAGATATGGC AAATTTATTA GCGAGCGGAT TAAATGCTGC ATTGCATGAA CAAATTGATC AAGATAATAC CGGTTCTTAT ACACATTAA
|
Protein sequence | MNRSIDLSMY SVRTDLAIEA HEIAVEERLQ QKRESASPIE GVIIHDREID GIKLSHVEVT EEGAKSIGKK PGNYLTIEAQ GIREHNTELQ QKVQDIFAKE FNAFLRKLDI RKESSCLVVG LGNSNVTPDA LGPLTVENLL ITRHLFHLQP ESVEEGFRPV SAIAPGVMGT TGIETSDIIH GIVEKTKPDF VIVIDALAAR SIERVNATIQ ISDTGIHPGS GVGNKRKELS KETLGIPVIS IGVPTVVDAV SITSDTIDFI LKHFGREMRE GKRPSSALAP AGWTFGKKKR LTEEDMPSTE QRSTFLGIIG TLEEEEKRRL IYEVLSPLGH NLMVTPKEVD MFIEDMANLL ASGLNAALHE QIDQDNTGSY TH
|
| |