Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0861 |
Symbol | |
ID | 7977867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 926998 |
End bp | 928692 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644797832 |
Product | oligoendopeptidase, M3 family |
Protein accession | YP_002949005 |
Protein GI | 239826381 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02289] oligoendopeptidase, M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000179561 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAATTTT CTGAGTTTCG CTATGAACGG CCGAATATCG AGAAGCTGAA AACATCATTT CAACAAGCGC TGCAATCGTT CCAGAAGGCA AGCAGCGTAG AAGAACAGGA TGAGGCGATG AAGGAAATTA ATAAGCTTCG CAACGATTTC AGCACGATGG CGCAAATTTG CTATATTCGC CATACGATTG ATACGAACGA TGAGTTTTAT AAGCAGGAGC AAGATTTTTT CGATGAAGTT GAGCCAATTG TTAAAGGGCT TGTGAACGAT TATTACCGTG CGCTTGTTTC GTCTCCGTTC CGTTCGCAGC TTGAAGAAAA ATGGGGGAAA CAGTTATTTG CGCTCGCTGA GACGGAATTA AAAACGTATT CTCCAGACAT TGTGGAAGAT CTGCAGCTGG AAAATAAGCT GACGAGCGAA TATACAAAAT TGGTCGCTTC GGCAAAAATT TTCTTTGAAG GGGAAGAACG GACGTTAGCG CAGCTGCAGC CGTTTGTCGA ATCGCCGGAC CGCGAGATGC GCAAGCGGGC GAGTGAGGCG CGTTTTGCCT TTTTCCAAGA ACATGGGGAG AAGTTCGATG AAATTTACGA TCAGCTTGTG AAAGTGCGCA CCGCCATCGC GCAAAAGCTC GGATTTAAAA ACTTCGTGGA ACTCGGTTAT GCCCGTCTTG GACGGACTGA TTACAACGCC GAGATGGTCG CGAAATTCCG CAAACAAGTC GAAAAGCATA TCGTTCCAAT TGCTGTGAAG CTTCGCGAGC GGCAGCGGGC GCGCATCGGT GTCGAGAAGC TGAAATATTA TGATGAAGCG TTTGTTTTTC CAACCGGCAA TCCAACGCCA AAAGGGGACG CAAACTGGAT TATTGAAAAC GGGAAAAAGA TGTACGAAGA ACTGTCGCCG GAAACAGGCG AGTTTTTCCG GTATATGATC GATCATGAAC TAATGGACCT CGTTGCGAAA AAAGGAAAAG CAGGAGGCGG CTATTGCACC TATATCGAAA ACTATAAAGC GCCGTTTATT TTCTCGAACT TTACCGGCAC ATCCGGAGAC ATCGACGTGC TTACCCATGA AGCGGGGCAC GCATTTCAAG TGTACGAAAG CCGCCATTAC GAAATTCCGG AATACATCTG GCCGACTTTG GAGGCATGCG AAATTCATTC GATGAGCATG GAATTTTTCA CATGGCCGTG GATGAAGTTA TTCTTCGGGG AAGATGCGGA AAAATATAAA TTCTATCATT TGAGCGATGC TTTATTATTT TTGCCGTATG GCGTAGCGGT GGACGAATTT CAGCATTTCG TATACGAAAA TCCGAACGCA ACGCCGGCGG AACGGAAGCA AGCATGGCGG AAGATTGAAA AGAAATATAT GCCGACACGA GACTATGACG GCAATGATTA TTTAGAACGC GGCGGCTTTT GGCAGCGGCA AAGCCATATT TATACGAACG CATTCTACTA TATCGACTAT ACCCTTGCGC AAATTTGCGC GTTCCAATTT TGGAAACGTT CGCGTGAAAA CTATGAAGAG GCGTGGAATG ATTATTTAAC CTTATGCCGT CAAGGCGGAA GCAAGCCGTT TACCGAATTG GTCCGCGCCG CCAACCTCAT TTCGCCGTTT GAAGACGGAT GCGTTCAATC GGTAGTCGGC GAGATCGAGG GATGGTTGAA TAGTGTTGAC GATCAGAGCT TATAG
|
Protein sequence | MKFSEFRYER PNIEKLKTSF QQALQSFQKA SSVEEQDEAM KEINKLRNDF STMAQICYIR HTIDTNDEFY KQEQDFFDEV EPIVKGLVND YYRALVSSPF RSQLEEKWGK QLFALAETEL KTYSPDIVED LQLENKLTSE YTKLVASAKI FFEGEERTLA QLQPFVESPD REMRKRASEA RFAFFQEHGE KFDEIYDQLV KVRTAIAQKL GFKNFVELGY ARLGRTDYNA EMVAKFRKQV EKHIVPIAVK LRERQRARIG VEKLKYYDEA FVFPTGNPTP KGDANWIIEN GKKMYEELSP ETGEFFRYMI DHELMDLVAK KGKAGGGYCT YIENYKAPFI FSNFTGTSGD IDVLTHEAGH AFQVYESRHY EIPEYIWPTL EACEIHSMSM EFFTWPWMKL FFGEDAEKYK FYHLSDALLF LPYGVAVDEF QHFVYENPNA TPAERKQAWR KIEKKYMPTR DYDGNDYLER GGFWQRQSHI YTNAFYYIDY TLAQICAFQF WKRSRENYEE AWNDYLTLCR QGGSKPFTEL VRAANLISPF EDGCVQSVVG EIEGWLNSVD DQSL
|
| |