Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3002 |
Symbol | |
ID | 7977371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3021164 |
End bp | 3022510 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644799800 |
Product | Peptidase M23 |
Protein accession | YP_002950939 |
Protein GI | 239828315 |
COG category | [S] Function unknown |
COG ID | [COG3883] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00185035 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGCA GGAAAGTACT GGCGCTTGCC GCAGCTGCGG CGTTAAGTTT TGGTGTTTTC CCGCACTTTG CCAATGCGGT AAGCGACCGG GATATTCAAG AAAAACGGGA TGCGATTAAC AACATTCGTT CAAAGCAGTC CGCGGTACAA GAGAAAATAA ATGATGCTAA TCAAGAAATT CAAAAACTGC AATCCAAACA AAAAGAGCTC TCCGAAGAAA TTAAAAAACT CGATTTAGCA GTGGAAGAAA CGAGTGGAAA AATTCGCAAC TTATCCGCTG ATATTCAACA AACAGAACAA GATATTGAAA CATTGAAGCG TGAAATTGCC GAAGTGCAAG CGCGCATTGA AAAACGAAAT GAGATTTTGA AAGAGCGTGT TCGTTCTTTG CAAGAGAGCG GCGGAGTCAT CAGTTATCTT GAAGTATTGC TCGGAGCGCA AAGCTTTAGC GATTTTATTG ACCGCATGAG CGCTGTCACC ACCATTTTTG AGGCGGACAA ACAAATTATC CGCGAACAAC AAGCGGATAA AGCATTAAAA GAGAAAAAAG AAAAGGAACT TACGGATAAA CTAGCTAGTC TTCAAGCAAA CTTAAAAGAA CTAGAGCAGC TGAAACAAAA ATTAAGCGAG CAAATGAAGC AAAAGAATCA ATTAATGGCA CATTTGAAAC AAGAAGAACA AGAACACCAT GATAAAAAAA TGGCGCTTGC GGAAGAACAA GAATTGTTGA GAAAACAAGA AGCAGCGATG AAATATCAGC TTCAGCAGCT AATGGAGAAA AAGCGTGCTG AAGAAGAAGC GAAACGCAGA GCGGCTGCTA GACATACATA TTCATCAGGA AATGAAAGTG CGCCTTCCGA AAATGAAAAT ACGCCGTCCG GAAGTGAAAA TAGCAGCGGC GCATCTAGTT CCCGAGATAC ATCAAACGTT CCACCCGTTA CAAGCGGTGC GTTTATGAGA CCAGCGAATG GTCCGATTAC ATCAGGATTC GGTTACCGTT TTGGTGGAAG TGATTTTCAT CCAGGAATCG ATATTGGTAA AACTGCTGCT GTTGTTCCTG TTGTGGCAGC GGCTGACGGA TACGTGTTCC GTTCGTATTA TTCGAGCAGT TACGGAAACG TCATTTTCAT CACTCATGTG ATTAATGGAC AAGTATACAC AACGGTATAT GGCCACCTTG AAGCGCGTCT TGTCGGTGAA GGGCAAACGG TCCGCAAAGG ACAAGTCATC GGCTATATGG GGAACACAGG CCGCTCGACA GGGCCTCACC TTCACTTCGA ACTTCATCGC GGAGCGTGGA ATCTGGCAAA ATCGAATGCG GTAAATCCGC TTAATTATAT TAATTAA
|
Protein sequence | MKSRKVLALA AAAALSFGVF PHFANAVSDR DIQEKRDAIN NIRSKQSAVQ EKINDANQEI QKLQSKQKEL SEEIKKLDLA VEETSGKIRN LSADIQQTEQ DIETLKREIA EVQARIEKRN EILKERVRSL QESGGVISYL EVLLGAQSFS DFIDRMSAVT TIFEADKQII REQQADKALK EKKEKELTDK LASLQANLKE LEQLKQKLSE QMKQKNQLMA HLKQEEQEHH DKKMALAEEQ ELLRKQEAAM KYQLQQLMEK KRAEEEAKRR AAARHTYSSG NESAPSENEN TPSGSENSSG ASSSRDTSNV PPVTSGAFMR PANGPITSGF GYRFGGSDFH PGIDIGKTAA VVPVVAAADG YVFRSYYSSS YGNVIFITHV INGQVYTTVY GHLEARLVGE GQTVRKGQVI GYMGNTGRST GPHLHFELHR GAWNLAKSNA VNPLNYIN
|
| |