Gene Information |
Locus tag | GWCH70_3377 |
Symbol | |
ID | 7977133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3403153 |
End bp | 3404772 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644800144 |
Product | peptidase M20 |
Protein accession | YP_002951283 |
Protein GI | 239828659 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4187] Arginine degradation protein (predicted deacylase) |
TIGRFAM ID | |
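The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3403153
end_bp = 3404772
reported_gene_length = 1620     # bp
reported_protein_length = 539   # aa

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced | No"),
# so every three bases form one codon and the last codon is the stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1620 539
```

Under these assumptions the computed values agree with the reported gene length and protein length.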
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.106714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATGGC AAACGACGCA ACAATTAAAG CAATTGCTTT GTCGTTTAGT CGAATATCCA AGCATCAGCG GAACGGAAGC AGAGGTGTTG CTTGCGCAGT ACATAGCGGA ACAGTTGCTT ACACTTGATT ATTTCCAAAG AAATAATGAG TTTGTGCAAT TGCATCCGAC AGGAGACGGC CGTTATTTTG TTACCGCGCT GGTGAAAAAA GCGGAGCAAG TGCGCGATAC CGTCATTCTC ATCAGCCATT TTGATGTGGT CGATGTGCAA GATTACGGGG CATGGAAGGA CGCCGCGTTT TCTCCCGAAA AGCTGACGGA GCGGTTTTAC GAACAGAAAC AACAGCTTCC TTCTGATGTG CAAGCTGATA TGGAAGAAGG GGAATGGTTA TTTGGCCGCG GCGTGATGGA TATGAAGTGC GGTCTTGCTC TTCATATGTC GCTGATCGAG CAAGCATGCC AAGGAAAGTT TGAAGGAAAT TTACTGCTGC TTACGGTACC GGATGAGGAA GTGAGCTCGG TAGGAATGCG TGCGGCCGTG CCGATCCTTG TCGAAATGGC AGAAAAATAC GGATTAACCT ATCGGCTTGT GCTTAATTCC GAACCGATGT TTACTCGCTA TCCAGGGGAC AAAGCGAATT ACATTTACAC CGGCTCGATT GGAAAAGTGC TGCCGGGCTT TTATTGTTAC GGAAAAGAAA CGCATGTCGG AGAACCGTTC GCAGGGCTAA ATGCCAATTT CATGGTGGCG CAAATCGCTA ATGAATTAGA ATTTAACACT GATTTTTGCG AAGTGTTTAG TGGGGAAGTC AGTCCGCCGC CGACAAACTT ATTGCAGACC GATTTAAAAG AAGAATATTC TGTGCAAATA CCGCATCGCG CGGTAACGTT ATTTAATTTA TTTTTGCAGA AAAGGTCGCT CGATGATGTG ACAAACTCGT TAATCGCAAT CGCAAAGCGG GCAGCAAAAC GCATCGAAGA GCGTTATAAC GTTGAGGCGT CCCGCTTCGC GAAACTCGAA AAATGGACAC CGAAACCGCT GTCTGTCAAC GTATTCACCT TTTCAGAGTT AAGAAAGAAA GCAGTGGAGA TGGTTGGATT AGAAAGAATC GAACAGCTTG AAGCAAGCGT TCTATCAACG GAAACAGCGA AAGACGAGCG GGAGAAAACG ATCAAACTAG TCGACCGGCT TGCTATACTT TGTAAAGATT TCGCGCCGAT GATCGTTCTT TTTTATGCTC CTCCTTATTA TCCGGCTGTC AACGCCAGCA GCGATCCGCT CGTTCAGCGT CTTGTTGCGA AGCTGCAAAA ATACGCGCAA GAAAAGCATG GTATTTCTCT TGTGCAGCAA CATTACTTCG GCGGAATTTC CGACTTAAGC TATGTCGGCT TGCAGCAATC TTCATCTTCT TTACGAGCTC TTACCGATAA TATGCCGATA TGGAATCGCG GCTATCATTT GCCGTTTGAT GCGCTGGCAA AATTTCAAGT TCCGGTGCTC AATGTCGGTC CGATCGGACG CGACGCCCAT CAATGGACGG AACGCCTCAA CGTTCGTTTT GCGTTTACGA CGGTGAAATC GTGGCTGGAG TATACAATTA ACGAAGTATT TGCAAGATAG
Protein sequence | MKWQTTQQLK QLLCRLVEYP SISGTEAEVL LAQYIAEQLL TLDYFQRNNE FVQLHPTGDG RYFVTALVKK AEQVRDTVIL ISHFDVVDVQ DYGAWKDAAF SPEKLTERFY EQKQQLPSDV QADMEEGEWL FGRGVMDMKC GLALHMSLIE QACQGKFEGN LLLLTVPDEE VSSVGMRAAV PILVEMAEKY GLTYRLVLNS EPMFTRYPGD KANYIYTGSI GKVLPGFYCY GKETHVGEPF AGLNANFMVA QIANELEFNT DFCEVFSGEV SPPPTNLLQT DLKEEYSVQI PHRAVTLFNL FLQKRSLDDV TNSLIAIAKR AAKRIEERYN VEASRFAKLE KWTPKPLSVN VFTFSELRKK AVEMVGLERI EQLEASVLST ETAKDEREKT IKLVDRLAIL CKDFAPMIVL FYAPPYYPAV NASSDPLVQR LVAKLQKYAQ EKHGISLVQQ HYFGGISDLS YVGLQQSSSS LRALTDNMPI WNRGYHLPFD ALAKFQVPVL NVGPIGRDAH QWTERLNVRF AFTTVKSWLE YTINEVFAR