Gene Information |
Locus tag | GWCH70_2483 |
Symbol | |
ID | 7979038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2509134 |
End bp | 2510402 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644799284 |
Product | peptidase U32 |
Protein accession | YP_002950444 |
Protein GI | 239827820 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
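The coordinates, gene length, and protein length listed above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for GWCH70_2483.
length = gene_length_bp(2509134, 2510402)
print(length)                     # 1269
print(protein_length_aa(length))  # 422
```

Both results match the Gene Length and Protein Length fields of this record.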
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTTAA AAAACGATAA AATTTCCGAG GTCATTAACG GCAAGCGCGT GATTGTGAAG AAGCCCGAGC TTCTCGCTCC AGCCGGCAAC CTAGAAAAGC TAAAAATCGC CGTGCATTAT GGAGCAGATG CGGTATTTAT CGGCGGACAA GAATATAGCT TACGCGCCAA CGCCGACAAT TTTACGCTTG AAGAAATTGC GGAAGGAGTG CGCTTTGCAA ACCAATATGG CGCAAAAGTG TATGTGACAG CGAACATTTA TGCACATAAC GAAAACATTC CCGGGCTCGA AGAATATTTG CAGGCGCTAG AACAAGCAGG TGTTCACGGC ATTATTGTCG CTGATCCGCT TATTATCGAA ACGGCGCGTC GGGTGGCGCC GAAATTAGAA GTGCACTTGA GTACGCAGCA GTCGATGGCA AACTGGAAAG CAGTTCAATT TTGGAAAGAA GAAGGATTGG AACGCGTGGT GCTCGCACGC GAAACGACCG CGGAAGAAAT TCGGGAAATT AAAGAAAAAG TCGATATTGA AATTGAGGCG TTTATTCATG GGGCGATGTG TTCCGCGTAC TCCGGCCGCT GTGTATTAAG CAACCATATG ACAGCACGCG ACTCCAACCG CGGGGGATGC TGTCAATCAT GCCGCTGGGA TTACGATTTA TACCAATTAG AAGGGGATAA AGAAATACCG TTGTTTGATG AAAATGATGC ACCGTTCGCG ATGAGCGCAA AAGATTTGAA TTTAATTCGC GCGATTCCAA CGATGATTGA ATTAGGCGTT GACAGCTTGA AAATCGAGGG GCGGATGAAA TCGATCCATT ATGTGGCGAC AGTCGTGAGT GTTTATCGCA AAGTAATCGA TGCCTATTGC GCCGACCCAG ACAATTTTGT CATTCGCGAA GAATGGATAA AAGAGTTGGA TAAATGTGCC AACCGCGATA CGGCTCCATC TTTCTTTGAA GGAATGCCGG GCTATACAGA TCATATGTAC GGTTCTCATA GCCGAAAAAC AAGCCATGAA TTTGCCGGTC TTGTACTGGA TTATGATAAA GAAACGAAGA TCGTCACATT ACAACAACGC AACTTTTTTA AACCGGGAGA TGAAGTCGAA TTTTTTGGAC CGGAAATTGA AAACTTCACA CAAGTAGTGG AAAAAATTTG GGACGAAGAT GGAAACGAGT TAGATGCTGC ACGCCATCCA TTGCAAATTG TCAAGTTTAA AGTAGAGCGA GAAGTCTTCC CATACAACAT GATGAGAAAG GAGATCTAA
Protein sequence | MLLKNDKISE VINGKRVIVK KPELLAPAGN LEKLKIAVHY GADAVFIGGQ EYSLRANADN FTLEEIAEGV RFANQYGAKV YVTANIYAHN ENIPGLEEYL QALEQAGVHG IIVADPLIIE TARRVAPKLE VHLSTQQSMA NWKAVQFWKE EGLERVVLAR ETTAEEIREI KEKVDIEIEA FIHGAMCSAY SGRCVLSNHM TARDSNRGGC CQSCRWDYDL YQLEGDKEIP LFDENDAPFA MSAKDLNLIR AIPTMIELGV DSLKIEGRMK SIHYVATVVS VYRKVIDAYC ADPDNFVIRE EWIKELDKCA NRDTAPSFFE GMPGYTDHMY GSHSRKTSHE FAGLVLDYDK ETKIVTLQQR NFFKPGDEVE FFGPEIENFT QVVEKIWDED GNELDAARHP LQIVKFKVER EVFPYNMMRK EI
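The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a string might be cleaned up and its GC percentage computed follows. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three display blocks of the gene sequence above, as a small example.
prefix = clean_sequence("ATGCTGTTAA AAAACGATAA AATTTCCGAG")
print(len(prefix))         # 30
print(gc_percent(prefix))  # 30.0
```

Passing the complete gene sequence through the same two functions yields a length and GC percentage that can be compared with the Gene Length and GC content fields of the record.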