Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2484 |
Symbol | |
ID | 7979039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2510430 |
End bp | 2511359 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799285 |
Product | peptidase U32 |
Protein accession | YP_002950445 |
Protein GI | 239827821 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.489664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC CAGAATTATT AGTGACGCCG ACGAGCGTTT CTCATATACA TGATTTAGCC AATGCCGGCG CTGATGCGGT GATGATCGGC CAGCAGCGTT ACGGTCTGCG CCTAGCGGGA GAGTTTTCCC GTGCGGATGT GGCAGAGGCG GTGAAGATTG CCCGTTCATA TGGCATGAAA GTATATGTTG CGATGAATGC AATTTTCCAT AACGACAAGG TGGACGAACT TGGTGATTAT ATAAAGTTTC TCTCTGATAC AGGCGTGGAT GCGATTGTTT TTGGCGACCC GGCGGTTTTA ATGACTGTAC GTGAAGTGGC CCCTCATATG AAATTGCATT GGAATCCAGA AACGACGGCA ACAAACTGGT ATACGTGCAA TTATTGGGGA CGGAAAGGAG CGAAACGTGC TGTCCTCGCC CGGGAATTAA ATATGGATGC GATTTTAGAA ATTAAAGAGC ACGCCGAAGT AGAAATTGAA GTGCAAGTGC ACGGGATGAC TTGCATGTAT CAATCTAAAC GCTCTTTAAT TGGAAACTAT TTCGAATATC AAGGAAAAGT AATGGAAATC GAACGAAAAA AATATGAAAA AGGCATGTTT TTGTATGATA AAGAGCGCGA TAGTAAATAC CCGATTTTTG AAGACGAAAA CGGCACACAT ATTATGAGCC CGAACGATAT TTGCATTATT GATGAGCTTA GTGAGATGGT GGATGCAGGC ATCGACAGCT TCAAAATTGA TGGAGTATTG CATCAGCCAG ACTATATTAC GGAAGTGACA AAGCTATATC GCCGCGCCAT TGATTTATGT GCCGAAAATC GGAAGCAATA CGAGGAAGAA AAGGATGAGC TGCTGGCGAA AATCGAAGCG ATCCAGCCAA AACATCGCCC GTTGGATACC GGATTTTTCT TTAAAGAAAC GGTTTACTAA
|
Protein sequence | MKKPELLVTP TSVSHIHDLA NAGADAVMIG QQRYGLRLAG EFSRADVAEA VKIARSYGMK VYVAMNAIFH NDKVDELGDY IKFLSDTGVD AIVFGDPAVL MTVREVAPHM KLHWNPETTA TNWYTCNYWG RKGAKRAVLA RELNMDAILE IKEHAEVEIE VQVHGMTCMY QSKRSLIGNY FEYQGKVMEI ERKKYEKGMF LYDKERDSKY PIFEDENGTH IMSPNDICII DELSEMVDAG IDSFKIDGVL HQPDYITEVT KLYRRAIDLC AENRKQYEEE KDELLAKIEA IQPKHRPLDT GFFFKETVY
|
| |