Gene Information |
Locus tag | Avi_0947 |
Symbol | sohB |
ID | 7387735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 795933 |
End bp | 796799 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643650463 |
Product | proteinase sohB |
Protein accession | YP_002548671 |
Protein GI | 222147714 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
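The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 795933
END_BP = 796799


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 867 288
```

Under these assumptions the computed values agree with the listed Gene Length (867 bp) and Protein Length (288 aa).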
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.328239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGGTC TATGGAAGCG GTGGGTGCCG AAGCGGTTTC GCAAGCAGGA AATCGTCATT CCAGTCGTGC GGCTGCATGG CGCGATCATG AGCGGCGGCA GCCGGTTTCG CCCGGCGCTG AACCTCGCGG CTATCGCACC ACTTCTGGAA AAAGCCTTCA AGTTGAAGGA CAGTCCGGCT GTGGTGCTGT CGCTCAATTC GCCAGGCGGC TCGCCGGTGC AATCGCGGAT GATCTTTCAG CGCATCCGGA CACTGGCCGA TGAACACAGC AAGACCGTGC TGGTGTTCGT GGAGGATGTG GCAGCCTCCG GCGGCTATAT GATCGCCCTT GCCGGCGATG AAATCATTGC CGACCCGACC TCGATTGTCG GCTCGATCGG CGTCGTGTCC GGCGGTTTCG GCTTTCCTGA GTTGCTGAAG AAGATCGGCG TCGAGCGGCG TGTCTATACA GCGGGCGAAA ACAAGGTGAT GCTCGATCCG TTCCAGCCGG AAAAGCAGAG CGATATCGAG TATCTCAAGA CCCTTCAGCT GGATATTCAC GATGTTTTCA TCGACATGGT CAAGACACGG CGCGGCATCC GGCTTAACGA CAATCCGGAG CTGTTTTCCG GCTTGTTCTG GACGGGCCGC AAGGGTTTCG AACTTGGTCT GGTGGACGGA CTGGGCAGTA TGCGTGAGGA GATCAAGGCG CGCTACGGCA AGACAGCCCG GCTGGAATTG ATTTCGGGTG CCCGCGGCCT GTTCGGCAGG CGTCTGTCTG GGGTTGATAC GGCGTTTTCT GCTCCCAGTG ATATCGGCAG CGCCGCCGCC GCCGGCCTTG TGGAGACGCT GGAAGATCGG GCGCTCTGGG CGCGCTATGG GCTTTAG
|
Protein sequence | MAGLWKRWVP KRFRKQEIVI PVVRLHGAIM SGGSRFRPAL NLAAIAPLLE KAFKLKDSPA VVLSLNSPGG SPVQSRMIFQ RIRTLADEHS KTVLVFVEDV AASGGYMIAL AGDEIIADPT SIVGSIGVVS GGFGFPELLK KIGVERRVYT AGENKVMLDP FQPEKQSDIE YLKTLQLDIH DVFIDMVKTR RGIRLNDNPE LFSGLFWTGR KGFELGLVDG LGSMREEIKA RYGKTARLEL ISGARGLFGR RLSGVDTAFS APSDIGSAAA AGLVETLEDR ALWARYGL
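The relationship between the gene sequence and the protein sequence can be spot-checked with a short translation routine. This is a minimal sketch, not the pipeline that produced the record. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon-to-amino-acid assignments, ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks and the final nine bases of the gene sequence above.
first_block = translate("ATGGCGGGTC TATGGAAGCG GTGGGTGCCG")
last_block = translate("GGGCTTTAG")
print(first_block, last_block)  # MAGLWKRWVP GL
```

The first ten residues match the start of the listed protein sequence, and the final codon TAG is a stop, consistent with the protein ending in GL. Passing the full gene sequence to `translate` should extend the same check to all 288 residues.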
|
| |