Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_1641 |
Symbol | |
ID | 7387428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 1371367 |
End bp | 1373079 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643650977 |
Product | Subtilisin-like serine protease protein |
Protein accession | YP_002549182 |
Protein GI | 222148225 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCACGCC GACCTCACCT GACACTTCGC CGCCTTGAAG GCCAACTCGA ACGACGCAAG AAACCTGGCT TCGGTAACGC GCCACCAAGA AATCATGTCT TGCATGGTGG CGAACTGCGT CTTCAGCTCG CACGGATCGT TGAGCGGAAC GCAGAGCGCG ATGCTCAACG CGTCGATGAA CATGTTGCCG ATCCCACGGT CATTCTGAAA ATCAATACGG ATGGATATAT CAGCGAAGAT GCCCTGACCG GTGTCGGGCT GCAAATCCTC GAACAGCGCT CGGATGACGT TACCGTGGCC CTATCAAAAG ATCCAAACCT AACTGTTCTG CGGGAAAGAT CACAGCAATA TGCAGGGCCC ATTCCCGTAA CCCAGGTCGG AGCTAGACAT GCTGGCCTCT TCGGAGCCAT AGACAGCTTT GCCGAGCTGT CTGCTAACGA CAAAATTGGC AGTGCCTTGG CTAGGCATGG TTTTGCAACG GTAGAGATGA TTCCGGAAGG CGATGTCTTT CTGATCGATG TCGAGCTCTG GGACGTCAAC GAAGATTTGC TACGTGACCT TTATGTTGAT CGTGTCGCCC GAAAGGCCGA TGAGTTCGGC GGCGAGCTCC TCAGCCGTTA TCGCGGTGCA GGGCTCTTTA TCGCGCGGGT GCGAGTTCCG GGAGCCGGGT TGAAAGAACT GCTGGCAATG ACTGAAGTTG CCTGGATCGA CCTGCCTCCA GTGCCTGACT TTGCCCCCGA CCCGGGAGCG AATCTGACGA CGACCGAACT TCCACCAATA AGCCCGCCAT CACCGAATGC TGTTTGTATT GGGATTATCG ATTCCGGAAT TACAGCGGCT CATCCAATGC TCGACGGCGT CATCGCTGGC GCATTCGGTG TACCTAATCG GCTCGGGAGT GATGACGAAA TACGCCATGG CACATCTGTA GCAGCTCTTG CGAGCTACGG ATCAATTGCC GGACAGATTT CCCAGAACAT TTTGGCACCT CGATTCCGTA TTGCCAGCGC CAAGGTTGTC AATGCGAATG GCCGTTTTGA CGAAGAACGC ACAGTCGCCG ATCTTGTTGA GGAAGCAATC CGGCGGCTTA ATGCAGAATA TAGTTGCCGG GTGATCAACA TATCACTCGC CGATATCGAA CACATCGTTG GAAGCCGCCC ATCGAACTGG GCGATGACAC TCGATAATCT GGTACGCGAG CTTGGCATCG TCATTACCGT TTCAGCAGGC AATATCCTGA GCATAAGCCA TCGCATCGCG GAGGAAGGTG TTGGCATTTA TCCTGAGTAT TTGCTTGAAG AAGAGCACAG ACTCTATGAG CCGGCCAGCT CAATGAACTC GCTTGTCATT GGATCCTTAG CTCACTCCAA TGGCCTCATG CCGGGAGAAG AGTTTGATGC AGACATTGTG GCTTTGACCG CCACACATCA TCCATCACCC TTCAGCCGAG GCGGTCCCGG CTTTGCCAAG AGCATCAAGC CAGATTTGGT CGAATATGGT GGCACTGCCG TATGGCAGGG ATTTTCATCC ACCTTGTCGG CTGATCGCGA CAGCTGTGGC ATTCTCACGC TCAACCCGAA TTATTTACAA AGCCTGATGG TTTATCGTCA CGGAACTTCC TTTGCTGCTC CCATTGCAGC CTATAAAGCG GCTGCGTGTT GGTTTCCACG CAGAACTGAG CCGGTTAGGC GGATAATTTC CATTGAGAAT TGA
|
Protein sequence | MARRPHLTLR RLEGQLERRK KPGFGNAPPR NHVLHGGELR LQLARIVERN AERDAQRVDE HVADPTVILK INTDGYISED ALTGVGLQIL EQRSDDVTVA LSKDPNLTVL RERSQQYAGP IPVTQVGARH AGLFGAIDSF AELSANDKIG SALARHGFAT VEMIPEGDVF LIDVELWDVN EDLLRDLYVD RVARKADEFG GELLSRYRGA GLFIARVRVP GAGLKELLAM TEVAWIDLPP VPDFAPDPGA NLTTTELPPI SPPSPNAVCI GIIDSGITAA HPMLDGVIAG AFGVPNRLGS DDEIRHGTSV AALASYGSIA GQISQNILAP RFRIASAKVV NANGRFDEER TVADLVEEAI RRLNAEYSCR VINISLADIE HIVGSRPSNW AMTLDNLVRE LGIVITVSAG NILSISHRIA EEGVGIYPEY LLEEEHRLYE PASSMNSLVI GSLAHSNGLM PGEEFDADIV ALTATHHPSP FSRGGPGFAK SIKPDLVEYG GTAVWQGFSS TLSADRDSCG ILTLNPNYLQ SLMVYRHGTS FAAPIAAYKA AACWFPRRTE PVRRIISIEN
|
| |