Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ajs_3271 |
Symbol | |
ID | 4673502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax sp. JS42 |
Kingdom | Bacteria |
Replicon accession | NC_008782 |
Strand | - |
Start bp | 3456428 |
End bp | 3457858 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639840311 |
Product | protease Do |
Protein accession | YP_987470 |
Protein GI | 121595574 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.168434 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATCG CCTTGGCAAT GGGCGGCGCA GGCTTGGTGG TGGCGCCCGC TTCGGTGCAG GCCCAGCCTG CTGCCGTGGT GCGTGGCTTG CCTGACTTCA CCGAACTGGT CGAGCAGGTG GGACCGTCCG TGGTGAACAT TCGCACAACC GAGAAGGTGA CGGCGAGGCC CTCCATCGGA GGCATGGATG AAGACATGCT GGAGTTCTTC CGGCGTTTTG GTGTCCCGGT GCCCAACATT CCACGCCAGC AGGGGCCGCA GCGTCCGCAG CCCGAAGAGC AGCCGCGTGG CGTGGGGTCG GGGTTCATTC TGAGTCCTGA CGGCTATGTC ATGACCAACG CCCACGTGGT GGAGGGGGCG GACGAAGTGA TTGTGACGCT GACCGACAAG CGGGAGTTCA AGGCCAAGAT CATCGGCTCG GACAAGCGCA CCGACGTTGC CGTCGTCAAG ATCGACGCCA CGGGACTGCC CGCGGTAAAG GTGGGTGACG TGGGCCGTCT TAAGGCGGGT GAATGGGTCA TGGCAATCGG CTCTCCCTTT GGCCTGGAAA ACACCGTGAC CGCCGGCATC GTAAGTGCCA AGCAGCGCGA TACGGGCGAC TATCTGCCCT TCATCCAGAC GGACGTGGCC ATCAATCCCG GTAACTCCGG CGGCCCTCTG ATCAACATGC GGGGTGAGGT GGTGGGCATC AACAGCCAGA TCTACTCCCG CTCCGGCGGA TTCATGGGTA TCTCGTTCGC CATTCCCATT GACGAGGCCA TGCGCGTGAG CGAGCAACTG CGTGTGAGCG GTCGCGTGAG CCGTGGCCGC ATTGGTGTTC AGATTGGCTC GGTTCCCAAG GATGTGGCGG AGTCCATCGG GCTTGGCAAA ACCGACGGCG CGCTGGTGCG CGGCGTGGAG ACTGGATCCC CCGCGGAGAA GGCTGGCATC GAGGCGGGCG ACGTGATCAC GCGCTATGAC GGCAAGGCCG TCGAGAAGGC GTCTGACCTT CCCCGCTTGG TGGGCAACAC CAAGCCAGGC ACCAAGACCC ATATCACGGT GTTCCGCCGT GGCGCATTGC GCGACCTGTC GATCACCATC GCCGAGGTGG AGCCTGATGA AAAGGTGGCA GCCAAGGCAG CCGAAGCAGA AGGCAAGGGC AAGTCCTCGA CAGCGGCTCA GCAGATCGGC TTGGTCGTTG CCGACCTGAC CGCTGCGCAG GCGAAGGAAC TGAAGGTGAA GGGAGGGGTG CGCGTGGTCT CTGCCAACGA CGCGGCGGCG CGTGCCGGTC TGCGGGCAGA CGATGTGATC ATCGCGCTGG CTAACACCGA AGTCCGTAAC CTCAAGGATT TCGAAGCGGT ACTAGCGAAG GCTGACATGG GGAAACCCAT CAACGTGCTT TTCCGTCGCG GCGAATGGGC GCAATATGCA CTGATCCGCC CGAACCGCTA G
|
Protein sequence | MAIALAMGGA GLVVAPASVQ AQPAAVVRGL PDFTELVEQV GPSVVNIRTT EKVTARPSIG GMDEDMLEFF RRFGVPVPNI PRQQGPQRPQ PEEQPRGVGS GFILSPDGYV MTNAHVVEGA DEVIVTLTDK REFKAKIIGS DKRTDVAVVK IDATGLPAVK VGDVGRLKAG EWVMAIGSPF GLENTVTAGI VSAKQRDTGD YLPFIQTDVA INPGNSGGPL INMRGEVVGI NSQIYSRSGG FMGISFAIPI DEAMRVSEQL RVSGRVSRGR IGVQIGSVPK DVAESIGLGK TDGALVRGVE TGSPAEKAGI EAGDVITRYD GKAVEKASDL PRLVGNTKPG TKTHITVFRR GALRDLSITI AEVEPDEKVA AKAAEAEGKG KSSTAAQQIG LVVADLTAAQ AKELKVKGGV RVVSANDAAA RAGLRADDVI IALANTEVRN LKDFEAVLAK ADMGKPINVL FRRGEWAQYA LIRPNR
|
| |