Gene Ajs_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_3271 
Symbol 
ID4673502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp3456428 
End bp3457858 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content63% 
IMG OID639840311 
Productprotease Do 
Protein accessionYP_987470 
Protein GI121595574 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.168434 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCG CCTTGGCAAT GGGCGGCGCA GGCTTGGTGG TGGCGCCCGC TTCGGTGCAG 
GCCCAGCCTG CTGCCGTGGT GCGTGGCTTG CCTGACTTCA CCGAACTGGT CGAGCAGGTG
GGACCGTCCG TGGTGAACAT TCGCACAACC GAGAAGGTGA CGGCGAGGCC CTCCATCGGA
GGCATGGATG AAGACATGCT GGAGTTCTTC CGGCGTTTTG GTGTCCCGGT GCCCAACATT
CCACGCCAGC AGGGGCCGCA GCGTCCGCAG CCCGAAGAGC AGCCGCGTGG CGTGGGGTCG
GGGTTCATTC TGAGTCCTGA CGGCTATGTC ATGACCAACG CCCACGTGGT GGAGGGGGCG
GACGAAGTGA TTGTGACGCT GACCGACAAG CGGGAGTTCA AGGCCAAGAT CATCGGCTCG
GACAAGCGCA CCGACGTTGC CGTCGTCAAG ATCGACGCCA CGGGACTGCC CGCGGTAAAG
GTGGGTGACG TGGGCCGTCT TAAGGCGGGT GAATGGGTCA TGGCAATCGG CTCTCCCTTT
GGCCTGGAAA ACACCGTGAC CGCCGGCATC GTAAGTGCCA AGCAGCGCGA TACGGGCGAC
TATCTGCCCT TCATCCAGAC GGACGTGGCC ATCAATCCCG GTAACTCCGG CGGCCCTCTG
ATCAACATGC GGGGTGAGGT GGTGGGCATC AACAGCCAGA TCTACTCCCG CTCCGGCGGA
TTCATGGGTA TCTCGTTCGC CATTCCCATT GACGAGGCCA TGCGCGTGAG CGAGCAACTG
CGTGTGAGCG GTCGCGTGAG CCGTGGCCGC ATTGGTGTTC AGATTGGCTC GGTTCCCAAG
GATGTGGCGG AGTCCATCGG GCTTGGCAAA ACCGACGGCG CGCTGGTGCG CGGCGTGGAG
ACTGGATCCC CCGCGGAGAA GGCTGGCATC GAGGCGGGCG ACGTGATCAC GCGCTATGAC
GGCAAGGCCG TCGAGAAGGC GTCTGACCTT CCCCGCTTGG TGGGCAACAC CAAGCCAGGC
ACCAAGACCC ATATCACGGT GTTCCGCCGT GGCGCATTGC GCGACCTGTC GATCACCATC
GCCGAGGTGG AGCCTGATGA AAAGGTGGCA GCCAAGGCAG CCGAAGCAGA AGGCAAGGGC
AAGTCCTCGA CAGCGGCTCA GCAGATCGGC TTGGTCGTTG CCGACCTGAC CGCTGCGCAG
GCGAAGGAAC TGAAGGTGAA GGGAGGGGTG CGCGTGGTCT CTGCCAACGA CGCGGCGGCG
CGTGCCGGTC TGCGGGCAGA CGATGTGATC ATCGCGCTGG CTAACACCGA AGTCCGTAAC
CTCAAGGATT TCGAAGCGGT ACTAGCGAAG GCTGACATGG GGAAACCCAT CAACGTGCTT
TTCCGTCGCG GCGAATGGGC GCAATATGCA CTGATCCGCC CGAACCGCTA G
 
Protein sequence
MAIALAMGGA GLVVAPASVQ AQPAAVVRGL PDFTELVEQV GPSVVNIRTT EKVTARPSIG 
GMDEDMLEFF RRFGVPVPNI PRQQGPQRPQ PEEQPRGVGS GFILSPDGYV MTNAHVVEGA
DEVIVTLTDK REFKAKIIGS DKRTDVAVVK IDATGLPAVK VGDVGRLKAG EWVMAIGSPF
GLENTVTAGI VSAKQRDTGD YLPFIQTDVA INPGNSGGPL INMRGEVVGI NSQIYSRSGG
FMGISFAIPI DEAMRVSEQL RVSGRVSRGR IGVQIGSVPK DVAESIGLGK TDGALVRGVE
TGSPAEKAGI EAGDVITRYD GKAVEKASDL PRLVGNTKPG TKTHITVFRR GALRDLSITI
AEVEPDEKVA AKAAEAEGKG KSSTAAQQIG LVVADLTAAQ AKELKVKGGV RVVSANDAAA
RAGLRADDVI IALANTEVRN LKDFEAVLAK ADMGKPINVL FRRGEWAQYA LIRPNR