Gene Pnap_3433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3433 
Symbol 
ID4689170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3642518 
End bp3643555 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content69% 
IMG OID639836447 
Productphage integrase family protein 
Protein accessionYP_983651 
Protein GI121606322 
COG category[L] Replication, recombination and repair 
COG ID[COG4973] Site-specific recombinase XerC 
TIGRFAM ID[TIGR02224] tyrosine recombinase XerC 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCG TGCCGCCCGC GCAAGACAGT GCGCCCGAAG AAGCCGACCC GCTGGTCACA 
CGCTACCTCG CCTATGTCCG GGTCGAAAAG CGGCTGGCCA GTCGTACCGT AGAGCTGTAC
ATGGCCGACC TGGAAAAACT GCAGGCGAAT GCGCGCAAGG CGGGCGTGGC GCTGACGGAC
GTGAAGCACA CGCACCTGCG GCGCTGGGTG GCGCAGATGC ACAGCCGTGG GCGCAGCGGG
CGCGGCATTG CCTTGATCCT GTCGGGCTGG CGCGGCTTTT ACACCTGGCT GGGCCGCGAA
GGACTGGTGC CGGGCAACCC GGTGCAGGAC GTGCGCTCGC CCAAGATCGC CAAGCCGCTG
CCCAAGGCCC TAAGCGTCGA TGAAGCGGTG CAGCTGGCCA GCTTCGAGCT TGAGGCCACG
GACGGCGATC CTTTCCTCGA AGCGCGCGAC CAGTGCATCA CCGAGCTGCT CTACGGCTGC
GGACTGCGCA TCAGCGAGCT GGTCGGCCTG GATGCGCAGG CCAGCGGCAA GGCGCGTGGC
TGGATCGACA TGCAAGGCGG CGATGCGCAT GTGCTCGGCA AGGGCGAGAA GCGGCGCAGC
GTGCCGGTCG GCGGCGCGGC GCTGAAGTCG CTGCACAAGT GGCTGGCGGC CCGCAGCCTG
TGGGCGCGGC CCGGCGCTGA CGGCGTTCAG GGCGGCGAGG CGCTGTTCAT CAACCAGCGC
GGCAGCCGGC TCACGCCGCA GCACATCCGC GTGCGCCTCA AGCAGCGCTC GCAGCAGGCT
GGGCTGGCCA CGCCGGTGCA TCCGCACATG CTGCGCCACT CGTTTGCCAG CCATGTGCTG
CAGTCCAGCG GCGACCTGCG CGCGGTGCAG GAGTTGCTGG GCCACGCCAG CATCACGACC
ACGCAGGCCT ACACGCGGCT CGATTTCCAG CACCTGGCCA AGATTTACGA CGCGGCGCAT
CCGAGGGCGA TGTCGGACGG TGCCGGCTCG AAGGCCAAGG CGCCGCCAGA GGTTCAGGAG
CCGCGCAAGC CGAAGTAG
 
Protein sequence
MKPVPPAQDS APEEADPLVT RYLAYVRVEK RLASRTVELY MADLEKLQAN ARKAGVALTD 
VKHTHLRRWV AQMHSRGRSG RGIALILSGW RGFYTWLGRE GLVPGNPVQD VRSPKIAKPL
PKALSVDEAV QLASFELEAT DGDPFLEARD QCITELLYGC GLRISELVGL DAQASGKARG
WIDMQGGDAH VLGKGEKRRS VPVGGAALKS LHKWLAARSL WARPGADGVQ GGEALFINQR
GSRLTPQHIR VRLKQRSQQA GLATPVHPHM LRHSFASHVL QSSGDLRAVQ ELLGHASITT
TQAYTRLDFQ HLAKIYDAAH PRAMSDGAGS KAKAPPEVQE PRKPK