Gene Pnap_4868 details


Gene Information

Locus tag: Pnap_4868
Symbol:
ID: 4685679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008759
Strand:
Start bp: 43431
End bp: 44705
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 67%
IMG OID: 639826510
Product: di-haem cytochrome c peroxidase
Protein accession: YP_973674
Protein GI: 121583238
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
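
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 43431
END_BP = 44705
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed start, end, gene length, and protein length agree with one another.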


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGC GCGTCATCGC GTTGGCGGTC GCGGCGGCCG CTCTTGCCGT TACCGCCCTG 
GCCGCGCAGG ATTTGAAGAA GGCGCCGCTG CGGGACAGGT GGTCAGTTCA GGAAGTCACT
GCCCTCGCCT CGATGCGTTT GAAAGAGGCT GGCCAGCGGC CCGCTGATGC GTCGAACGCC
TACGAGCAGC GCGCCGAGGC GGCCGCGCTT GGGCGTGCAC TGTTCAATGA CACCCGGCTC
AGCAAGAACG GCCAGGTCGC CTGCGCCAGC TGCCACGCGG CCGACAAGCA GTTCGAGGAC
GGGCGTCAGT TCGGCCAGGG AATTGCCACC GGCAAGCGCC GGACCATGCC GGTCATGGGC
GCTGCGCACG CCCCCTTCCT GTTCTGGGAC GGGCGCAAGG ACAGTGCCTG GTCGCAGGCA
CTGGGGCCAC TCGAAGACGC GGCAGAGCAC GGCGGCAACC GCGTCCGCTT GGTCCGACTG
GTGCTGGCGC AGTACAAGGA CCCGTATGGC AAGGTGTTCG GCGCGGTGCC CGAAGTCGGC
GAACTGCCCG GCGATGCGTC TCCCAACGGA ACGCAGGCCG AACGCGCCGC CTGGGCCGCG
CTTGCGCCGG CGACCCGGAA CAGCGTCAAC CGCGTCTTTG CGAACATGGG CAAGGCCATC
GCGGCCTATG AACGACTCGT TTCCTATGGT GAATCGCGTT TCGACCGGTA CGCCCAGGCT
ACTGTCGCTG GCGATGGGCC AGGCCAGGAT GCGCTCACCG GGCAGGAAGT GCGGGGATTG
CGCCTGTTCC TGACCAAGGG GCAGTGTGTG ACCTGCCACA ACGGGCCGCT GCTCACGGAC
CATGCCTTTC ACAACACAGG CGTTCCACCG CTGGAGCCGG CCAACCCGGA CCGCGGTCGC
GCCGAAGGGC TCAAAAAGCT CCTGGCCGAC GAATTCAATT GCCTGGGCCG CTACAGTGAC
GCCAAACCGG AGCAATGCGG TGAACTGCAG TTCCTGTCAG CGAACGACAC GGCTCAGCTC
GGCGCGTTCC GCACACCAAG CCTGCGCAAC GTGGCGGTCC GGCCGCCCTA CATGCATGCC
GGCCAGTTCT CGACCCTCGA TGCGGTGGTG CAGCACTACG CCGCTTCGCC CCAAGCGGCC
ATCGGCCACA GCGAACTGGC GCAGCCCGGT GAAAACCACG CGCAGCGGCA AAGCATCCGG
CTTTCCGCCG ACGACATCAA GGACCTGGCC GCGTTCCTGG GCACGCTCAC CGGCCCGGTC
CATCAGCCCA GGTGA
 
Protein sequence
MNQRVIALAV AAAALAVTAL AAQDLKKAPL RDRWSVQEVT ALASMRLKEA GQRPADASNA 
YEQRAEAAAL GRALFNDTRL SKNGQVACAS CHAADKQFED GRQFGQGIAT GKRRTMPVMG
AAHAPFLFWD GRKDSAWSQA LGPLEDAAEH GGNRVRLVRL VLAQYKDPYG KVFGAVPEVG
ELPGDASPNG TQAERAAWAA LAPATRNSVN RVFANMGKAI AAYERLVSYG ESRFDRYAQA
TVAGDGPGQD ALTGQEVRGL RLFLTKGQCV TCHNGPLLTD HAFHNTGVPP LEPANPDRGR
AEGLKKLLAD EFNCLGRYSD AKPEQCGELQ FLSANDTAQL GAFRTPSLRN VAVRPPYMHA
GQFSTLDAVV QHYAASPQAA IGHSELAQPG ENHAQRQSIR LSADDIKDLA AFLGTLTGPV
HQPR