Gene Pnap_1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_1118 
Symbol 
ID4686994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp1191242 
End bp1192516 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content66% 
IMG OID639834122 
Productfumarylacetoacetase 
Protein accessionYP_981355 
Protein GI121604026 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCG CCATCGACGG CACCCACGAT GCGGCGCTGC AAAGCTGGGT CGAATCGGCC 
AACGATCCGG CCAGCGACTT CCCGATCCAG AACCTGCCGT TTGGCCGCTT TCGCGGTGAA
GATGACGCCG ACTGGCATGT CGGCGTCGCC ATTGGCGACC AGGTGCTGGA CCTGCACGCG
GCCCGGCTGA TCGCTAGCCG CGACATGAAC CGGCTCATGC GCCTGCCGCC CGAGACCCGG
CACATCTTGC GCCAGACCAT CTCGCAAGGC CTGCGCGCAG GCAGCGCCCA GCAAAAAACC
TTCCAGGCCG CGCTGCGCGC CCAGTCAAAA GTCGAACTCG GCCTGCCCTG CCAGATCGGC
GACTACACCG ATTTCTACAC CAGCGTGCAC CACGCCACCA CCATCGGCAA GCAGCTGCGC
CCCGACAACC CGCTGCTGCC CAACTACAAG TGGCTGCCCA TCGGCTACCA CGGACGGGCT
TCGAGCATCA TTCCCAGCGG CCAGGGCTTT CACCGGCCCA AGGGGCAGAC GAAAGCGCCC
GGCCAGGACG CGCCGACCTT TGGCCCCTGC CGCCGACTGG ATTACGAGCT GGAGCTGGGC
ATGCTGATTG CACGCCCCAA CGTATTGGGC GAGCCCATCG CCATGGCGGA TGCGGAGTCG
CACGTCTTCG GCGTCAGCCT ATTCAACGAC TGGAGCGCCC GCGACATCCA GGCCTGGGAA
TACCAGCCGC TCGGGCCGTT TTTGTCGAAG AACTTCGCCA GCACGCTCTC GCCCTGGATG
GTGACGCTCG ATGCGCTGGC GCCGTTTCGC GCGCCCTTCA CCCGGCCAGA AGGCGACCCT
GAACCGCTCG AATACCTTGA TTCGCCCGCC AACCGGGCGC AAGGCGCTAT CAATATTGAA
CTGGAGGTGT GGCTGCAGAC CGCCCGGATG CGGCAAGCCG GCCATGCGGG CGAGCGCATT
TCGCAGTCCA ACTACCGCGA CGCCTACTGG ACCATGGCGC AGCTGGTGGC GCACCACACC
GTCAACGGCT GCAACCTGCG CGCCGGCGAC CTGTTCGGTT CGGGCACGCT GTCGGGGCCA
AAGCCGGAGC AAGGCGGCTC GCTGATGGAA CTGAGCGCGG GCGGCAAGCA GCCCCTGGCC
CTGTCCAATG GCGAAACCCG CAGCTGGCTC GAAGACGGCG ACAGCGTCAT CCTGCGCGGC
TACTGCCAGC AGGACGGCTT CAGGCGCATC GGCTTTGGCG AATGCCGGGG ATTGGTACTC
GCTGCCACTG TGTGA
 
Protein sequence
MSPAIDGTHD AALQSWVESA NDPASDFPIQ NLPFGRFRGE DDADWHVGVA IGDQVLDLHA 
ARLIASRDMN RLMRLPPETR HILRQTISQG LRAGSAQQKT FQAALRAQSK VELGLPCQIG
DYTDFYTSVH HATTIGKQLR PDNPLLPNYK WLPIGYHGRA SSIIPSGQGF HRPKGQTKAP
GQDAPTFGPC RRLDYELELG MLIARPNVLG EPIAMADAES HVFGVSLFND WSARDIQAWE
YQPLGPFLSK NFASTLSPWM VTLDALAPFR APFTRPEGDP EPLEYLDSPA NRAQGAINIE
LEVWLQTARM RQAGHAGERI SQSNYRDAYW TMAQLVAHHT VNGCNLRAGD LFGSGTLSGP
KPEQGGSLME LSAGGKQPLA LSNGETRSWL EDGDSVILRG YCQQDGFRRI GFGECRGLVL
AATV