Gene Pnap_1560 details

Gene Information

Locus tag: Pnap_1560
Symbol:
ID: 4689468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1646981
End bp: 1648879
Gene Length: 1899 bp
Protein Length: 632 aa
Translation table: 11
GC content: 66%
IMG OID: 639834563
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_981795
Protein GI: 121604466
COG category: [E] Amino acid transport and metabolism
    [G] Carbohydrate transport and metabolism
    [R] General function prediction only
COG ID: [COG1082] Sugar phosphate isomerases/epimerases
    [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
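
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS of the given length, assuming one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


length = gene_length(1646981, 1648879)
print(length)                  # 1899
print(protein_length(length))  # 632
```

Both values agree with the Gene Length (1899 bp) and Protein Length (632 aa) fields in the record.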


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.288895
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCATT CAATCGCCAC TGTCTCGCTG TCTGGAATGC TGCGCGAGAA GCTGCAGGCA 
GCAGCCGCTG CGCACTTCGA TGGCGTGGAA ATCTTTGAAA ACGACCTGCT GCAGTTCCCG
GGAACGCCGG CCGAGGTGCG CCGCATTTGC GAGGATCTGG GCTTGCGCAT CGACATGTTC
CAGCCGTTTC GTGACTTCGA CGGCACCACG CCCGCGCAGG TCGCTCGCAG TCTGGAACGC
GCCGAACGCA AGTTCGACGT GATGCAGGAG CTGGGCACTG AATTGATTCT GGTGTGCTCC
AACGTGCAGC CCGACGCGCT CGGCGATGTC GATCAACTCG CCGGGCAATT CCACCAGCTC
GCCGAACGCG CCGGCAGGCG CGGCATGCGC ATTGCCTACG AGGCGCTGGC CTGGGGCAGC
CAGGTCAAGC TCTGGTCCCA GGCCTGGAGC GTGGTCGAGC GCGTCAACCA CCCGCACATG
GGGCTGGCGC TGGACAGCTT TCACACGCTG TCGCTGCGCG ACGACCCGGG CGGCATCGCC
CGGCTGCCGG GCGAGAAGAT TTTCTTTGTC CAGCTGGCCG ATGCCCCGTG GGTGAACACC
GACGTGTTGA CGCACAGCCG CCATTACCGC TGCTTTCCGG GCCAGGGCGA GATGGAGGTG
ACCAGGTTCA CTGCGGCCGT CATCGAGTCG GGCTACAGCG GGCCGCTTTC GCTGGAAATC
TTCAACGATG AATTCCGCTC CGCGCCGGCG CGGGCCAATG CGGTCGATGC CAGGCGCTCG
CTGCTGTGGC TCGAAGACCA GGTGTGCCAG AGTCGGGCGA AGGATGCACC GCCTTGCAAG
GCGCCGCTGT TCGCGCCGCC GCCAGCGCCC GTGCTGACTG GATGGGCGTT CGTCGAGTTC
GCCGTGGACC CGGCCGCAGG CGGCCGGCTG GCCGCGTGGC TTCAGGCCAC TGGTTTTCAC
CGCATCGGCC ATCACCGCTC GAAGAAGGTT GATCTTTATG GCCAGGGCGA GGTGCGCATC
ATCGTCAATC TGGAAGAGGA TTCGCTGGCC CGCAGCCATT TTGAACAGCA CGGCACCTCG
GCCTGCGCCG TGGCGCTGGC CACGCCGGAT GTCGCCGGGG CGCTGGCGCG CGCCGAAGCC
TTGCTGTGCC CGCGTGTGAA CGGGCGTGTG GGTCAGAACG AGCTGACCAT TCCCTCCGTG
CGCGCACCCG ACGGCAGCCT GCTCTATTTT TGCGAAGCGC CGCAAGGTGG CCGCTACGCC
TTTGAAGCCG ACTTTGTGAT GGACGAATCC GCCCTGCACG GCGGTGCACT GGGTGACAGG
GCACGTTTCG ACCACCTGGT GCAGGCTGTG CCGGCCGGGC AGGTCGAGCC GTGGGTGCTG
TTCCATCGTG CGGTGCTGGG CCTGGCGCCC GAGCGCAACG TCGTGATGCA TGACCCCTAC
GGCGTGATCA GGAGCCGCGA GATTGAATCG ACCGACCGTG AAGTGCGCGT GTCCATCACT
GTTTCCGAAC GCGACAACAC CTCGGTGTCC CGTGCCGTGT CGAGCTTTCG TGGCGCTGGC
GTTCAGCAAA TTGCCATCGC AGTGACCGAC CTGGTGGCAA CGGCACGCGC ACTCAAGGCA
TCGGGCGCGC CGCTGCTGCC CGTGCCGGCC AACTACTACG ACGACCTGCT CGCCAGGTAC
GACATCGACG CCGGCTTGCT GGCGGCCATG CGCGAACTGG GCATCCTGTA CGACCGCGAA
GCGGACGGTG GCGAGTTCCT GCATTTGTAC CTCACGCCGT TCGACGACCG TTTCCATTTT
GAACTGGTCG AGCGACGCGC AGGCTACGCC GGCTATGGCG CGCCGAACGC CCCGTCGCGA
CTGGCGGCCA TGGCGCAGTG GCGCCAGGCG CAGCAGTGA
 
Protein sequence
MRHSIATVSL SGMLREKLQA AAAAHFDGVE IFENDLLQFP GTPAEVRRIC EDLGLRIDMF 
QPFRDFDGTT PAQVARSLER AERKFDVMQE LGTELILVCS NVQPDALGDV DQLAGQFHQL
AERAGRRGMR IAYEALAWGS QVKLWSQAWS VVERVNHPHM GLALDSFHTL SLRDDPGGIA
RLPGEKIFFV QLADAPWVNT DVLTHSRHYR CFPGQGEMEV TRFTAAVIES GYSGPLSLEI
FNDEFRSAPA RANAVDARRS LLWLEDQVCQ SRAKDAPPCK APLFAPPPAP VLTGWAFVEF
AVDPAAGGRL AAWLQATGFH RIGHHRSKKV DLYGQGEVRI IVNLEEDSLA RSHFEQHGTS
ACAVALATPD VAGALARAEA LLCPRVNGRV GQNELTIPSV RAPDGSLLYF CEAPQGGRYA
FEADFVMDES ALHGGALGDR ARFDHLVQAV PAGQVEPWVL FHRAVLGLAP ERNVVMHDPY
GVIRSREIES TDREVRVSIT VSERDNTSVS RAVSSFRGAG VQQIAIAVTD LVATARALKA
SGAPLLPVPA NYYDDLLARY DIDAGLLAAM RELGILYDRE ADGGEFLHLY LTPFDDRFHF
ELVERRAGYA GYGAPNAPSR LAAMAQWRQA QQ
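
The sequences above are laid out in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation and a basic start/stop codon check. All function names are hypothetical. The stop codons are those of translation table 11; the start codon set is an assumption covering only the most common bacterial initiators.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def looks_like_cds(
    seq: str,
    starts: tuple = ("ATG", "GTG", "TTG"),  # assumed: common bacterial starts only
    stops: tuple = ("TAA", "TAG", "TGA"),   # stop codons in translation table 11
) -> bool:
    """Check length divisibility by 3 and the first and last codons."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# Example: the first two blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCGCCATT CAATCGCCAC")
print(fragment)
print(round(gc_fraction(fragment), 2))
```

Applied to the full gene sequence, `gc_fraction` can be compared against the GC content field in the record, and `looks_like_cds` against the CDS type annotation.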