Gene Pnap_1120 details

Gene Information

Locus tag: Pnap_1120
Symbol:
ID: 4688857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1193049
End bp: 1194131
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 64%
IMG OID: 639834124
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_981357
Protein GI: 121604028
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
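
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical helpers, not part of any database API. The `gc_percent` helper shows one common way to compute GC content from a sequence string and simply ignores the spaces used to group bases.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


length = gene_length(1193049, 1194131)
print(length)                  # 1083
print(protein_length(length))  # 360
```

Under these assumptions the computed values agree with the Gene Length (1083 bp) and Protein Length (360 aa) fields listed above.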


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGATT TGTTTGACAA CCCGATGGGC CTGATGGGTT TTGAATTCGT CGAGTTCGCC 
TCGCCCACGC CCGGCGTGCT CGAACCCGTG TTCGAACGCA TGGGCTTCAC CCTGGTGGCC
CGGCACCGCT CCAAGGACGT GGTGCTGTAC CGCCAGGGCG ACATCAACTT CATCGTCAAC
CGCGAACCCA AAAGCGTCGC CGGCTACTTT GCCGCCGAGC ACGGCCCGAG CGCCTGCGGC
ATGGCGTTTC GCGTCAAGGA TGCGCACCTT GCCTACAACC GCGCGCTCGA ACTCGGCGCG
CAGCCGATTG ACATTCCAAC CGGGCCGATG GAGTTGCGCC TGCCCGCCAT CAAGGGCATA
GGCGGCGCGC CGCTGTACCT GATTGACCGC TTCGAGGACG GCAAGTCGAT CTACGACATC
GACTTTGTGT TTCTTGATGG CGTGGACCGG CATCCGCCCG GCCATGGCCT CAAGCTCATC
GACCACCTGA CGCACAACGT GTACCGGGGC CGCATGGCGT TCTGGGGCGG CTTTTACGAA
AAGCTGTTCA ACTTCCGCGA GATCCGCTAC TTTGACATCC AGGGCGAATA CACCGGCCTG
ACGTCGCGCG CCATGACCGC GCCCGACGGC AAGATCCGCA TTCCGCTGAA CGAGGAATCC
AAGCAGGGCG GCGGCCAGAT CGAGGAGTTC CTGCTCAAGT TCAACGGCGA AGGCATCCAG
CACATCGCGC TGATCTGCGA CGACCTGCTG GCCACCGTGG ACAAGCTGCA GCTGGCCGGC
GTGCCGCTGA TGACGGCGCC CAGCGACAGC TACTACGAGA TGATCGACGC CCGCCTGCCG
GGCCACGGCC AGCCGGTGGC CGAATTGCAG ACGCGCGGCA TCTTGCTCGA CGGCAGCACC
GAAGGCGGCA CGCCGCGCCT GCTGCTGCAG ATCTTCTCGC AGCCGCAGCT CGGGCCGGTG
TTCTTCGAGT TCATCCAGAG GAAGGGCGAC GAAGGTTTTG GCGAGGGCAA CTTCAAGGCG
CTGTTCGAGT CGCTCGAGCG CGACCAGATC GAGCGCGGCG CGCTGAGCGT GGAGGCTGCA
TGA
 
Protein sequence
MADLFDNPMG LMGFEFVEFA SPTPGVLEPV FERMGFTLVA RHRSKDVVLY RQGDINFIVN 
REPKSVAGYF AAEHGPSACG MAFRVKDAHL AYNRALELGA QPIDIPTGPM ELRLPAIKGI
GGAPLYLIDR FEDGKSIYDI DFVFLDGVDR HPPGHGLKLI DHLTHNVYRG RMAFWGGFYE
KLFNFREIRY FDIQGEYTGL TSRAMTAPDG KIRIPLNEES KQGGGQIEEF LLKFNGEGIQ
HIALICDDLL ATVDKLQLAG VPLMTAPSDS YYEMIDARLP GHGQPVAELQ TRGILLDGST
EGGTPRLLLQ IFSQPQLGPV FFEFIQRKGD EGFGEGNFKA LFESLERDQI ERGALSVEAA