Gene Pnap_4757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4757 
Symbol 
ID4685961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008760 
Strand
Start bp138953 
End bp141700 
Gene Length2748 bp 
Protein Length915 aa 
Translation table11 
GC content62% 
IMG OID639826746 
Product2-oxoacid dehydrogenase subunit E1 
Protein accessionYP_973908 
Protein GI121583477 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
[TIGR03186] alpha-ketoglutarate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCTAT TCAAGCTGCA AATTCTGCCA TTTTCTTTCT ACAGTAGTAC ACCTATTGAC 
ACCCTTGGAA TGAATCCTCC CATGCATCAG CAACTTGCCC CTTTTCGCTC CGTTGAGCCC
GTCGATCCTG ATCCGGCAGA AAGCGCCGAA TGGCGAGATG CACTCATGTC CTTGCTGCAG
GCGTCCGGAC CCGGGCGCAC GCGGCAAATT CTTGACATGC TCGATGCGAT GAGCCGCGAC
CCGAAGATCG CCTGGCAGCC GGCACGCGGC ACGCCTTATG TCAACACCAT TGCGGTCGAT
CAGCAGCCCG TGTTTCCCGG TGACTTGGCC ATGGAGGAGC GCCTGGCTTC GCTCGTGCGC
TGGAATGCGC TGGCCATGGT GGTGCGGGCC AACCAGGCCT ATGGCGAGCT CGGCGGCCAC
ATTGCCAGCT ACGCCAGTGC GGCCGACCTG TTCGAGACGG GCTACAACCA TTTTTTTCAT
GCCCGCTGCG AGGCGCCGGG GCGTGAGCAC CTGGGCGACC TGGTGTTCTT CCAGCCGCAC
AGCTCGCCGG GCGTGTATGC ACGGGCCTAT CTGGAAGGCC GACTGGACGT CGAGGATTTG
AGCTACTACC GGCAGGAATT GTCAGCTCCG GCGGCCACCA CCGGGCAGGG ACCACGCGGC
CTGTGCAGCT ATCCGCATCC CTACCTGATG CCCGATTTCT GGCAGTTTCC CACCGGCTCG
ATGGGCATCG GCCCGATCAG CTCGATCTAC CATGCGCGCT TCATGCGCTA CCTCACGCAC
CGCCAGCTGC TCGATTGCTC GGCCAGAAAG GTGTGGGGCG TGTTTGGCGA CGGCGAGATG
GATGAACCCG AATCGATGAG CGCCTTGACG CTGGCCGCGC GTGAAAAGCT GGATAACCTG
GTCTGGGTGG TCAACTGCAA CTTGCAGCGG CTCGACGGCC CGGTGCGCGG CAATGGCCGC
ATCATCGACG AACTCGAGCA GCTCTTTGCC GGTGCGGGCT GGAACGTCAT CAAGCTGGTG
TGGGGCAGCG ACTGGGACGG TCTGTTTGCA CGCGATGTGA GCGGCGCGCT CGCCAAGGTT
TTTGCCAACA CGGTGGACGG CCAGATGCAG ACGTTTGCCG CCAAGGATGG GCGCTACAAC
CGCGAAACAT TCTTTGGCCA GAACGAGGCC CTGGCCGCGC TGGCCCAAGG CATGACGGAC
GAGCAGATCG ACCGCCTCAA GCGTGGCGGC CACGACATGG TGAAGATTTA CGCCGCCTAC
GCCGCTGCTG CGCAGCACAA GGGCCAGCCC ACGGTGATCC TGGCGCACAC CAAGAAAGGC
TACGGCATGG GCAGCGCGGG CCAAGGCAAG ATGACCACGC ACAGCCACAA GAAGTTTGAC
GAGACCGACC TGATTGCGTT TCGCAATCGC TTCCAGCTGC CCTTGACGGA CGATCAGGTC
ACCTCGCTGA GTTTTTACCA GCCGCCAGCC GACAGCCTGG AGATGCAATA CCTGCATGCG
CAGCGCCAGA AACTGGGCGG CTACCTGCCC AAGCGCTACA CCACCTGCGA GCAGGTCGCT
GTACCCGAGA TTGCCAGCTA CGCCCAGTTC GCCCTCAAGG CCGACGGCAA GGAGATGAGC
ACCACCATGG CCTTTGTGCG CATGCTGGGC AACCTGCTCA AGGACAAGGC CCTGGGCCAA
CGGATCGTAC CGATCGTGGC CGACGAGGCC CGAACCTTTG GCATGGCCAA CCTGTTCAAG
CAGGTCGGCA TCTACTCCAG CGTCGGCCAG CGCTACGCGC CCGAGGACAT CGGCTCGGTG
CTGAGCTACC GCGAGGCCAT GGACGGCCAG ATTCTGGAAG AGGGCATTTC CGAGGCGGGC
GCGCTGGCCA GCTGGACGGC AGCAAGCACC AGCTACAGCG TGCATGGCCT GGCCATGCTG
CCGTTCTACA TCTATTACTC CATGTTTGGC TTTCAGCGCG TGGGCGACCA GATCTGGGCA
GCGGCCGACC AGCGCGCGCG CGGCTTTTTG CTCGGTGCCA CCTCGGGCCG GACCACGCTG
GGCGGCGAAG GGCTGCAGCA CCAGGACGGC ACCAGCCACC TGGTGGCCGC CACCATTCCC
AACTGCAAGG CCTACGACCC GGCGTTTGCC GGTGAGCTGG CCGTCATCAT CGACCACGGC
ATGCGAGAGA TGATGGTGGA GCAAAGGGAC ATTTTCTACT ACATCACCAT GATGAACGAG
AACTACGCCC AGCCCACCCT ACCCGCCGGG GTGGAGCACG ATGTAATTCA AGGGTGCTAT
AAATTCAATA GCTATTTGCC AATGAACAAC GAGGGCCATG CCGCTGAAAT ATCCAGAGAA
GTGACCTTGA TGGGGTCGGG GGCCATTCTT CTTGAGGTGA TCAAGGCCGC GCAGCAACTG
GCCCTGCAGG GCATCTCGGT GACCGTGTTC AGCGTGACGA GCTGGAGCGA ACTCGCACGC
GAAGGCCAAG CCAGCACCCA GCACACGCCA GCGGGGAAAA GGAGCGAATC GGTGCCTTTC
ATTGCCAACA TGCTGCGCGC AAGCAGCGGC CCCATCATTG CCGCGACCGA CTATGTCCGT
GCCGTGCCGG AAAGCGTGCG TGCCTTCGTG CCCGATGGGC GCGACTACCT CACCCTGGGC
ACTGACGGTT TTGGTCGGAG TGACACGCGT GCGGCACTGC GTGCCTTCTT TGGCGTCGAT
GCATCCAGCA TTGCGCAGGC CGCCCTCACG CTGCTGGCCC AGGACTGA
 
Protein sequence
MYLFKLQILP FSFYSSTPID TLGMNPPMHQ QLAPFRSVEP VDPDPAESAE WRDALMSLLQ 
ASGPGRTRQI LDMLDAMSRD PKIAWQPARG TPYVNTIAVD QQPVFPGDLA MEERLASLVR
WNALAMVVRA NQAYGELGGH IASYASAADL FETGYNHFFH ARCEAPGREH LGDLVFFQPH
SSPGVYARAY LEGRLDVEDL SYYRQELSAP AATTGQGPRG LCSYPHPYLM PDFWQFPTGS
MGIGPISSIY HARFMRYLTH RQLLDCSARK VWGVFGDGEM DEPESMSALT LAAREKLDNL
VWVVNCNLQR LDGPVRGNGR IIDELEQLFA GAGWNVIKLV WGSDWDGLFA RDVSGALAKV
FANTVDGQMQ TFAAKDGRYN RETFFGQNEA LAALAQGMTD EQIDRLKRGG HDMVKIYAAY
AAAAQHKGQP TVILAHTKKG YGMGSAGQGK MTTHSHKKFD ETDLIAFRNR FQLPLTDDQV
TSLSFYQPPA DSLEMQYLHA QRQKLGGYLP KRYTTCEQVA VPEIASYAQF ALKADGKEMS
TTMAFVRMLG NLLKDKALGQ RIVPIVADEA RTFGMANLFK QVGIYSSVGQ RYAPEDIGSV
LSYREAMDGQ ILEEGISEAG ALASWTAAST SYSVHGLAML PFYIYYSMFG FQRVGDQIWA
AADQRARGFL LGATSGRTTL GGEGLQHQDG TSHLVAATIP NCKAYDPAFA GELAVIIDHG
MREMMVEQRD IFYYITMMNE NYAQPTLPAG VEHDVIQGCY KFNSYLPMNN EGHAAEISRE
VTLMGSGAIL LEVIKAAQQL ALQGISVTVF SVTSWSELAR EGQASTQHTP AGKRSESVPF
IANMLRASSG PIIAATDYVR AVPESVRAFV PDGRDYLTLG TDGFGRSDTR AALRAFFGVD
ASSIAQAALT LLAQD