Gene Pnap_1781 details

Gene Information

Locus tag: Pnap_1781
Symbol: aceE
ID: 4687011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1893611
End bp: 1896340
Gene Length: 2730 bp
Protein Length: 909 aa
Translation table: 11
GC content: 62%
IMG OID: 639834787
Product: pyruvate dehydrogenase subunit E1
Protein accession: YP_982012
Protein GI: 121604683
COG category: [C] Energy production and conversion
COG ID: [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component
TIGRFAM ID: [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
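
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (an assumption that matches the listed length) and that the final codon is a stop codon. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1893611, 1896340)
print(length)                     # 2730
print(protein_length_aa(length))  # 909
```

Both values agree with the Gene Length (2730 bp) and Protein Length (909 aa) fields of this record.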


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000127285
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000322845
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCAGCCA ATCCAGACGA AAACCAACAA GCCGGTATCC CCAATGCGGG TTCCAGCAAT 
ACGCAAGACA ACGACACGCA GGAGACGCGC GAGTGGATGG ATGCGCTGTC CGCGGTCATT
GAAAGTGAAG GGCCTGAACG CGCCCACTTC CTGCTTGAGC AACTGCTCGA ACATGCGCGC
CAGAAGAGCA TCGACATGCC CTTTTCGGCC AACACGGGTT ATGTCAACTC GATTGAAACC
GACCAGGAAG AGCGCTCACC CGGCAACCTG CTGATCGAGC AGCGGCTGCG CGCCTACATG
CGCTGGAACG CCATGATCAT GGTGGTCAAG GCCAACCGCC TGCACCCGGC CGATGGCGGC
GATTTGGGCG GGCACATCGG CTCGTTTGCC TCGCTGGCCA GCCTGTTTGG CGCCGGCTTC
AACCATTTCT GGCACGCTGA GAGCGAAAAC CACGGCGGCG ACTGCCTCTA CATCCAGGGC
CATGTGTCGC CCGGCGTGTA TGCCCGTGCC TACCTCGAAG GCCGCCTGAC GGAAGAGCAG
CTGCTCAACT TCCGCCAGGA AGTCGATGGC AAGGGCTTGT CGAGCTACCC GCATCCCAAG
CTGATGCCCA ATTTCTGGCA GTTCCCCACC GTCTCCATGG GCCTTGGCCC GCTGATGGCG
ATTTACCAGG CCCGTTTCCT GAAATACCTG CATGCGCGCG GCATTGCCAA CACCGAAAAC
CGCAAGGTCT GGGTGTTCTG CGGCGACGGC GAGATGGACG AGGTCGAATC GCTGGGCGCC
ATCGGCCTGG CGGCGCGTGA AAACCTCGAC AACCTGGTGT TCGTCATCAA CTGCAACCTG
CAGCGCCTCG ACGGCCCGGT GCGCGGCAAC GGCAAGATCA TCCAGGAACT CGAAGGCGAA
TTCCGCGGCG CCGGCTGGAA CGTCATCAAG CTGATCTGGG GCAGCAGCTG GGATCCGCTG
CTGGCGCGCG ACAAGGACGG CGCGCTGCGC AAGATCATGA TGGAGTGCAA CGACGGCGAT
TACCAGTCGT TCAAGGCCAA CGACGGCGCC TATGTGCGCA AGCATTTCTT CGGCCGCGAC
CCGCGCACGC TGGAAATGGT CGCCAACATG AGCGATGACG ACATCTGGAA GCTCACCCGT
GGCGGCCACG ATTCGCAAAA GGTCTATGCC GCCTTCCATT CGGCCGTCAA CCACACCGGC
CAGCCGAGCG TGCTCCTGAT CAAGACCGTC AAGGGTTTTG GCATGGGCAA GATCGGGGAG
GGCAAGAACA ATGTCCACCA GACCAAGAAG CTCGGCGACG AAGACATCAA GGCCTTCCGC
GACCGCTTCA ACATCCCCAT TCCCGACAGC CAGCTGGCCG AACTGCCGTT CTACAAGCCG
GCCGACGACA CGCCTGAAAT GCAGTACCTG CACGAGCGCC GCAAGGCGCT GGGCGGCTAC
CTGCCGCACC GCCGCACCAA GGCGGACGAG AGCTTCACCG TGCCGGCCCT GGAAACTTTC
AAGGCGGTGA TGGACCCCAC GCCAGAAGGC CGCGAAATCT CGACCACCCA GGCCTATGTG
CGCTTCTTGA CGCAACTGCT GCGCGACCAG GCGCTCGGTC CGCGCGTCGT GCCCATCCTG
GTCGATGAAG CCCGCACTTT CGGCATGGAA GGCCTGTTCC GCCAGATCGG CATCTACAAC
CCTGCCGGCC AGCAATACAC CCCGGTCGAT AAAGACCAGG TGATGTATTA CAAGGAAGAC
AAGAAGGGCC AGATCCTGCA GGAAGGCATC AACGAAGCCG GCGGCATGAG CAGCTGGATT
GCGGCGGCCA CTTCGTACAG CACCAACAAC CGCATCATGG TGCCGTTCTA TGTGTATTAC
TCGATGTTCG GCTTCCAGCG CATCGGCGAC CTGGCCTGGG CGGCGGGCGA CATGCAGGCG
CGCGGCTTTC TGCTCGGCGG CACCTCGGGG CGCACCACGC TCAACGGCGA AGGCCTGCAG
CACGAAGACG GCCACAGCCA CATCCTGGCC GGCACGATTC CGAACTGCAT CAGCTACGAC
CCGACCTTTG CGCATGAAGT CGGCGTGATC CTGCACCACG GCTTGAAGCG CATGGTTGAA
AAGCAGGAAA ACGTGTATTT CTACATCACG CTGCTGAACG AGAACTACGC CATGCCCGGC
CTCAAAGCCG GCACCGAAGA GCAGATCATC AAAGGCATGT ACCTGTGCAA CGAAGGCCCC
AAGCTGGCGC CCACCGTGCA ATTGCTGGGC TCGGGCACCA TCCTGCGCGA GTCGATTGCC
GCGCAGGAGC TGTTAGAGAA AGAGTGGGGT GTGTCGGCCA ACGTGTGGAG TTGCCCAAGC
TTCAACGAAC TGGCGCGCGA CGGCCAGAGC GCCGAGCGCT GGAACCTGCT GCATCCGCTG
GAGACGCCGC GCGTTTCCTT TGTAGCCGAA CAACTTGAAG CGTTTGCCGG TCCGGTGGTC
GCCTCGACCG ACTACATGAA GGCCTATGCC GAACAGATCC GCTCCTATAT TCCCAAGGGC
CGTACCTACA AGGTGCTGGG CACCGACGGT TTTGGCCGCA GCGATTTCCG CAGCAAGCTG
CGCGAGCACT TCGAGATCAA CCGCCACTAC ATCGTCGTGG CGGCGCTCAA GGCGCTCAGC
GAAGAGGGCA CGGTGCCGGT CGCCAAGGTG GCCGAAGCCA TCCAGAAGTA CGGCATCAAC
GCCGACAAGA TCAATCCGCT TTACGCTTGA
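
The GC content field of this record (62%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in as plain text with the 10-base blocks separated by whitespace; `gc_content` is a hypothetical helper name, and how the database rounds the value is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, as a small worked example.
print(gc_content("ATGGCAGCCA"))  # 60.0
```

Passing the full 2730 bp sequence to the same function gives a figure that can be compared against the 62% reported in the Gene Information block.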
 
Protein sequence
MAANPDENQQ AGIPNAGSSN TQDNDTQETR EWMDALSAVI ESEGPERAHF LLEQLLEHAR 
QKSIDMPFSA NTGYVNSIET DQEERSPGNL LIEQRLRAYM RWNAMIMVVK ANRLHPADGG
DLGGHIGSFA SLASLFGAGF NHFWHAESEN HGGDCLYIQG HVSPGVYARA YLEGRLTEEQ
LLNFRQEVDG KGLSSYPHPK LMPNFWQFPT VSMGLGPLMA IYQARFLKYL HARGIANTEN
RKVWVFCGDG EMDEVESLGA IGLAARENLD NLVFVINCNL QRLDGPVRGN GKIIQELEGE
FRGAGWNVIK LIWGSSWDPL LARDKDGALR KIMMECNDGD YQSFKANDGA YVRKHFFGRD
PRTLEMVANM SDDDIWKLTR GGHDSQKVYA AFHSAVNHTG QPSVLLIKTV KGFGMGKIGE
GKNNVHQTKK LGDEDIKAFR DRFNIPIPDS QLAELPFYKP ADDTPEMQYL HERRKALGGY
LPHRRTKADE SFTVPALETF KAVMDPTPEG REISTTQAYV RFLTQLLRDQ ALGPRVVPIL
VDEARTFGME GLFRQIGIYN PAGQQYTPVD KDQVMYYKED KKGQILQEGI NEAGGMSSWI
AAATSYSTNN RIMVPFYVYY SMFGFQRIGD LAWAAGDMQA RGFLLGGTSG RTTLNGEGLQ
HEDGHSHILA GTIPNCISYD PTFAHEVGVI LHHGLKRMVE KQENVYFYIT LLNENYAMPG
LKAGTEEQII KGMYLCNEGP KLAPTVQLLG SGTILRESIA AQELLEKEWG VSANVWSCPS
FNELARDGQS AERWNLLHPL ETPRVSFVAE QLEAFAGPVV ASTDYMKAYA EQIRSYIPKG
RTYKVLGTDG FGRSDFRSKL REHFEINRHY IVVAALKALS EEGTVPVAKV AEAIQKYGIN
ADKINPLYA
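
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It assumes the standard amino acid assignments (table 11 uses the same codon-to-residue mapping as the standard code and differs in its permitted start codons, which does not matter here because this gene begins with ATG). `CODON_TABLE` and `translate` are illustrative names, not an existing library API.

```python
from itertools import product

# Codons in NCBI order (first base slowest, third base fastest, order TCAG).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(sequence: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCAGCCA ATCCAGACGA AAACCAACAA"))  # MAANPDENQQ
# Last four codons of the gene sequence, ending in the TGA stop codon.
print(translate("CTTTACGCTTGA"))  # LYA
```

The two outputs match the first ten and last three residues of the listed protein sequence.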