Gene Pnap_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3041 
Symbol 
ID4688483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3212099 
End bp3213169 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content64% 
IMG OID639836054 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_983261 
Protein GI121605932 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG CAATTCTCCC GGGTGATGGC ATCGGCCCTG AAATCGTCGC CGAAGCCGTC 
AAGGTTCTGA AGGTTCTGGA CCTGAATTTT GAAACCGAAA CCGCACTGGT CGGCGGCGCC
GCCTTTGAAG CCTACGGCCA CCCGCTGCCC GAAGCCACGC TCCAGCTCGC CAAGGATGCC
GACGCCGTGC TGTTTGGCGC CGTGGGCGAC TGGAAATACG ACACGCTGGA CCGGCCGCTG
CGCCCCGAGC AAGCCATCCT GGGCCTGCGC AAGAACCTCG GCCTGTTCGC CAACTTCCGC
CCGGCCATCT GCTACCCGCA ACTGGTCGAT GCGTCGAGCC TGAAGCGCGA ACTGGTGTCG
GGCCTGGACA TCCTCATCAT CCGCGAACTG ACCGGCGACA TCTACTTCGG CCAGCCGCGC
GGCCGGCGCA TTGCCACGGA CGGCCACTTT CCGGGCGCCG AAGAAGCTTT CGACACGATG
CGCTATTCAC GCCCCGAGAT CGAGCGCATC GCCCATGTCG CCTTCCAGGC GGCCCGCAAG
CGCGGCAAGC GTGTCACCAG CGTGGACAAG GCCAATGTGC TGGAGACCTT TCAGCTCTGG
AAGGACGTGG TGACTGAAAT CGGGCTGCAG TACCCCGACG TGGCGCTGGA CCACATGTAC
GTGGACAACG CCGCCATGCA ACTGGTCAGG GCGCCGAAGA AATTTGACGT GGTGGTGACC
GGCAACATGT TCGGCGACAT CCTGTCGGAC GCCGCCGCAA TGCTCACCGG CTCCATCGGC
ATGCTGCCAT CGGCCAGCCT GAACGACAAG AACCAGGGCC TGTACGAGCC CAGCCACGGC
AGCGCGCCCG ACATTGCCGG CAAGGGCGTA GCCAATCCGC TGGCGACCAT TTTGAGCGCG
GCCATGATGC TGCGCTTCAG CCTGAACCAG CCTGAAGCCG CCGCGCGCAT CGAGCAAGCC
GTGGACAAGG TATTGAGCCA GGGCCTGCGC ACGCCCGATA TCTACAGCGA CGGCACGACC
CGGATCGGCA CGGTGGAAAT GGGCGATGCG GTGGTCAAGG CGCTGGGCTG A
 
Protein sequence
MKIAILPGDG IGPEIVAEAV KVLKVLDLNF ETETALVGGA AFEAYGHPLP EATLQLAKDA 
DAVLFGAVGD WKYDTLDRPL RPEQAILGLR KNLGLFANFR PAICYPQLVD ASSLKRELVS
GLDILIIREL TGDIYFGQPR GRRIATDGHF PGAEEAFDTM RYSRPEIERI AHVAFQAARK
RGKRVTSVDK ANVLETFQLW KDVVTEIGLQ YPDVALDHMY VDNAAMQLVR APKKFDVVVT
GNMFGDILSD AAAMLTGSIG MLPSASLNDK NQGLYEPSHG SAPDIAGKGV ANPLATILSA
AMMLRFSLNQ PEAAARIEQA VDKVLSQGLR TPDIYSDGTT RIGTVEMGDA VVKALG