Gene Pnap_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3039 
Symbol 
ID4688815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3209909 
End bp3211360 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content64% 
IMG OID639836052 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_983259 
Protein GI121605930 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGCA CCCTCTACGA CAAAATCTGG GACGAGCACG TCGTTCATAC CGAAGAAGAC 
GGCACCTCGA TCCTGTACAT CGACCGCCAT CTGGTCCATG AAGTCACCAG CCCGCAGGCC
TTTGAAGGCC TGCGCGAAGC CGGCCGCAAG GTCTGGCGCA TCAGTTCCAT CGTCGCGACC
GCCGACCACA ACACGCCCAC GACCGGCTGG GAACTGGGTT ACGACGGCAT CACCGACCTG
GTCAGCAAGG AGCAGATCAC CACGCTGGAC GCCAACATCA AGGAATTCGG CGCCGCTGCC
TTCTTCCCCT TCATGTCCAA ACGCCAGGGC ATCGTGCATG TGATCGGCCC TGAAAACGGC
GCCACGCTGC CCGGCATGAC CGTCGTTTGC GGCGACTCGC ACACCTCGAC CCACGGCGCG
TTTGGCGCGT TGGCGCACGG CATCGGCACC AGCGAGGTCG AACACGTCAT GGCCACGCAA
ACCCTGCTGG CCAAAAAAGC CAAAAACATG CTCATCAAGG TCGAAGGCGC CGTGACCAAA
GGCGTGACCG CCAAGGACAT CGTGCTGGCC ATCATCGGCA AGATTGGCAC GGCGGGCGGC
ACCGGCTACA CGATTGAATT TGCCGGCTCG GCGATTCGCG CACTCAGCAT GGAAGGCCGC
ATGACGGTCT GCAACATGGC GATTGAAGGC GGCGCGCGCG CCGGCCTGGT GGCCGTCGAT
GCCAAGACCA TCGAGTACCT CAAGGGCCGC CTGCTGGCTC CCGGAACCGA CTCCGTCACC
GGCAAGTTCG TCGGCGGCCC CGAATGGGAC ATGGCCGCGC GCTACTGGGC CACGCTGCAC
TCCGACGCCG ACGCCACCTT TGACGCCGTG GTCGAGCTGG ACGCCAGCCA GATCCTGCCG
CAGGTCAGCT GGGGCACTTC GCCCGAGATG GTGCTGTCGA TTGAAGACCG CGTACCCGAC
CCGGAAAAGG AAAAGGACGC CAACAAGCGC GGCGCCATCG AACGCGCCCT GACCTACATG
GGCCTGGAGC CGGGCAAGGC GCTGAATGAC CTGTACATCG ACAAGGTGTT CATCGGTTCG
TGCACCAACA GCCGCATCGA AGACATGCGC GAAGCGGCTG CCGTGGTGAA GCATATCGGC
CAGAAGGTCG CCAAAAATGT CAAGCTGGCA ATGGTCGTGC CGGGCTCTGG CCTGGTCAAG
GAGCAGGCCG AGCGCGAAGG CCTGGACAAG ATCTTCATCG CCGCCGGCTT TGAATGGCGC
GAGCCCGGCT GCTCGATGTG CCTGGCGATG AATGCCGACC GGCTGGAGCC GGGCGAGCGC
TGCGCCTCCA CCAGCAACCG CAACTTTGAA GGCCGCCAGG GTGCGGGCGG GCGCACCCAC
CTGGTCAGCC CGGCCATGGC GGCGGCGGCG GCGGTGCATG GCCACTTTGT CGATATCCGT
AAATTTGTCT GA
 
Protein sequence
MGRTLYDKIW DEHVVHTEED GTSILYIDRH LVHEVTSPQA FEGLREAGRK VWRISSIVAT 
ADHNTPTTGW ELGYDGITDL VSKEQITTLD ANIKEFGAAA FFPFMSKRQG IVHVIGPENG
ATLPGMTVVC GDSHTSTHGA FGALAHGIGT SEVEHVMATQ TLLAKKAKNM LIKVEGAVTK
GVTAKDIVLA IIGKIGTAGG TGYTIEFAGS AIRALSMEGR MTVCNMAIEG GARAGLVAVD
AKTIEYLKGR LLAPGTDSVT GKFVGGPEWD MAARYWATLH SDADATFDAV VELDASQILP
QVSWGTSPEM VLSIEDRVPD PEKEKDANKR GAIERALTYM GLEPGKALND LYIDKVFIGS
CTNSRIEDMR EAAAVVKHIG QKVAKNVKLA MVVPGSGLVK EQAEREGLDK IFIAAGFEWR
EPGCSMCLAM NADRLEPGER CASTSNRNFE GRQGAGGRTH LVSPAMAAAA AVHGHFVDIR
KFV