Gene Pnap_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_2666 
Symbol 
ID4689109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp2807062 
End bp2808801 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content63% 
IMG OID639835674 
Productdihydroxy-acid dehydratase 
Protein accessionYP_982889 
Protein GI121605560 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.37627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCCTA CCCCAAAACG TCCGCTGCGA TCCCAAGAAT GGTTTGGCTC GGCAGACAAG 
AACGGCTTCA TGTACCGCAG CTGGATGAAG AACCAGGGCA TCCCGGACCA TGAGTTCCAG
GGCAAGCCCA TCATTGGCAT CTGCAACACC TGGTCGGAGC TGACCCCGTG CAATGCCCAT
TTCCGCAAAA TCGCCGAGCA TGTCAGGCGT GGCGTGTTCG AGGCCGGCGG CTTTCCGGTC
GAGTTCCCGG TGTTTTCCAA TGGCGAGTCC AATTTGCGGC CGACGGCCAT GTTCACCCGC
AACCTGGCCA GCATTGATGT GGAAGAGTCC ATCCGGGGCA ATCCGATTGA CGCCGTGGTA
CTGCTGGCCG GCTGCGATAA AACCACCCCC GCCTTGCTGA TGGGCGCGGC CAGTTGCGAC
ATTCCCGCCA TCGTGGTCAG TGGCGGGCCG ATGCTCAACG GCAAGCACCA GGGCAAGGAT
ATTGGATCGG GCACGGTCGT CTGGCAGCTG CATGAAGCGG TCAAGGCCGG GCACATCAGC
ATGCATGAAT TCATGTCCGC CGAAGCCGGC ATGTCGCGCT CGGCCGGCAC CTGCAACACC
ATGGGAACCG CCTCCACCAT GGCTTGCATG GCCGAGGCGC TGGGCACCTC GTTGCCGCAT
AACGCCGCCA TTCCGGCGGT TGACGCGCGG CGCTATGTGT TGGCCCATAT GTCCGGCATG
CGCATCGTCG AAATGGCCCT GGAAGACCTT CGCCTGTCGA AAATCCTGAC GCGGGAAGCC
TTCGAGAACG CCATCCGGGT CAATGCCGCC ATTGGCGGAT CGACCAATGC GGTGATCCAT
CTGAAGGCGA TTGCCGGACG GCTGGGAATT CCGCTGGAAC TGGAGGACTG GACGAGCATC
GGCCGCGGCA CGCCGACCAT CGTGGACCTG ATGCCCTCGG GCCGCTTTTT GATGGAAGAC
CTGTACTACG CCGGCGGCCT GCCTGCCGTG CTGCGCCGCA TGGGCGAAGC CGATCTGCTG
CGGCATCCGG GCGCGCTGAC CGTCAACGGA AAGTCACTCT GGGAAAACGT GCGCGAAGCA
CCGATCTACA ACGACGAGGT GATTCGTCCG CTGGCCAAGC CGCTGATCGA GGACGGCGCC
ATCTGCATCC TGCGCGGCAA CCTGGCGCCG CGCGGCGCGG TCCTGAAACC GTCGGCGGCC
TCGCCGCACC TGATGCAGCA CCGTGGCCAG GCGGTGGTTT TCGAAGACTT CGACCATTAC
AAATCACGCA TCAACGATCC GGACCTTGAG GTTGACGCGA ACTCGGTGCT GCTGATGAAA
AACTGCGGCC CCAAAGGCTA CCCCGGCATG GCGGAGGTCG GCAACATGGG CTTGCCGCCC
AAGCTGCTGG CGCAAGGCGT GACCGACATG GTCCGCATTT CGGATGCCCG CATGAGCGGC
ACCGCCTACG GCACCGTGGT GCTGCATGTG GCGCCCGAAG CCGCAGCCGG CGGACCGCTG
GCCATTGTGC GCGATGGCGA CTGGATTGAA CTCGATTGCG CAGGGGGGCG CCTGCATCTG
GAAATCGATG AGGCTGAAAT GGCGGCCAGG TTTGAAAACC TGCAAGCCAG GAAGCCGCCA
GAGCGCACGG GCGGCTACCG GGAGCTGTAC ATCGACCATG TCCTGCAGGC CGACCAGGGC
TGCGACTTCG ACTTTCTGCT CGGATGCCGG GGTGCGGAGG TTCCGCGTCA TTCCCATTAA
 
Protein sequence
MSPTPKRPLR SQEWFGSADK NGFMYRSWMK NQGIPDHEFQ GKPIIGICNT WSELTPCNAH 
FRKIAEHVRR GVFEAGGFPV EFPVFSNGES NLRPTAMFTR NLASIDVEES IRGNPIDAVV
LLAGCDKTTP ALLMGAASCD IPAIVVSGGP MLNGKHQGKD IGSGTVVWQL HEAVKAGHIS
MHEFMSAEAG MSRSAGTCNT MGTASTMACM AEALGTSLPH NAAIPAVDAR RYVLAHMSGM
RIVEMALEDL RLSKILTREA FENAIRVNAA IGGSTNAVIH LKAIAGRLGI PLELEDWTSI
GRGTPTIVDL MPSGRFLMED LYYAGGLPAV LRRMGEADLL RHPGALTVNG KSLWENVREA
PIYNDEVIRP LAKPLIEDGA ICILRGNLAP RGAVLKPSAA SPHLMQHRGQ AVVFEDFDHY
KSRINDPDLE VDANSVLLMK NCGPKGYPGM AEVGNMGLPP KLLAQGVTDM VRISDARMSG
TAYGTVVLHV APEAAAGGPL AIVRDGDWIE LDCAGGRLHL EIDEAEMAAR FENLQARKPP
ERTGGYRELY IDHVLQADQG CDFDFLLGCR GAEVPRHSH