Gene Pnap_3688 details

Gene Information

Locus tag: Pnap_3688
Symbol:
ID: 4686175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3926429
End bp: 3927475
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 65%
IMG OID: 639836706
Product: aldo/keto reductase
Protein accession: YP_983905
Protein GI: 121606576
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
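
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the listed values but is not stated in the record). The function names gene_length and protein_length are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3926429, 3927475)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Both results agree with the Gene Length and Protein Length fields of this record.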


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTACC AGCCCAACGC AGCCCGCTAT GACACCATGC CCTACCGAAG CTGCGGACGC 
AGCGGCTTGA TGCTGCCCGC CATCACCCTG GGGCTGTGGC ATAACTTTGG CGACGCCACG
CCCATGGAAA CGCAGCGCGC CATGCTGCGT ACCGCGTTCG ACCTGGGCAT CACGCACTTT
GACCTGGCCA ACAACTACGG CCCGCCCGGC GGCAGTGCGG AAATCAATTT TGGCGAGCAT
CTGCGGCGCG ACTTCAAGCC CTATCGCGAC GAACTCATCA TTTCCAGCAA GGCGGGCTGG
GACATGTGGC CCGGCCCTTA TGGGCAGGGC GGCGGCTCGC GCAAGCATGT GCTGGCCAGC
CTGGACCAGA GCCTCAAGCG CATGGGGCTT GACTATGTCG ATATTTTTTA TTCGCACCGC
TTTGACCCCG ACACCCCGCT GGAGGAAACC ATGGGGGCGC TGGCCACGGC GGTTCAGCAG
GGCAAGGCCT TGTACGTCGG CCTCAGCAGC TACTCGGCGG CCAAGACGAG CGAAGCCGCG
GCCATTTTGC GTGCCATGGG TGTGGCGCCG TTGATTCACC AGCCCTCTTA CAGCCTGCTG
AACCGCTGGA TTGAGGGCGA GCTGCTCGAC ACCCTGGCTG AAACCGGCAT GGGCTGCATT
GCGTTCAGCG CGCTGGCGCA GGGGCTGCTG ACCGACAAGT ACCTGAACGG CATTCCGGCG
GACGCCCGAA TCAACCGCCC CGGCGGCAGT TCCCTCAAGG CCGAACATCT GAGCGAACAA
AACCTCAAGC ATGCGCGTGC CCTGAATGAG CTGGCGCTGG CGCGCGGACA GAGCCTGGCC
CAGATGGCGA CGGCCTGGGT GCTGCGCGAT GGCCGCGTGA CCTCGGCGTT GATTGGCGCC
AGCCGCCCGG CGCAAATCGC GGAACTGGTC GGCGCGCTGC GCAAGCTTGA GTTTTCCGCC
GAAGAACTGG CGGCCATTGA CCAGCACGCG GTGGACGGCG GCATCAACCT GTGGCAACGC
CCCTCGACCG ATCAGCGCCC GGCTTGA
 
Protein sequence
MNYQPNAARY DTMPYRSCGR SGLMLPAITL GLWHNFGDAT PMETQRAMLR TAFDLGITHF 
DLANNYGPPG GSAEINFGEH LRRDFKPYRD ELIISSKAGW DMWPGPYGQG GGSRKHVLAS
LDQSLKRMGL DYVDIFYSHR FDPDTPLEET MGALATAVQQ GKALYVGLSS YSAAKTSEAA
AILRAMGVAP LIHQPSYSLL NRWIEGELLD TLAETGMGCI AFSALAQGLL TDKYLNGIPA
DARINRPGGS SLKAEHLSEQ NLKHARALNE LALARGQSLA QMATAWVLRD GRVTSALIGA
SRPAQIAELV GALRKLEFSA EELAAIDQHA VDGGINLWQR PSTDQRPA
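
The gene and protein sequences above are related through translation table 11, and the GC content field is a property of the gene sequence. The following is a minimal sketch of how both could be recomputed from the space-separated 10-base blocks shown in this record. It uses only the Python standard library; the helper names clean, translate, and gc_percent are hypothetical. Table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons, so the sketch does not handle alternative start codons (this gene starts with ATG).

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments for
# translation table 11 are the same as for the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAACTACC AGCCCAACGC AGCCCGCTAT"
print(translate(first_blocks))  # MNYQPNAARY
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to translate and gc_percent would allow the Protein Length and GC content fields to be checked in the same way.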