Gene Pnap_1790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_1790 
Symbol 
ID4688486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp1904723 
End bp1905754 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content66% 
IMG OID639834796 
Producthypothetical protein 
Protein accessionYP_982021 
Protein GI121604692 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0502683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCAC CTCCCGATTC ACCGCCCCCC TTGCTGCCGC CCATTGGTCA AGGTCAGGCG 
CAGCCTGCGT CCGCGCCCGT GCCCGGCAGT GCCGCGCCCG AGAACATCGC CAACCTGGAA
CTCAAGGCCA GGCTGCTGCT GCTGTTCATG CTGCTGCTGG TGTGCGGCTC GGCCCTGTAT
GTGCTGTATG CGCGCGGCGC CTTTGAATCG ACCCAGCGCC TGGTGCTGAT GGCCGAGGAT
TCCGAAGGCG TGGTGGTCGG CATGGACATC ACGTTTGCCG GATTCCCCAT CGGCCGGGTG
CGGCGCATCG AACTGGCCGA AGACGGCAAG GCGCGCATCC TGGTCGATGT GGCCGAGAAG
GACGCCCACT GGCTGCGCAC CAGCAGCGTT TTCACGCTGG TCAAGGGCAT CGTGGGCGGC
CCCAACATCC GCGCCTATAC CGGGCTGCTC AATGATCCGC CGCTGCCCGA CGGTGCGGAA
CGGTCCGTGC TGCAGGGCGA CGCCAGCGCG GAAATCCCCA AGGTCATTTC CGCCGCCAAG
GACCTGATCG ACAACCTGAA CAGCCTGACG GGCAGCAGCG GCTCGGTCGG CACCAGCCTG
GCCAATCTGC AGGTGCTGAC CGGCCGGCTC AACGGACCCG GCGGCGCGCT GACCGTGTTG
CTGGGCAGCG AAGCAGAAGC CAAAAAGTTT TCCGCCACGC TGGACCGCGC CAATGCCTTG
CTGGCCAAGC TCGACGGCAT GGCCGCCAAG ACCGACACCC AGGTGTTTGG CGACAAGGGC
GTGCTGCCCG AAACACGCGC CACCGTCGTG CAGCTCAATG CCATGCTGGG CGAGGCGCGC
ACCAGCCTGA AGAAGGTCGA TGCCATTTTG GTGGATGCGC AGGCCGTCAC CTCGAATGCC
AGGGACGCTA CGTCCGACCT GGGCGCCTTG CGCGGCGACG TGGAAGCCAG CCTGCGAAAA
GTCGAAGGCC TGGTCAGCGA GATCAACCGC AAATGGCCGT TTGCCCGCGA TGCGGAGATG
AAACTGCCGT GA
 
Protein sequence
MSSPPDSPPP LLPPIGQGQA QPASAPVPGS AAPENIANLE LKARLLLLFM LLLVCGSALY 
VLYARGAFES TQRLVLMAED SEGVVVGMDI TFAGFPIGRV RRIELAEDGK ARILVDVAEK
DAHWLRTSSV FTLVKGIVGG PNIRAYTGLL NDPPLPDGAE RSVLQGDASA EIPKVISAAK
DLIDNLNSLT GSSGSVGTSL ANLQVLTGRL NGPGGALTVL LGSEAEAKKF SATLDRANAL
LAKLDGMAAK TDTQVFGDKG VLPETRATVV QLNAMLGEAR TSLKKVDAIL VDAQAVTSNA
RDATSDLGAL RGDVEASLRK VEGLVSEINR KWPFARDAEM KLP