Gene Pnap_0041 details

Gene Information

Locus tag: Pnap_0041
Symbol:
ID: 4689989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 41693
End bp: 42709
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 62%
IMG OID: 639833035
Product: aliphatic sulfonate ABC transporter periplasmic ligand-binding protein
Protein accession: YP_980288
Protein GI: 121602959
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
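
The length fields in this record agree with its coordinates if the coordinates are read as 1-based and inclusive, and if the terminal stop codon is not counted in the protein length. Both conventions are assumptions here, since the record does not state them. A minimal Python sketch of the check, with illustrative function names that are not part of the database:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(41693, 42709)
print(length, protein_length_aa(length))  # 1017 338
```

The arithmetic does not depend on the strand, which is left blank in this record.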


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.487407
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTCTC ATTTCTTCTC GCTTTTCACG ATTTCTCATG CGTTCAAGCG CAGTGTCGCC 
GGCGCCTGCC TGGTGCTGGC AGGCGTGGGT GCGGCCAGCG TGCCGATGCC TGCAGCGGCC
GAAGTCAAGG TCGGCGTGTC CGACTGGCCA GGCTGGGTGG CCTGGTACGT GGCCGAGCAA
AAAGGCTTTT TCAAGAAGAA CGGCGCCGAC GTCAAGCTCG TCTGGTTTGC CAACTACACC
GATTCCATCG GCGCGCTGTC CTCGGGCCAG CTCGACGCCA ACTCCCAGAC CTGGTCCGAC
ACGCTCGGCC CCCTGGCCAA GGGCCTGCCG CTCAAGGCGA TTCTGGTCAA CGACAACTCG
GCCGGCAACG ACGCGCTGAT GGTCGGCCCG AAGATCACCT CCTTCGCCCA GCTCAAGGGC
AAGAAAGTGG CGCTGGAGCA ATTCAGCATT TCGCACTTCG TGCTGGCCAC GGCGCTGGCC
AAGAACGGCA TGAAGCTCGA TGACGTGAAG ATCGTCAACC TGTCCGCCGG CGACGCCGCC
GCCGCCTTCA TCAGCGGCAA GGTCGATGCC GCCGTGCTGT GGAACCCCTG GGTGAACCAG
ATCGAAAAAA GCGGCAAGGG CAAGGCCTTG TTCACCTCCA GGGACATGCC CGGCCTGGTG
CCCGACTTGC TGGTGGCCCA GGACAAGGCC ATCCAGACCA AGCGCAAGGA GCTGGTCGGC
ATGATCAAGG CCTGGTTCGA GACCGAAAAG TTCATCCGCG AGCAACCCGC CGAAGCCGCC
AAAATCATGT CCAAGGTGGT CAGCATGTCG CCCGAGGAAT ACACCGTGTT CCTGCCCGGC
ACCAGGTTCT TCGACGCCGC CGCCAACACC CGTGCTTTTG ACGCCAAACA GGCGCTGTCG
CTGTCCAGCA CCGCGCCCAC CATCGCTGCC TTTTTGACCC AGTACAAGCT GATCGAAGGC
AAGCCTGATG CCGCCAAGGG CATTGACGGC ACGCTGCTGC AAGACGCGTT GAAGTAA
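
The GC content field presumably reports the share of G and C bases in the gene sequence above, although the record does not say how it was computed. A rough Python sketch of such a calculation is shown here. The helper name `gc_content` is hypothetical, and the function ignores the spaces used to group the sequence into blocks of ten:

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases; other characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# First two blocks of the gene sequence, spaces included as displayed.
print(f"{gc_content('ATGTCTTCTC ATTTCTTCTC'):.0%}")  # 35%
```

Passing the full 1017 bp sequence to the same function would give the gene-wide value to compare against the reported figure.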
 
Protein sequence
MSSHFFSLFT ISHAFKRSVA GACLVLAGVG AASVPMPAAA EVKVGVSDWP GWVAWYVAEQ 
KGFFKKNGAD VKLVWFANYT DSIGALSSGQ LDANSQTWSD TLGPLAKGLP LKAILVNDNS
AGNDALMVGP KITSFAQLKG KKVALEQFSI SHFVLATALA KNGMKLDDVK IVNLSAGDAA
AAFISGKVDA AVLWNPWVNQ IEKSGKGKAL FTSRDMPGLV PDLLVAQDKA IQTKRKELVG
MIKAWFETEK FIREQPAEAA KIMSKVVSMS PEEYTVFLPG TRFFDAAANT RAFDAKQALS
LSSTAPTIAA FLTQYKLIEG KPDAAKGIDG TLLQDALK
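
The protein sequence corresponds to the gene sequence read in frame 1 under translation table 11. The Python sketch below illustrates that relationship. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), and it assumes the input contains only A, C, G and T. The names `BASES`, `AMINO_ACIDS` and `translate` are illustrative:

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 shares
# these assignments with the standard code; only the start codons differ.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.upper().split())  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        a, b, c = (BASES.index(x) for x in seq[i:i + 3])
        aa = AMINO_ACIDS[16 * a + 4 * b + c]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTCTTCTC ATTTCTTCTC GCTTTTCACG"))  # MSSHFFSLFT
```

The output matches the first ten residues of the protein sequence. This gene starts with ATG, so the sketch does not handle the alternative start codons that table 11 permits.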