Gene Pnap_2420 details


Gene Information

Locus tag: Pnap_2420
Symbol:
ID: 4687537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 2555781
End bp: 2556803
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 64%
IMG OID: 639835430
Product: aliphatic sulfonate ABC transporter periplasmic ligand-binding protein
Protein accession: YP_982648
Protein GI: 121605319
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.311787
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCAT CCATCACCGC CCCGCCTGAC CTGGCGCCGC CGCCGCCGCC GATTCGGCAA 
AGCCTGCGCG ACCTGCTCGC CAGTGTCCTG CTGGTCGCTA CCCTGGCACT GGCCACCAGC
CTTTTGGCGC TGCCGAAAGC GCACGCCCAG GACAATGTCG AACTGCGCAT CGGCTACCAG
AAATCGGCCA GCCTGTTTGT GCTGCAAAAA GCCCAGGGCA CGCTGGAAAA GCGTCTGGCG
CCGCTGCATG CCTCCGTCAA ATGGGTGGAG TTTCCGGCCG GCCCGCAACT GCTTGAAGGC
CTGAACCTGG GCTCTGTCGA TGTCGGCTAC GTGGGCGAGG CGCCGCCGAT TTTCGCGCAG
GCCGCCGGCG CCAGGTTCGC CTATATCGGC TACGACCCGG CCGCGCCTGA AGCCGAAGCC
CTCCTGGTGC CCAAAACTTC CGCCATCAAG TCCGTGGCTG AGCTGAAAGG CAAAAAAGTC
GCTTTGAACA AGGGCAGCAA CGTGCATTAC CTGCTGGTCA AACTGCTGGA GAAAAACGGC
CTCAAGCTCA GCGATGTTCA GCCGATCTAC CTGGCGCCTG CCGACGCGCG GGCGGCCTTT
GAAAGCGGCA GCGTCGATGC CTGGGTGATC TGGGATCCGT TTGCCGCAGC CGCTGAAAAA
GCCATAGGCG CCCGGGTGCT GGCCAACGGC AAGGGCGTGG TCAACAACTA TGCCTACTAC
CTGGCCGAAC GCAATTTCGC CGCCAAAAAC CCCAAGGTCA TCCAGGCGCT GTTTGACGAC
TCGGTGGAGC GGGGCGCCTG GCTCAAGGCC AATGTGCGCA AGGCGGCCGA ACAGATCGCG
CCGCTGCAGG GCCTGCCGGT CGAGGTCGTC GAACTCAGCC TGCACCGCTA TGAATTCAAG
GTCAAGCCGG TGCCCGACAG CGTCATTGCC GACCAGCAAA AGCTCGCCGA CACCTTCTTT
GACCTCAAGC TGATCCCCAA AGCCATCGCG GTGCGCGACG CCGCCTACCA GGCCGCGCCC
TGA
 
Protein sequence
MTSSITAPPD LAPPPPPIRQ SLRDLLASVL LVATLALATS LLALPKAHAQ DNVELRIGYQ 
KSASLFVLQK AQGTLEKRLA PLHASVKWVE FPAGPQLLEG LNLGSVDVGY VGEAPPIFAQ
AAGARFAYIG YDPAAPEAEA LLVPKTSAIK SVAELKGKKV ALNKGSNVHY LLVKLLEKNG
LKLSDVQPIY LAPADARAAF ESGSVDAWVI WDPFAAAAEK AIGARVLANG KGVVNNYAYY
LAERNFAAKN PKVIQALFDD SVERGAWLKA NVRKAAEQIA PLQGLPVEVV ELSLHRYEFK
VKPVPDSVIA DQQKLADTFF DLKLIPKAIA VRDAAYQAAP
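
Consistency check

The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is not part of the original record, and all function names in it are hypothetical. It assumes that the start and end coordinates are both inclusive and that the gene length counts the terminal stop codon, which is consistent with the 1023 bp and 340 aa reported above. It also includes a helper for stripping the 10-base display grouping from a sequence before computing GC content.

```python
# Hypothetical helpers for sanity-checking a gene record; not from the source database.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the CDS includes its stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def strip_grouping(sequence: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(sequence.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = strip_grouping(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    length = gene_length_bp(2555781, 2556803)
    print(length, protein_length_aa(length))
```

With the coordinates from this record, the two length functions return 1023 and 340, matching the Gene Length and Protein Length fields. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.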