Gene Pnap_2107 details


Gene Information

Locus tag: Pnap_2107
Symbol:
ID: 4688716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 2246034
End bp: 2247215
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 61%
IMG OID: 639835118
Product: benzoate transporter
Protein accession: YP_982337
Protein GI: 121605008
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3135] Uncharacterized protein involved in benzoate metabolism
TIGRFAM ID: [TIGR00843] benzoate transporter
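
The coordinate and length fields in this record agree with one another if the coordinates are read as 1-based and inclusive, and if the CDS ends in a single stop codon. Both conventions are assumptions here; the record does not state them. A minimal Python sketch of the arithmetic, with hypothetical helper names:

```python
# Values copied from the Start bp and End bp fields of the record.
START_BP = 2246034
END_BP = 2247215


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded, assuming the CDS ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1182 393
```

The printed values match the Gene Length and Protein Length fields.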


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.07884
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.334527
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG ACTTTTCCCT GTCAGCCGTC ACGGCTGGAT TCCTGGCCGT GCTGATTTCC 
TATGCCGGTC CGCTGGTGAT ATTTTTCCAG GCCGGCCAAA GTGCCCATGT CTCCGCGGAA
ATGATGTCGT CCTGGGTCTG GGCCATTTCA ATCGGGGCAG GGGTGTCGGG CATCTTGCTG
AGCTGGTGGC TCAAGGTGCC GGTCGTGACG GCCTGGTCCG CACCGGGCAC GGCCTTGCTG
GTGACCTTGT TCCCCGCCAT CACGCTCGGG CAGGCCGTCG GGGCTTACCT GGTGTCGGCG
GTCATTATTT TCATCATTGG CGTGTCCGGT TATTTTGACA AGCTGGTGCA GGCCATCCCC
AAAGGCATTG CCAGCGCCAT GATGGCGGGC ATTTTGTTCC AGTTTGGCGT CGGCGCGTTT
CAAGCGGTCA CAGCGATGCC GCTCATCACC TTTTGCATGA TGGGCACTTA TTTGCTCTTC
AGGCGCTTGC TGCCGCGCTA TTGCCTGGTG ATTTTGCTGG TGATCAGTCT GGTGCTGGCG
GTGGCCCTGG AGGGCGTGAG CCTGGCCGGC GTGACATTCA CTCTGGCCAG CCCGGTGTTC
ATCACACCCG AGTGGACCTG GGGCGCAACC CTGAGCCTGG CGCTGCCGCT GGTGCTGGTC
AGCGTGACGG GGCAGTTCTT GCCGGGCATG GCCATCTTGC GCAGTTCGGG CTACAGCACG
CCGGCCCGTC CCATCATCAT CACGACCAGC CTGGCCTCGC TGGGCGTGGC ATTTTTCGGC
GGCATCACGA TTGTGATTGC GGCCATCACG GCGGCGCTTT GCACTGGCAA GGACGCGCAT
GAAGACGCCA CAAAACGCTA TGTGGCAGGC ATTGCCAACG GCGTGTTTTA CCTGGTGGGC
GGCTGCTTTG CCGGCACCAT CATCTTGTTT TTTGCAGCCT TGCCCAAAGC GCTGATCGCG
GTGCTGGCCG GACTGGCCCT GGTGGGCGCG ATTGGGGGTA GTCTGGCCGG TGCAATGAAC
GAAGCCGATC ACCGGGAGGC CTCGATCATC ACCTTCTTGG CCACAGCGTC GGGCATGACG
TTCTGGGGCC TGGGGTCGGC GTTCTGGGGA GTGGTCATTG GCGCGCTAGC CTATTTGCTG
CTGCATAAGC AATGGTTTCT TCCCGCGAAG GCGAGGCTTT GA
 
Protein sequence
MKKDFSLSAV TAGFLAVLIS YAGPLVIFFQ AGQSAHVSAE MMSSWVWAIS IGAGVSGILL 
SWWLKVPVVT AWSAPGTALL VTLFPAITLG QAVGAYLVSA VIIFIIGVSG YFDKLVQAIP
KGIASAMMAG ILFQFGVGAF QAVTAMPLIT FCMMGTYLLF RRLLPRYCLV ILLVISLVLA
VALEGVSLAG VTFTLASPVF ITPEWTWGAT LSLALPLVLV SVTGQFLPGM AILRSSGYST
PARPIIITTS LASLGVAFFG GITIVIAAIT AALCTGKDAH EDATKRYVAG IANGVFYLVG
GCFAGTIILF FAALPKALIA VLAGLALVGA IGGSLAGAMN EADHREASII TFLATASGMT
FWGLGSAFWG VVIGALAYLL LHKQWFLPAK ARL
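
The protein sequence can be checked against the gene sequence by translating it codon by codon. The Python sketch below is one way to do that, not the method used to build this record. It assumes the sequences are pasted in as plain strings with the 10-base spacing shown above, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns codons to amino acids in the same way as the standard code and differs only in the start codons it permits, so a standard codon table is used for elongation.

```python
# Standard codon table in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = ("ATGAAAAAAG ACTTTTCCCT GTCAGCCGTC "
              "ACGGCTGGAT TCCTGGCCGT GCTGATTTCC")
print(translate(first_line))  # MKKDFSLSAVTAGFLAVLIS
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.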