Gene Pnap_0227 details


Gene Information

Locus tag: Pnap_0227
Symbol:
ID: 4687472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 233340
End bp: 234614
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 58%
IMG OID: 639833220
Product: extracellular solute-binding protein
Protein accession: YP_980473
Protein GI: 121603144
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
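
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, although both agree with the listed values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that contributes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(233340, 234614)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1275 bp and 424 aa, matching the Gene Length and Protein Length fields.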


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.998322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.257108
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAGA AATATTCGGG ATTTATGGCT GCGCTGACAG CAACACTCGC CACCGCCATG 
CCGGTGCAGG CTCAGACCAT GGTGACGATG TGGGTGCATG CCGGGCCAGG CCCTGAGGCG
GTTGCCTACT CCGCAGCGAC AGATGCCTTC AACGCGCAGA ACAAGGACAT CAAGCTTGAC
CTCGTGAAGT TGCCCGAAGG CAGCTACAGC AACCAGGTGA GTGCTGCTGC ACTTGCCCGC
AAGCTGCCCT GCCTGTTGGA CTTCGATGGT CCTAACGTCT ACAACTACGC ATGGACGAAG
AAGATCATCC CGCTGGACAC TTTTCCGGAG TTGGTCGCGG TAAAAGCAGA CCTGCTTCCA
TCGCTGCTTC GCCAGGGAAC TTACGGCGGA AAACTCTACA GCTTGGGCCA GTTTGACTCA
GGTCTGGGAA TCTGGGGCAA CAAAAAGCTG CTCGAAAAGG CGGGGGTGCG TATCCCTGGC
TCTGTGCTCG AGGCGTGGAC GCTCGCCGAG TTCGAGGACG CGCTGAAGAA GCTCAAAAGC
AGCGGTGTGG CCGCTCCGTT GGACATGAAG TTCAACTATG GCGTCGGAGA GTGGTTCACC
TATGGTTTCT CGCCAATCGT GCAGAGCTTT GGCGCAGACC TGATCGACCG CAAGACATTT
AAATCCTCCA AGGGAGTGAT CAATGGCGAT GCCGCGGTGA AAGCGCTGAC CACGCTGCAG
GGCTGGGTCA AGGCGGGTTA CGTGAATCCT GCCACCAAGG ATGACGGCGA CTTTATCAAG
GGCAAGGCGG CGCTGTCCTA TGTGGGCCAC TGGACCTTCA AGGACTACAA GAAGGCGCTC
GGTGATGATC TGGTGCTGAT TCCCATGCCG GCCTTCGGCG CCAAGCCGGT GACTGGCGCT
GGCTCATGGA ACTTCGGCAT CTCGGCCGAC TGCAAGGATC CGAAGGCGGC AGCCAAGGTG
CTGGCCCATC TGATGTCAAC CCCGGAGATC CTGCGCGTGA CCGAGGCAAA TGGTGCAATG
CCTGGAACCA ATTCAGCGCT GGCCCAGAGC AGGGACTACG GTGTCAAGGG CGGCTTGAAC
ATTTATGTAC AGCAGGTTCG CCAGGGAGTG GCGCTGGTAC GCCCTGAAAC GCCGGCTTAT
CCGGCCATCA GCACGGCCTT TGCCGAGGCG TTAAACAACA TTGTGGCGGG CGCTGACGTC
CAAAAGGAGC TCGACCGCGC CTCCAAAAAG ATCGACCAGA ACATAGAGGA CAACAAGGGT
TACCCGATCA AGTGA
 
Protein sequence
MKQKYSGFMA ALTATLATAM PVQAQTMVTM WVHAGPGPEA VAYSAATDAF NAQNKDIKLD 
LVKLPEGSYS NQVSAAALAR KLPCLLDFDG PNVYNYAWTK KIIPLDTFPE LVAVKADLLP
SLLRQGTYGG KLYSLGQFDS GLGIWGNKKL LEKAGVRIPG SVLEAWTLAE FEDALKKLKS
SGVAAPLDMK FNYGVGEWFT YGFSPIVQSF GADLIDRKTF KSSKGVINGD AAVKALTTLQ
GWVKAGYVNP ATKDDGDFIK GKAALSYVGH WTFKDYKKAL GDDLVLIPMP AFGAKPVTGA
GSWNFGISAD CKDPKAAAKV LAHLMSTPEI LRVTEANGAM PGTNSALAQS RDYGVKGGLN
IYVQQVRQGV ALVRPETPAY PAISTAFAEA LNNIVAGADV QKELDRASKK IDQNIEDNKG
YPIK
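
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned up, checked for GC content, and translated. It is not the database's own tooling, and all function names are hypothetical. The codon table is the standard one, whose amino acid assignments are the same in translation table 11; alternative start codons, which table 11 permits, are not handled here.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 uses the same amino acid assignments.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate whole codons from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the listing above.
first_blocks = clean_sequence("ATGAAGCAGA AATATTCGGG ATTTATGGCT")
print(translate(first_blocks))
```

Run on the first three blocks of the gene sequence, this prints MKQKYSGFMA, which matches the first block of the protein sequence. Applying gc_fraction to the full cleaned gene sequence would be one way to compare against the GC content field.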