Gene Pnap_0228 details

Gene Information

Locus tag: Pnap_0228
Symbol:
ID: 4687991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 234667
End bp: 235974
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 62%
IMG OID: 639833221
Product: extracellular solute-binding protein
Protein accession: YP_980474
Protein GI: 121603145
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
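
The coordinate and length fields above can be cross-checked with simple arithmetic. A minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon (neither convention is stated in the record):

```python
# Values copied from the record above.
start_bp = 234667
end_bp = 235974

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1308 435
```

Under these assumptions the result agrees with the reported gene length (1308 bp) and protein length (435 aa).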


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.6533
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.282981
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACC CCCTTGAACT CACCGCAGGT GTGAGAGCCC AGCGTGCGTG CATGGCGTTT 
CTAGCGTTCG GGCTGGCCTG TTCGCTGCCA GCTGCGGCCC AGACGCTGGA CGTCTGGGTT
CACGCCGGGC TGGGTCCAGA GCGCGATGCC TACACGGCAT CGATCAAGGC GTTCAACGAG
GCCGGGCGCA ATCTCGGCGG CAAGGCGCAG GCTGTCCTGG TTCCAGTGCC TGAAGTGGGC
TACAACGAGG CGGTGGCGAA GGCTGCGGCT GACGGGCGGC TGCCTTGCGT GCTTGAGTTC
GATGGCCCCA ATGTCGCGGC CTACGCCGCG GCGGGGCACC TGCTGCCGCT GGAAAAAGTG
CAGTCGCTGG CCAGGATTCG CAATTCGATG CTTGCGTCGC TGGTGCGTCA GGGAACGGTC
AATGGGCGGC TGTACAGCGT GGCTCAATAC GACTCAGGTA TGGCGCTATG GGGAAATCGG
AACATGCTGA ATGCCGCCGG TGTGCGAATT CCAGCCAGAG CGGGCGACGG CTGGACGTTG
ACAGAGTTTG AAGACGTTCT CAAGCGGCTG AAGAACGCTG GCGTGCCTTC GCCGCTGGAC
ATGAAGTTCA ACTATGGCGT CGGAGAATGG TTCACTTATG GGTTTGCCCC GATTGTTCAG
GGTTTCGGCG GCGACCTGAT CGAGCGCGGC AGCATGCGCA GTGCCCAGGG TGTGCTCAAC
GGCCCGGGCG CCGTCAAGGC CATGAGCGCC TTGCAGAGCT GGATCAAGGC CGGTTACGTG
GATGTCATGC CAAAGGATGA TCGCGCATTC ATCGAAGGTC GCTCGGCCCT GTCTTGGGTG
GGGCATTGGG TCTACAAGGA CTACAAGCAA GCGCTTGGAG ACAACCTAGT GCTGATGCCC
CTGCCTCGCT TCGGTGTGCG GCCCGTGGTT GGCTCCGGGT CGTGGAACTT CGGCATCGCG
GCGTCGTGCA AGGAGCCCCA GCTCGCCATT CGCTTTATTG AACACCTGAT GAGTAGCGCT
GAGGTACTGC GCGTCACGGA CGTAAACGGC GCGGTACCCG GCACTGGCGT CGCGATGGCA
TTTAGCCGGC ACTATGGGCC CACCGGAGAG CTGCGCCTGT ACGCCGACCA GCTAATGTCG
GGCCAGGCTC AGGTGCGTCC GGCTTCACCG GATTACCCTG CCATCACTGC CGAGTTTTCA
AATGCCGTGA ATCGCATCGC TCGCGGTGCA GACCCTCAGC AGACACTCAA CCAGGCGGTG
ATCAACATAG ATCGAAAAAT AGAGAAAGCG CGTGCCGCGC GCCCTTGA
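
The GC content field in the gene information (62%) is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the function names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here, so the full sequence would be needed to compare against the reported value:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short example.
first_line = "ATGAAAAACC CCCTTGAACT CACCGCAGGT GTGAGAGCCC AGCGTGCGTG CATGGCGTTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Stripping whitespace first matters because the sequence is displayed in space-separated groups of ten bases.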
 
Protein sequence
MKNPLELTAG VRAQRACMAF LAFGLACSLP AAAQTLDVWV HAGLGPERDA YTASIKAFNE 
AGRNLGGKAQ AVLVPVPEVG YNEAVAKAAA DGRLPCVLEF DGPNVAAYAA AGHLLPLEKV
QSLARIRNSM LASLVRQGTV NGRLYSVAQY DSGMALWGNR NMLNAAGVRI PARAGDGWTL
TEFEDVLKRL KNAGVPSPLD MKFNYGVGEW FTYGFAPIVQ GFGGDLIERG SMRSAQGVLN
GPGAVKAMSA LQSWIKAGYV DVMPKDDRAF IEGRSALSWV GHWVYKDYKQ ALGDNLVLMP
LPRFGVRPVV GSGSWNFGIA ASCKEPQLAI RFIEHLMSSA EVLRVTDVNG AVPGTGVAMA
FSRHYGPTGE LRLYADQLMS GQAQVRPASP DYPAITAEFS NAVNRIARGA DPQQTLNQAV
INIDRKIEKA RAARP
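
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch follows, assuming the NCBI codon ordering (bases T, C, A, G) and that table 11 assigns the same amino acids to codons as the standard table, differing only in permitted start codons; the `translate` helper is hypothetical:

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
gene_start = "ATGAAAAACC CCCTTGAACT CACCGCAGGT"
print(translate(gene_start))  # MKNPLELTAG
```

The output matches the first ten residues of the protein sequence above.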