Gene Pnap_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_2017 
Symbol 
ID4689397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp2144171 
End bp2145985 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content63% 
IMG OID639835025 
Productextracellular solute-binding protein 
Protein accessionYP_982247 
Protein GI121604918 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGATT TGATTTTTTT ACGGGCTTGG CTGGCCGTTT GCGCGCTGTG CCTTGCGCCG 
GCAAGCTGGG CGGCCCACGC CTATGCCCAG TTTGGCGACA TCAAGTATCC GGCCGGCTTC
ACGCATTTCG GCTATGTCAA TCCCGCAGCC CCCAAGGGCG GCGAAATCCG CATGGTGCCG
CCGACCCGGC CAACCAATTT CGACAAGTTC AACCCGTTCA CCCTGCGCGG CACCGCGCCG
TATGGCCTGG GAATTTTGCT GATCGAAAGC CTGCTGACGG GCAATTCGGA AGAGCCGACC
ACCGCCTATG GACTGCTGGC CGATGACGTG ACGGTCGCGC CCGACAGGCT GTCGGCCACC
TTTCGCCTCA ACGAGAAGGC CCGTTTTCAC AACGGCGCGC CTGTGCTGGC CGCCGATGTG
CTGCATTCGT ACACCCAGCT GACCAGCAAG CTCGCCGCGC CGCAGTACCG CACGATTTAT
GCCGAAGTCA AGAGCGTCAC TGTGGTTTCC GAGCGCGTGG TGCGTTTTGA TTTCCTGACG
CCCAACCCCG AGTTGCCGCT GGTGGTGGGT GGCATGCCGG TGTTCAGCCG CGACTGGGGC
AAGGCCAAGC CCTTCGACAA GATCGTGTCT GAAGTGCCGA TTGGCTCGGG GCCGTACAAA
ATTGCCAGCC CGGCAATGGG GCGCGACATT ACCTATGTGC GCGATCCGGC GTACTGGGGC
AATGAGTTGC CAAGCCGCAA GGGCCAGTTC AACTTTGACC GCATCAGCTT CAAGATTTAC
CTCGACGAAA CCTCACGCTT CGAGGGGCTG AAGGCCGGCG AATTCGATTT CCTGCGTGAA
TTCATCTCGC GCAACTGGGC GCGGCAGTAC ACCGGCAAGC AGTTCACCTC GGGTGAACTG
GTCAAGCGCG CTTTTGAAAA CCGCAACCCC GGCGATTTCC AGGGCTATGT GTTCAACCTG
CGCAAGCCCA AGTTCCAGGA TGCGCGGGTG CGCAAGGCGA TTGGCCTGGC GATGGATTTC
GAGTGGATGA ACCGGCAGCT GTTCTACGGC CTGTACAAGC GCGTCAATGG CTATTTTCCC
AACAGCGAAT TCCATGCGGA AGGCCTGCCC AGGCCCGATG AACTGGCCCT GCTCGACCCC
TTGCGCGCCA GACTCAAGCC CGAAGTTTTC GGTCCCGTGC CGGTGTCGCC CAGCACCACG
CCGCCCGGCA GCCTGCGGGG CAACCTGCGC CAGGCCCAGG CGCTGCTGCG CGAGGCCGGC
TGGACGTACC GCGATGGCGC GCTGCGCAAT GCGAAGGGTG AGGCTTTCAC CATGGAATTC
CTGAACGACC AGCCTTCGCT GGTGCGCATC GTCGGGCCGT TCCAGAAGGC GCTGGAAAAG
CTCGGCATCA CCATGACCTA CCGCATCGTC GATTTCTCGC TGGGCAAGCA GAAGATGGAC
GCCTTCGATT TCGAGGTCAC GACGCTGCGC CTGCCCGGCA GCACCGCGCC CGGCGGCGAG
TTGCTCGAAT TGTTCGGCTC GAAGGCGGCC ACCACGCCGG GGTCTTCAAA TGTCTGGGGC
ATTGCCGACC CGGCCGTCGA TGCGCTGCTG CAAAAGGTCG TGACCGCCAA GACCCGGCCT
GAATTGAGCG CGGCCATGCG CGCGCTGGAC CGGGTGCTGA CCAACGGCTA TTACTCGGTG
CCGCAGTATT ACGGCGACGC CTTCCTGATC GGCTACCGGC CACGTCCTTT TGTGCTGCCA
GCCGTCATCC CGCCGTATTA CCAGCCCGAC ACCTGGGCCA TGAGCACCTG GTGGGCGTCG
CCCTCCAACA AATAG
 
Protein sequence
MRDLIFLRAW LAVCALCLAP ASWAAHAYAQ FGDIKYPAGF THFGYVNPAA PKGGEIRMVP 
PTRPTNFDKF NPFTLRGTAP YGLGILLIES LLTGNSEEPT TAYGLLADDV TVAPDRLSAT
FRLNEKARFH NGAPVLAADV LHSYTQLTSK LAAPQYRTIY AEVKSVTVVS ERVVRFDFLT
PNPELPLVVG GMPVFSRDWG KAKPFDKIVS EVPIGSGPYK IASPAMGRDI TYVRDPAYWG
NELPSRKGQF NFDRISFKIY LDETSRFEGL KAGEFDFLRE FISRNWARQY TGKQFTSGEL
VKRAFENRNP GDFQGYVFNL RKPKFQDARV RKAIGLAMDF EWMNRQLFYG LYKRVNGYFP
NSEFHAEGLP RPDELALLDP LRARLKPEVF GPVPVSPSTT PPGSLRGNLR QAQALLREAG
WTYRDGALRN AKGEAFTMEF LNDQPSLVRI VGPFQKALEK LGITMTYRIV DFSLGKQKMD
AFDFEVTTLR LPGSTAPGGE LLELFGSKAA TTPGSSNVWG IADPAVDALL QKVVTAKTRP
ELSAAMRALD RVLTNGYYSV PQYYGDAFLI GYRPRPFVLP AVIPPYYQPD TWAMSTWWAS
PSNK