Gene Pnap_1014 details


Gene Information

Locus tag: Pnap_1014
Symbol:
ID: 4686425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1071690
End bp: 1072802
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 62%
IMG OID: 639834013
Product: putative substrate-binding periplasmic (PBP) ABC transporter protein
Protein accession: YP_981252
Protein GI: 121603923
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
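
The start, end, gene length, and protein length fields above are arithmetically related. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the record states neither explicitly; the variable names are illustrative only):

```python
# Values copied from the record above.
start_bp = 1071690
end_bp = 1072802

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1113 bp
print(protein_length, "aa")   # 370 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.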


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0676014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCACCC TGGCCGTTGG CGGCGTCATG GCTGCCGCGC TGTCGGCGGG CGCTGCCCCG 
ATTGTCGTGG GGCAGGTTGC ACCCTTGAGC GGTATGGAGG CGCGTCAGGG GCGAGCTTAT
TCAATCGGTA TCCGGCTGGC CCTGAACAAG GCCAACACGG CGGGCGGGGT GAATGGCAAC
ACCTTCAGCC TGGTGAGCAA GGACGATGGC GGCCGGTCTG ACGACACCCT GGCGGCCACC
CGGCTGCTGC TGAGCGAGAG CCGCCCCCTG GTGCTGGCGG GTTATTTTGG CGACCGCAGC
ATGGCTGATC TGGCCGGCTC GGGCCTGCTT GGAAAAGAGA AAATCGCCCT GGTTGGTTAC
CGCGTCAATG AGATTCGGGA AGAGGCGCCG CTGATCTACA GCGTTCGCGC CACCTTGCGT
GACGAAATCA ACAAGATCGT CGAGCATCTG GCCACCGTGG GCATCACGCG CCTGGGACTG
TTTTACCCGG ATGGACCCGG TGCGGCACCG CTGATTGCGG CCATGGAAGA CGTGGCGAAG
AAGAAGAATG TCAAACTTCT GGTCAAGGGG TCTTATAAAC CGGACTCGGC CAAGGTGGCC
GGTGCGGTCA TCGATGCGTT TATCGCGGCC GCGCCGCAGG CGATCATCAT TGCATCCAGT
GGCTCGGCGG CAGCGGACTT TATTGAAAAA TACCGGATGA ATGGAGGCGC GGCCCAATTG
TTTGCCCATT CCGGCGCAGA CATCGAGCAC ATCTCCCAGC GCCTGGGCGA AGAGCACATG
AAAGGCATGG CGATTGCACA GGTCACGCCG AATCCGTACA AGATTTCAGG GCTTCTGAGC
AAGGAATTCA TCGACACGGC GGCCAAAACA CCGGACTTGG GCATGCCTGT GAGCTACGCC
ATGATGGAAG GCTATATTGC CGGGTCCGTG ATCGTGGAGG CGGCGCGGCG CATGGGACCC
AAGGTGTCGC GCGAGGGCTT CGTGTCGGCG CTTGAGAGCA TCGACAACCT GGACATGGGC
GGCTACAAGC TGGGCTTCAA GCCGGGCATG CGCTCAGGCT CGAAGTTTGT CGAACTGACC
ATCGTCACTG CCACGGGGCG CATCCGGCAA TAA
 
Protein sequence
MRTLAVGGVM AAALSAGAAP IVVGQVAPLS GMEARQGRAY SIGIRLALNK ANTAGGVNGN 
TFSLVSKDDG GRSDDTLAAT RLLLSESRPL VLAGYFGDRS MADLAGSGLL GKEKIALVGY
RVNEIREEAP LIYSVRATLR DEINKIVEHL ATVGITRLGL FYPDGPGAAP LIAAMEDVAK
KKNVKLLVKG SYKPDSAKVA GAVIDAFIAA APQAIIIASS GSAAADFIEK YRMNGGAAQL
FAHSGADIEH ISQRLGEEHM KGMAIAQVTP NPYKISGLLS KEFIDTAAKT PDLGMPVSYA
MMEGYIAGSV IVEAARRMGP KVSREGFVSA LESIDNLDMG GYKLGFKPGM RSGSKFVELT
IVTATGRIRQ
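
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The sketch below shows one way to strip the spacing, compute GC content, and translate a nucleotide fragment. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares; alternative start codons are not handled, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the last line of the gene sequence listed above.
print(translate("ATGCGCACCC TGGCCGTTGG CGGCGTCATG"))       # MRTLAVGGVM
print(translate("ATCGTCACTG CCACGGGGCG CATCCGGCAA TAA"))   # IVTATGRIRQ
```

Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked against the listed sequences.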