Gene Pnap_1198 details

Gene Information

Locus tag: Pnap_1198
Symbol:
ID: 4689880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 1273356
End bp: 1274597
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 66%
IMG OID: 639834201
Product: major facilitator superfamily transporter
Protein accession: YP_981434
Protein GI: 121604105
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
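
The length fields in this record are consistent with the coordinates, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Coordinates taken from the record
gene_len = gene_length_bp(1273356, 1274597)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)
```

With the values from this record, the result agrees with the listed gene length of 1242 bp and protein length of 413 aa.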


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.852259
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCAA GCCCCCTTTC TTCGCCCCCA AAACTCTCCA TGCTCCAGGT GCTGGCCTGC 
GGCGCGGCCA TTGTCACGCT GTCGATGGGC ATCCGGCACG GCTTTGGCCT GTGGCTGCAG
CCCATCACGC AGGCGCAGGG CTGGACGCGT GAAACCTTCG CCTTTGCGAT TGCGGTGCAA
AACCTGTCGT GGGGCATCTT CGGGATTTTC GCCGGCATGG TGGCTGACCG CTTCGGCGCG
TTCCGGGTGA TTGCGGGCGG CGCCGTGCTG TACGCGCTGG GCCTGGTGGG CATGGCGCTG
TCGCCGACCG GCCTGCTGTT CACGCTGACG GCCGGGGTGC TGATCGGGGC CGCGCAGGCA
GGCACCACCT ACGCGGTGAT CTACGGCGTC ATCGGCCGCA ATATTTCGGC CGACAAGCGC
TCGTGGGCGA TGGGAGTCGC CGCTGCGGCG GGCTCGTTCG GCCAGTTCCT GATGGTGCCT
ACCGAAGGCT TCCTGATCAG CAGCCTGGGC TGGCAGGCCG CCCTCCTGGT GCTCGGCGGC
GCCGTGCTGC TGATCGTGCC GATGGCGCTG GGCTTGCGCG AAACCGGCTT TGCCGGCGCA
ACGCCCGCCA AGCGCGACCA GAGCATAGCC CAGGCGCTGC GCGAGGCGCT CAAATACCCG
AGCTTTCAGA TGCTGATGGC GGGCTACTTC GTCTGCGGCT TTCAGGTCGT GTTCATCGGC
GTGCACATGC CCAGCTACCT CAAGGACAAG GGCCTGTCGC CGCAGGTGGC AGGCTATGCG
CTGGCGCTGA TCGGGCTGTT CAATGTCTTC GGCACCTACA TTGCCGGCTC GCTGGGCCAG
CGCATCGCAA AACGCAAAAT CCTGGCGACG ATTTACTTTT CCCGCGCCGT CGTCATCGCC
GTGTTTCTGG CCGCGCCGCT GAGCCCGGCC AGCGTCTATG TTTTCGCCAG CCTGATGGGG
CTGCTGTGGC TCTCGACGAT TCCGCCGACC AACGCGGTGG TGGCGCAGAT CTTCGGCATC
CAGCACATGT CGATGCTCAG CGGCTTCATC TTCTTCAGCC ACCAGATCGG CTCGTTCATG
GGCGTGTGGC TGGGCGGCGT GCTGTACGAC CGCACCGGCA GCTACGACAT CGTCTGGTAC
ATCGCGATTG CGCTGGGCGT GTTCGCGGGA CTGGTGAACC TGCCGGTCCG GGAAGCGCCA
ATCGAGCGCA GCAGCCCGGG CGGATTGCCG CAGGGAGCCT GA
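
The GC content field can be recomputed from the gene sequence with a sketch along these lines. It assumes the sequence is pasted as displayed here, in whitespace-separated blocks of 10 bases; `clean_sequence` and `gc_fraction` are illustrative names. Only the first line of the sequence is used as sample input; pass the full sequence to compare against the reported value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, used here only as sample input
first_line = "ATGGCCTCAA GCCCCCTTTC TTCGCCCCCA AAACTCTCCA TGCTCCAGGT GCTGGCCTGC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```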
 
Protein sequence
MASSPLSSPP KLSMLQVLAC GAAIVTLSMG IRHGFGLWLQ PITQAQGWTR ETFAFAIAVQ 
NLSWGIFGIF AGMVADRFGA FRVIAGGAVL YALGLVGMAL SPTGLLFTLT AGVLIGAAQA
GTTYAVIYGV IGRNISADKR SWAMGVAAAA GSFGQFLMVP TEGFLISSLG WQAALLVLGG
AVLLIVPMAL GLRETGFAGA TPAKRDQSIA QALREALKYP SFQMLMAGYF VCGFQVVFIG
VHMPSYLKDK GLSPQVAGYA LALIGLFNVF GTYIAGSLGQ RIAKRKILAT IYFSRAVVIA
VFLAAPLSPA SVYVFASLMG LLWLSTIPPT NAVVAQIFGI QHMSMLSGFI FFSHQIGSFM
GVWLGGVLYD RTGSYDIVWY IAIALGVFAG LVNLPVREAP IERSSPGGLP QGA
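
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal standard-library version, not the tool used by the database. It assumes the sequence is read in frame from its first base, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Last line of the gene sequence; it starts on a codon boundary
# because the 1200 bases before it are a multiple of 3.
last_line = "ATCGAGCGCA GCAGCCCGGG CGGATTGCCG CAGGGAGCCT GA"
print(translate(last_line))  # IERSSPGGLPQGA
```

The output matches the final 13 residues of the protein sequence, and the terminal TGA codon is the stop.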