Gene Pnap_3821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3821 
Symbol 
ID4687559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4075885 
End bp4077087 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content71% 
IMG OID639836839 
Productmajor facilitator superfamily transporter 
Protein accessionYP_984038 
Protein GI121606709 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.181358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.71225 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCA CCGAGCGGCG CGCCAGCTTC TCGCTGGCCT CGATCTTTGC CCTGCGCATG 
TTGGGCCTGT TCCTGGTGCT GCCGGTGTTC GCGCTGGAGG CTGCACGCTA TCCGGGCGGC
GACGACCCGG CCCGGGTCGG CCTGGCGATG GGCATTTACG GACTGACGCA GGCGCTGCTG
CAGATCCCCT TCGGGCTGGC GTCGGACCGC CTGGGCCGTA AGCGGGTGAT CGTGGCCGGG
TTGCTGGTGT TCGCGCTGGG CAGCTTTGTC GCGGCGGCTG CCCCCGACCT GAACTGGCTG
CTGGCGGGCC GGGCGCTGCA GGGCGCGGGC GCCATCTCGG CGGCCGTCAC GGCGCTGCTG
GCCGACCTGA CGCGCGACGA GGTGCGCACC AAGGCCATGG CGCTGGTCGG CGGCAGCATC
GGGCTGATGT TTGCCGTGTC GCTGGTGCTG GCGCCGGCGC TCAATGCCCG CATCGGGCTG
AGCGGCCTGT TCATGCTGAC CGGCCTGCTG GCGCTGGCCG GCATTGCCAT GGTGCTGTGG
CTGGTGCCGC CCGAGCCGCT GCTGCACAAG GACATGGCGC GTGGCGGCCT GGCCGGCGTG
CTCGGGCGCG GCGACATGCT GGGCCTGAAC TTTGGCGTGT TCGTGCTGCA TGCCGTGCAG
CTGTCGATGT GGGTGGCGGT GCCGGCGCTG CTGGTGCAGG CCGGGCTGCT CAAGGCCCTG
CACTGGCAGG TCTATCTGCC GGCGGTGCTG GCCTCGTTCG TGGTGATGGG CGGCACGCTG
TTTCCGCTGG AGCGCCGCGG CCATCTGCGC GCCGTGCTGC TGGCCGCGAT TGCCCTGATG
GCGCTGGTGC AGTTCGGTTT CCTGGGGGTC GCGCTGGCGG CCGGCGGCGC GGCGCCGTCG
CTGGCGGTGC TGGGCGGCTT GCTGCTGCTG TTTTTTTGCA GTTTCAACGT GCTCGAAGCC
AGCCAGCCGA GCCTGGTGTC GCGCCTGGCG CACGCCTCAA GCCGTGGCGC GGCGCTCGGG
CTTTACAACA CCTCGCAGTC GCTGGGCCTG TTTGCCGGCG GCGCGCTCGG CGGCGCGATG
CTCAAGTGGG GCGGCACGCA AGGCCTGTTT GCCTCGACGG CGGCACTGTC GCTGCTCTGG
CTGGCCGTGG CCTGGCGAAT GATGCCGGCG CAACGCCCGG TCGCCCGAAA AGCACCGGCT
TGA
 
Protein sequence
MTATERRASF SLASIFALRM LGLFLVLPVF ALEAARYPGG DDPARVGLAM GIYGLTQALL 
QIPFGLASDR LGRKRVIVAG LLVFALGSFV AAAAPDLNWL LAGRALQGAG AISAAVTALL
ADLTRDEVRT KAMALVGGSI GLMFAVSLVL APALNARIGL SGLFMLTGLL ALAGIAMVLW
LVPPEPLLHK DMARGGLAGV LGRGDMLGLN FGVFVLHAVQ LSMWVAVPAL LVQAGLLKAL
HWQVYLPAVL ASFVVMGGTL FPLERRGHLR AVLLAAIALM ALVQFGFLGV ALAAGGAAPS
LAVLGGLLLL FFCSFNVLEA SQPSLVSRLA HASSRGAALG LYNTSQSLGL FAGGALGGAM
LKWGGTQGLF ASTAALSLLW LAVAWRMMPA QRPVARKAPA