Gene RPB_3195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3195 
Symbol 
ID3910996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3653294 
End bp3654766 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content67% 
IMG OID637885097 
Productmajor facilitator transporter 
Protein accessionYP_486802 
Protein GI86750306 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.619069 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAG AACGACTGAT CCCGCTGATC GTGGCCGCCG CGCTGTTCAT GGAGAACATG 
GACGCCACCG TGATCGCCAC GTCATTGCCG GCGATTGCCG CCGATATCGG CACCTCGCCG
CTGACGCTGA AGCTCGCGAT CACGTCCTAT CTGCTGTCGC TGGCGGTGTT CATCCCGGCC
TCCGGCTGGA CCGCCGACCG GTTCGGCGCC CGCACGGTGT TCTCGGCGGC GATCGCGGTG
TTCATGATCG GCTCGATCGG CTGCGCGCTG TCGCAGTCGA TCACCGATTT CGTGCTCGCG
CGCATCCTGC AGGGGCTCGG CGGCGCGATG ATGACGCCGG TCGGACGATT GGTGCTGCTG
CGCTCGATCG ACAAGAGCGC ACTGGTCGGC GCGATGGCGT GGATGACGGT GCCGGCGCTG
ATCGGCCCGG TGATCGGCCC GCCGCTCGGC GGCTTCATCA CCACCTACTT CTCGTGGCAC
TGGATCTTCC TGATCAACAT CCCGATCGGG CTCGCCGGGA TCTTCTTCGC GCAGCGCTAC
ATCGATCCGA TCCGCAGCGA CAATCCGGAG CGGTTCGACC TCTACGGCCT GGTGCTGGCC
GGCATCGGCC TCGCCGGCAT CGCCTTCGGC CTGTCGGTGG CGGGGCTCGG GCTGCTGCCC
TGGCAGGTCG TCGTGGCGCT GATCGCGATC GGCACGATCG CGATGACGCT GTATCTGCTG
CACGCCCGCC GCACCGCCTC GCCGGTGCTG GATTTCTCGC TGCTGCGGCT GACGACGCTG
CGCGCCAGCA TGACCGGCGG CTTCCTGTTC AGGCTCGGCA TCGGCGCATT GCCGTTCCTG
CTGCCGCTGT TGATGCAGAT CGGCTTCGGG CTGTCGCCGT TTCACTCCGG CCTCGTCACC
TTCGCCTCCT CGGCCGGGGC GATGGGGATG AAGCCGCTGG CGGCGCGGAT CATCCGCACC
TTCGGCTTCC GCAAGATCAT GACCATCAAC GCGCTGGTCA GCTCGGTGTT TCTCGCCGCC
TGCGCGCTGT TCACGCCGGC GACGCCGCTG CTGCTGATCA TGATCATCCT GCTGGTCGGC
GGCTTCTTCC GCTCGCTGCA ATTCACCGCG ATCAACACCG TCGCCTATGC CGAGGTGGAG
ACCGCGCAGA TGAGCCGCGC CACCACGCTG GTCAGCGTCG GCCAGCAGCT GGCGATCTCG
GCCGGCGTCG CGATCGGCGC GTTTGCGGTC GAATCCACGA TGGCGTGGCA CGGCACCACG
ACGCTCGGCG CCGACGATTT CGCGCCGGCC TTCGTCGTGG TCGCGGTGCT GTCGGCGTTG
TCGGCGTATT TCTTCTGGCG GATGCCGGAC GATGCCGGCA GCGAGATCTC GGGGCGGAAG
GCGATCGAGA TCTCGAGCCG GAAGGGCGCC GCCGGCGCCG CTGCCAAGAC CGCGACCGAA
GAAACCCAGA CCGCGCGCGA TCAGCGACTG TGA
 
Protein sequence
MNKERLIPLI VAAALFMENM DATVIATSLP AIAADIGTSP LTLKLAITSY LLSLAVFIPA 
SGWTADRFGA RTVFSAAIAV FMIGSIGCAL SQSITDFVLA RILQGLGGAM MTPVGRLVLL
RSIDKSALVG AMAWMTVPAL IGPVIGPPLG GFITTYFSWH WIFLINIPIG LAGIFFAQRY
IDPIRSDNPE RFDLYGLVLA GIGLAGIAFG LSVAGLGLLP WQVVVALIAI GTIAMTLYLL
HARRTASPVL DFSLLRLTTL RASMTGGFLF RLGIGALPFL LPLLMQIGFG LSPFHSGLVT
FASSAGAMGM KPLAARIIRT FGFRKIMTIN ALVSSVFLAA CALFTPATPL LLIMIILLVG
GFFRSLQFTA INTVAYAEVE TAQMSRATTL VSVGQQLAIS AGVAIGAFAV ESTMAWHGTT
TLGADDFAPA FVVVAVLSAL SAYFFWRMPD DAGSEISGRK AIEISSRKGA AGAAAKTATE
ETQTARDQRL