Gene Rpal_4220 details


Gene Information

Locus tag: Rpal_4220
Symbol:
ID: 6411904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4525482
End bp: 4527014
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 65%
IMG OID: 642714102
Product: drug resistance transporter, EmrB/QacA subfamily
Protein accession: YP_001993191
Protein GI: 192292586
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
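
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 4525482
end_bp = 4527014
gene_length_bp = 1533
protein_length_aa = 510

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                  # span matches the listed gene length
print(span_bp % 3 == 0)                           # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)   # matches the listed protein length
```

Under these assumptions, all three checks should agree with the listed values.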


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.663619
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGCCG CTCCGGTCAA ACAACCAATC ACGCCGCTAC AAGGCGCCCC CCGCGCGTTG 
GCGGCGGTGA CGTTCGGCCT CGCCTCGTTC ATGGCGGTGG TCGATATCAC CGTCGCCAAC
GTGTCGGTGC CGACGATCTC CGGCAATCTT GGCGTCAGCT CCGAAGAGGG CGAATGGACG
ATCACGTTCT TCGCTATCTC CAACGCGATC TGCATTCCGT TGACCGGCTG GCTCGGCCGA
CGCTTCGGGC AGGCACGGCT GTTCGCGGCC TCGGTCGCCG CCTTTACGCT CGCTTCGATC
CTGTGCGGGC TGGCGCCGAC GTTCGAAACG CTGCTGATCG CCCGCATCCT GCAGGGCGCG
GTCGCGGGTC CGATCGTGCC GCTCAGCCAG GCGCTGCTCG TCGCGGTGTT TCCGCCTGAG
AAGCGCACCT TTGCGGTGGC GATGTGGGCG ATGACCAACA TGGCGGGTCC GGTCGCGGGC
CCGATGCTCG GCGGCTGGAT CACCGACCAA TTCTCCTGGC CGTGGATCTT CCTGATCAAC
GCACCGGTCG GCGTCTTCGT GGTGATCTCG GCCGGAATCC TGCTCCGCGG CAAGGATACG
CCGACGGTCC GGCTACCAGT CGATGTCGTG GGCCTCGCCC TGTTGGCGAT CGCGGTGGGC
TGCCTTCAGG TCACCTTCGA TCGCGGCCGC ACGCTCGACT GGTTTGCGTC GCCGCTGATC
TGTGCCACCG CGACGATCTC GGTGGTCGGC TTCGTTCTCT TGGTGGTCTG GGAACTCGGC
GAAGCTCACC CGATCGTGGA TCTGCGCCTG TTCGAATTTC GCAACTTCGC GATCGGTACG
TTGAGCGTCG CGGTCGGCTT CGGTCTGTAC TTCGCGGCAC TCGTTCTCGT GCCGCTGTGG
CTGCAGACGG ATCTCGGCTA CAGCTCGACC TGGGCCGGCG TCGCGACCGC TCCCATGGGG
GTATTCGGAA TCGTGCTGGC GCCGTTTCTC GGCCGCTGGG TGGCGACCCA CGGGCCGCGC
CTCTATGCCA GCATCGCCTT CGCCGCCTGG GCGCTGGTGG CGCTGTGGCG TTCGACGATG
ACGACCGGCG TGACCGTCGC GGACGTTGCG CTGATCCATC TCGCGCAAGG CATCGGCATC
GCATTTTTCC TCACGCCGGT GGTGAGCCTG TCGCTGGCCG GACTGCCGCC CGACAAACTC
GCGTCCGCCT CTGGCCTGCA GACCGCAATC CGCATGATGG CCGGCAGCCT GTGCGCATCG
ATCGCCCAGA CGTTCTGGGA CGAACGCGCG CGCTTCCATC GCAATCACCT GGTCGATGCC
CTGACGGAGC TGAGCGGTCG AGCCGCAACG GCAATCAATC AGTTACGGAG CGCCGGCCTG
ACCGAACAGC AGTCCTGGAC GGTGATCAAT CAACAGATCG ACGTTCAGGC GCGCATGCTG
TCGCTGAACG ACTTCTTCTA CGTCTCTGCT TTCGCGTTCG CTGCTGCGCT CGGCATCATC
TGGCTCGCAC GCGGACCGAA ACAATCAACC TGA
 
Protein sequence
MTAAPVKQPI TPLQGAPRAL AAVTFGLASF MAVVDITVAN VSVPTISGNL GVSSEEGEWT 
ITFFAISNAI CIPLTGWLGR RFGQARLFAA SVAAFTLASI LCGLAPTFET LLIARILQGA
VAGPIVPLSQ ALLVAVFPPE KRTFAVAMWA MTNMAGPVAG PMLGGWITDQ FSWPWIFLIN
APVGVFVVIS AGILLRGKDT PTVRLPVDVV GLALLAIAVG CLQVTFDRGR TLDWFASPLI
CATATISVVG FVLLVVWELG EAHPIVDLRL FEFRNFAIGT LSVAVGFGLY FAALVLVPLW
LQTDLGYSST WAGVATAPMG VFGIVLAPFL GRWVATHGPR LYASIAFAAW ALVALWRSTM
TTGVTVADVA LIHLAQGIGI AFFLTPVVSL SLAGLPPDKL ASASGLQTAI RMMAGSLCAS
IAQTFWDERA RFHRNHLVDA LTELSGRAAT AINQLRSAGL TEQQSWTVIN QQIDVQARML
SLNDFFYVSA FAFAAALGII WLARGPKQST
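
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. The record lists translation table 11, which shares its codon-to-amino-acid assignments with the standard code (the two differ in permitted start codons), so the sketch builds the table from the standard NCBI amino acid string. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool, and the sketch does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G, with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in the record.
print(translate("ATGACAGCCG CTCCGGTCAA ACAACCAATC"))  # MTAAPVKQPI
```

The printed peptide matches the first ten residues of the protein sequence in the record. Passing the full gene sequence should, under the same assumptions, yield the 510-residue protein.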