Gene Rpic_4301 details

Gene Information

Locus tag: Rpic_4301
Symbol:
ID: 6285840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia pickettii 12J
Kingdom: Bacteria
Replicon accession: NC_010678
Strand:
Start bp: 597896
End bp: 599086
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 67%
IMG OID: 642618783
Product: major facilitator superfamily MFS_1
Protein accession: YP_001892824
Protein GI: 187926479
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00710] drug resistance transporter, Bcr/CflA subfamily
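
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 597896
end_bp = 599086
gene_length_bp = 1191
protein_length_aa = 396

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons. Assumption: the last codon
# is the stop codon and does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)

assert span == gene_length_bp
assert remainder == 0
assert codons - 1 == protein_length_aa
print(span, codons - 1)
```

Under these assumptions the reported gene length (1191 bp) and protein length (396 aa) are consistent with the coordinates.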


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.555384
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGTC GCCCTCTGAG CTTTGCCCTG CTACTGCCGT TGTTACTGTC GGCACAGCCG 
GTTGCGACCG ACAGCTACCT CCCCGCCCTG CCAGCCATTG CCCAGGCGCT GGGATCGGCC
AGCACGAGCC TCACGGTCTT CGCGCTGGCC TTTGGCATCG GGCAACTGCC GATGGGCAGC
CTGGCCGATC GTTTTGGCCG GCGCCATGTG CTGCTGATCG GGCTCGCGTG CTACGCCCTG
GCTGCACTGG CCGGCGCGCT GGCGACAACC GCTTCCATGC TGACCGCTGC GCGCGCGCTG
CAGGGCTTCT CGATGGCCGC CATCCTCGTG TGTGCGCGTG CTGCCGTACG CGATCTGCAC
CCGGCACGCG ATGGTCCGCA CGTCATGGCA CGCGGCCTCA CGGGGCTGGG CTTCGTGGCG
TTGATGGCAC CGATTCTCGG TGCCTTTGTC GCGCAACACG CGGGTTGGCG CTGGGTGCTG
GTGGGCATGA GCCTCTATGC AATCGTGCTG TGGGCGATGT GCTGGTATGG CTTTGCCGAA
ACCCGGCAAG AGACACACAT CCAGGCAGGC AGCGTGCGCG AGATCTTTGC CAGCGCGGAT
TTTCGCGCCT GGGGGCTGCT GGCGGCCACG ACCTACGCCG GCATCTTCTG CTTCCTGCTG
CTCTCGCCGA TGGTCTATAT CGCGTACCTC GGCTTATCGC CCGAGCTTTA TGGTTGGATA
CCGGCTGGCG GCTCGCTCGT CTATATCGTG AGTACGACCG GCTGCCGGCG CCTGCTGCGC
CGCCAGAGCC TCGTGCGGAC GGTGCAGCAA GGCGCCACCC TCAGCCTGGT TGGCGCCGGC
ATCCAGGGTT TGGGCTGCTG GCTGCTTCCC GGACAAGCGT GGCCGCTGCT GCTCGGGCAC
GGCATCTATT GCATGGGCCA CGGCATTCAC CAGCCCTGCG GCCAGGCAGG CGCGGTGGGC
GAGTTGCCGC ACCTTGCGGG GCGCGCGGTG TCATGGTCGG GCTTCATCAT GATGCTGGCG
GCCTTCTGCC TGGGGCAGAC CGCAGCAGCC TTTGATGACA CGTCGCACAG CCTCGGCGCC
TGGCCCATGG TGGTGCCGAT GATGCTTGCC GGCACCGTGC TGGCGTCCAT CGCCTTCCTT
TGGCTGCCGA AGCTGCAGAG CCAGCCGGAT CCACAAAGCG CTGCCGCATA G
 
Protein sequence
MTRRPLSFAL LLPLLLSAQP VATDSYLPAL PAIAQALGSA STSLTVFALA FGIGQLPMGS 
LADRFGRRHV LLIGLACYAL AALAGALATT ASMLTAARAL QGFSMAAILV CARAAVRDLH
PARDGPHVMA RGLTGLGFVA LMAPILGAFV AQHAGWRWVL VGMSLYAIVL WAMCWYGFAE
TRQETHIQAG SVREIFASAD FRAWGLLAAT TYAGIFCFLL LSPMVYIAYL GLSPELYGWI
PAGGSLVYIV STTGCRRLLR RQSLVRTVQQ GATLSLVGAG IQGLGCWLLP GQAWPLLLGH
GIYCMGHGIH QPCGQAGAVG ELPHLAGRAV SWSGFIMMLA AFCLGQTAAA FDDTSHSLGA
WPMVVPMMLA GTVLASIAFL WLPKLQSQPD PQSAAA
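
The relationship between the two sequence blocks can be illustrated with a minimal Python sketch. It translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and written for this example only. The codon table is the standard NCBI assignment; translation table 11 assigns the same amino acids to codons and differs in which codons may serve as start codons, so the standard assignments are assumed to be adequate here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein.
first_dna_line = "ATGACGCGTC GCCCTCTGAG CTTTGCCCTG CTACTGCCGT TGTTACTGTC GGCACAGCCG"
first_protein_block = "MTRRPLSFAL LLPLLLSAQP"

assert translate(first_dna_line) == clean(first_protein_block)
print(translate(first_dna_line))
print(round(gc_percent(first_dna_line), 1))
```

Passing the full gene sequence to `gc_percent` and `translate` in the same way would allow the reported GC content and the full protein sequence to be checked against the record.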