Gene Rru_B0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_B0020 
Symbol 
ID3833351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007641 
Strand
Start bp18918 
End bp20465 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content67% 
IMG OID637824039 
Productmajor facilitator transporter 
Protein accessionYP_425056 
Protein GI83582750 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGGCG TATCCACTCC CCGGATTCCA GCGGCGGATG CGCCGTTCGC CTCGCCCGAC 
ACCCCCGCCC GTCGCCAAGC GATCCTCGCG GTCATGGTCG GCTGCGCCGC TCTGGTCTTT
GGCCTGGGCG CCAGCCTGAA CCTAGCGGTC GGGCGTATCG CCACCAGCCC GCTCCATCCT
TCCGCCACGG CGGTGCTGTG GATCGTCGAC ACATATCTCG TGGTCTTCGG TTGCCTGCTG
ATTCCGGCCG GGGCCATCGG GGATCGCTAT GGCCGCAAGC AGGCCATGCT GGCGGGTCTC
ACGTTCCTGG CTGTTGGCTC CCTGCTGTCC GCCGTTGCGG CCACCGTGCC CGTGCTGCTG
GCCGGCCGAG CAGTCGCCGG CGCGGGCGCG GCACTGATCC TGCCGAACAG CCTCGCGCTG
GTCGTCCAGG TCTATCCGGC CGACCAGAAA TCCCATGCCA TCGCTGTATG GACCGGCATG
ACCGGCCTGG GCGGCGCGCT CGGGAATATC CTCGGCGGGC TTGTGCTGCA GTTCGCCGAA
TGGCAGGCGA TCTTCACCGT TGCGGTGCCG TTGGCCCTTG CAGGCCTCGC GCTGACGGCG
TGGCTGGCGC CACGGCAAGC GGGACACGAG CATCCGCTCG ACCTCGTCGG CGCCGGCATT
CTCATGCTGA GCATATTCGC CCTGTTGACG GGACTGATCG AAGGCCAGGA GCTGGGCTGG
GCCTCGACGG AGGTGATCGG CGCCTTATGC GCCGCGGCGG CGCTCCTCGC AGTATTCCTC
ACCACCGCGG CGAGGCGGAA GCACCCCCTC GTCGATCCGC GCATCTTCCG TGCCCGCGGC
CTCTGCGCCG GCATGCTCGG CATCACCGCA TCCTTCATCG CCATGTATTC GCTGTTCTAT
CTGAACGGCC AATACCTGAT GAGCGTGAAG GGCTATCCGC CGGCTCTGGC CGGCATATGC
ACCCTGCCGC TGGTGGTCGT CCTGTTCTGG CTGTCACCGC GCAGCGTCCA GCTCGCCCGT
CGCTTCGGTG CGCGGCCGGT GGTCGCCGTC GGCCTGGCAA TGCTCATCGT CGGGCTGGGC
CTTCTGCGGC TTTGCGGCGC GGATACGTCC TACTGGTTCT ATGCCGCGAG CATCGGCGTT
ATCGGCATCG GCTCGGCGCT GTCCAATCCT GTCCTGTCGA CCGCGATCAT CGGTGCGCTG
CCGCCGCATC AGGCAGGTGT CGGCTCCGGC ATCAACAGCT TCACGCGTGA AATCGGCGGC
GCTTTGGGCG TTGCTCTGTT TGGCAGCCTG CTGGGGAGCA GCTTCCCGTC GCGTTTGCCC
GACACCCTGG CGCAGGCTCA TGGCGCTGTG CAACGGTCGG TCGGCGCCGC ACTGGCCTAT
GCAGAATCCC TGCCAGGAAC GGCGGCCAAT CAGACGGTAC AGGTCGTACG TCAAGCCTTC
TCCGGCGCAA TGGCACAATC GCTGCTCACG GTGATGCTGG TTCTGGCGGT CGCTGCTGTG
TTGGCGGTGC TTTGGTATCC GGCCTCCGCA GGATCGGCCG AAAAGTGA
 
Protein sequence
MSGVSTPRIP AADAPFASPD TPARRQAILA VMVGCAALVF GLGASLNLAV GRIATSPLHP 
SATAVLWIVD TYLVVFGCLL IPAGAIGDRY GRKQAMLAGL TFLAVGSLLS AVAATVPVLL
AGRAVAGAGA ALILPNSLAL VVQVYPADQK SHAIAVWTGM TGLGGALGNI LGGLVLQFAE
WQAIFTVAVP LALAGLALTA WLAPRQAGHE HPLDLVGAGI LMLSIFALLT GLIEGQELGW
ASTEVIGALC AAAALLAVFL TTAARRKHPL VDPRIFRARG LCAGMLGITA SFIAMYSLFY
LNGQYLMSVK GYPPALAGIC TLPLVVVLFW LSPRSVQLAR RFGARPVVAV GLAMLIVGLG
LLRLCGADTS YWFYAASIGV IGIGSALSNP VLSTAIIGAL PPHQAGVGSG INSFTREIGG
ALGVALFGSL LGSSFPSRLP DTLAQAHGAV QRSVGAALAY AESLPGTAAN QTVQVVRQAF
SGAMAQSLLT VMLVLAVAAV LAVLWYPASA GSAEK