Gene Rru_A2789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2789 
Symbol 
ID3836229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3223449 
End bp3225155 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content69% 
IMG OID637826900 
Productmajor facilitator superfamily protein MFS_1 
Protein accessionYP_427873 
Protein GI83594121 
COG category[G] Carbohydrate transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0589] Universal stress protein UspA and related nucleotide-binding proteins
[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCGT CTTCCGCCGG TGCCGCCGCC GGCGCCCCCT CGCTGTTTCG CCAGCCCAAG 
GCGGTGTGGG CGACCGCCTT CGCCGCGGTG ATCGGTTTCA CCAGCATCGG ACTGGTCGAC
CCGATCCTGA CCTCGATCGC CGAGGGGCTG AGCGCCACGC CCAGTCAGGT CTCGCTGTTG
TTCACCAGCT ATTTCTTCGT CACCGCCGTG ATGATGCTGG TCACCGGCTT CGTCTCCAGC
CGCATCGGCG GGCGCAACAC CTTATTGCTT GGCGCCCTGC TAATCGCCGT CTTCGCCGCC
CTGGCCGGCA CCTCCGACTC GGTGGCCGAA CTGGTCGCTT ATCGGGCGGG ATGGGGCTTG
GGCAACGCCT TTTTCGTCGT CACCGCGCTG TCGGTGATCG TCGGCGCCGC TTCGGGCGGC
ACGGCCGGGG CGATCTTGCT TTATGAGGCC GCCTTGGGCT TGGGCATCTC GGCCGGACCG
TTGATCGGCG CGGCGCTCGG CGCCCATTCC TGGCGCTATC CGTTCTTCGG CACGGCGGCG
CTGATGACCA TCGGCTTCCT GGCCATCGCC GTGTTCCTTG ATCCCCAGCC CAAGCCGGCG
CGCAAGATCG GCTTGTCGGG GCCTGTGAAG GCCTTGCGCC ATCCCGGCTT GCTGACGACC
TCGGTCAGCG CCTTTTTCTA TTACTACGCC TTCTTCACCG TGCTGGCCTT CGCGCCCTTC
GTGCTGCGAC TGTCGGCCCA TGCCATCGGC CTGATCTTCT TTGGCTGGGG GGTGGCGCTG
GCCCTGTTCT CGGTGCTGGT CGCCCCCCGT TTGCAGGCGC GCTTTGGCGC CTTCGCCCTG
CTTGCCGTCA GTCTGGTCGG CTTCGCCCTG CTGCTCTGCG TCATGGCCTT CGGCACGATC
CCGATGATCG TGACCGCCGT CGTCGCCTCG GGGGCGCTGA TGGGCGTCAA TAACACCGTC
TATACGGAAA TGGCGCTGGA GGTTTCCGAG CAGCCGCGCC CGGTGGCTTC GGCGGCCTAT
AATTTCCTGC GCTGGTTCGC CGGCGTCATC GCCCCTTATG CCGCCTCGCG CCTGGGCGAG
AGCTCCGGTC CGGCCAGCGC CTTCCTGACG GCGGCGGGCG CCGCGCTGAT CGGTGGGGCG
ATCCTGGTGG CGCTGCGGCG CAATTTGGGC CGCTACGGCC AGACGCGCCA GGACGCAATC
CCGCCGACCG TCCCGTCGAT CGGGCCGATT CTGGTCGGGC TTGATGGTTC GGCCGCCGAT
CGGGCGGTGC TGGCGCGGGC GGTTACCCTG GCGCGCCAGG GCGGCGGGGC GGTGTTCGTG
CTGCACATTC GCCCGATCGA GGTGTTTGGC GAATTCGCCG CCGCCCTGGA AGACCGCGCC
GCCGGCCGGG CGGTTGTCGA GAACGCCGTG GCAAGCCTCG GCGCCCAGGG GATCACCGCC
ATGGGCGAGG TGCTCGAGGA ATCGTCCACC CTGGTCCCCC AGCGGGTCAT CGCCCGCGCC
CGGGCCCTGG CCGCCCGGGT GATCGTGCTG GGCACCCGCC ATCCCGGCGA TCTTGGCAAT
CTGTTGCACG GCTCGGTGGC CGATATCGTC GGCCGCGAGG CCGGACGGCT GGTCGAACTG
GTTCCGAGTT CGGCGGGGGA AGGCGAGGCC GGGGAGACTG CCTTAGAACC CCGGGCGCTC
GCCGCCACGC CAGGACACCG GGTTTAG
 
Protein sequence
MSSSSAGAAA GAPSLFRQPK AVWATAFAAV IGFTSIGLVD PILTSIAEGL SATPSQVSLL 
FTSYFFVTAV MMLVTGFVSS RIGGRNTLLL GALLIAVFAA LAGTSDSVAE LVAYRAGWGL
GNAFFVVTAL SVIVGAASGG TAGAILLYEA ALGLGISAGP LIGAALGAHS WRYPFFGTAA
LMTIGFLAIA VFLDPQPKPA RKIGLSGPVK ALRHPGLLTT SVSAFFYYYA FFTVLAFAPF
VLRLSAHAIG LIFFGWGVAL ALFSVLVAPR LQARFGAFAL LAVSLVGFAL LLCVMAFGTI
PMIVTAVVAS GALMGVNNTV YTEMALEVSE QPRPVASAAY NFLRWFAGVI APYAASRLGE
SSGPASAFLT AAGAALIGGA ILVALRRNLG RYGQTRQDAI PPTVPSIGPI LVGLDGSAAD
RAVLARAVTL ARQGGGAVFV LHIRPIEVFG EFAAALEDRA AGRAVVENAV ASLGAQGITA
MGEVLEESST LVPQRVIARA RALAARVIVL GTRHPGDLGN LLHGSVADIV GREAGRLVEL
VPSSAGEGEA GETALEPRAL AATPGHRV