Gene Rsph17029_3762 details

Gene Information

Locus tag: Rsph17029_3762
Symbol:
ID: 4898660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 882881
End bp: 884158
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 69%
IMG OID: 640114367
Product: major facilitator transporter
Protein accession: YP_001045615
Protein GI: 126464502
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
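
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions agree with the values listed above but are not stated by the record itself. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 882881
end_bp = 884158

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

If either assumption were wrong (for example, 0-based coordinates), the computed lengths would be off by one and would no longer match the reported 1278 bp and 425 aa.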


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCGCCC CGAGCCCTGC CATCCAACGG TGGCGCCACG CCTCCCGCGC GCTGAACGAA 
CCGGCCTACC GGCGCTATTT TCTGGCGCAG GTGCCGCTGG TCATCGGCAC CTGGATCCAT
TCCATCGCGC TCGGCTGGCT GATGTGGCGG CTCAGCGCCT CGCCGATGAT GCTGGGCGTG
CTGGCCCTCT GCGATCTGGG CCCGACCCTG CTTCTGGGGC CGATCACCGG GACGCTGGTG
GACCGGGTCG ACCGGCGCAG GCTGCTGCTC GGCCTCATCT GCATCAATTT CGTGCTGATC
TGCACGCTGG CCGCGCTGGC GATCACCGAC AGCATCACCG TTCTGGCCAT GCTGATCCTG
ACCCCGAGCA TCGGCATCAT CGCGGCCTTC GAGAGCCCGG CGCGTCAGGC TCTGGTGGCC
GAACTGGTGG CACCGGCCGA TCTGCGCAAC GCGCTGGCGC TGAATTCGCT GCTCTTCAAC
ACCGCGCGGC TGATCGGGCC GGCCATCGGC GGTCTGGTCG CCGCCTGGGC GGGCGAGGGC
TGGGCCTTCG TCCTCAAGGC GCTGGCGCTG CTGCCAGCCG CCTATGTCCT CGCGACCATG
CGGTTGCGCG CGCCCGAGCC GCGCAGCCGG GGCCGGTTCT TCGAAGACAT GCGCGCGGGC
CTCGGCTTTG CCCGCAGCCA TGTGGAGGTC GCGCGCATCC TGATCCTGGT CGGCATCTGT
TCGCTGACCT CCGTGCCCTA TTTCTCGTTC CTGCCGGTCC TCGCCGATGA CATGCTGGGC
GCCGACGCGA GCCTTGCGGG CCTGCTCATG AGCGTCACCG GCATCGGCTC GATGGCGGCG
GGGCTGATGC TGACCTTCGG CGACCGTCTG AACGCCATGG CGCTCTGGCC GGTTGCCTCG
GCCTTCCTGC TGGGGGTGCT GCTGATCGGA ATGGGGCTTT CCAGCAGCGT CACGCTGACC
ACCGCCCTCG CCCTGCCGAT GGGCTTCGCC ATCCTGTCGC AGAACCTCGC CTCGAACACT
TTGCTGCAGC ATTTCGCGCC GCCGGGATAC CGCGGCCGGG TCATGGCGCT TTACGCGATG
ATGATGCTGG GCACCGTGCC CGTCGGATCG CTGATCGCCG GGGCGCTGGC CGCCCGCATC
GGTATGCCGT CGGTCTTCAT CCTGGGCGGC GCGCTCTGCA CAGCCACAGC CCTTGCTGCC
GCCTGGCATC GCCGGCGGTA TCCCGGCCCC GACCTCATGG CCGCCGATGC TGCCCCCTCT
TCCGCGTCGC GGGCCTGA
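
The record reports a GC content of 69% for the full gene. As a minimal sketch of how such a figure can be derived from the 10-base block layout used above, the helper below strips the spacing and counts G and C. The function name `gc_fraction` is hypothetical, and only the first line of the gene sequence is used as an excerpt, so the printed value describes that excerpt and need not equal the whole-gene figure.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence from the record (excerpt only).
excerpt = "TTGTCCGCCC CGAGCCCTGC CATCCAACGG TGGCGCCACG CCTCCCGCGC GCTGAACGAA"

print(f"GC content of excerpt: {gc_fraction(excerpt):.1%}")
```

Passing the complete gene sequence, with or without its block spacing, to the same function gives the whole-gene value.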
 
Protein sequence
MSAPSPAIQR WRHASRALNE PAYRRYFLAQ VPLVIGTWIH SIALGWLMWR LSASPMMLGV 
LALCDLGPTL LLGPITGTLV DRVDRRRLLL GLICINFVLI CTLAALAITD SITVLAMLIL
TPSIGIIAAF ESPARQALVA ELVAPADLRN ALALNSLLFN TARLIGPAIG GLVAAWAGEG
WAFVLKALAL LPAAYVLATM RLRAPEPRSR GRFFEDMRAG LGFARSHVEV ARILILVGIC
SLTSVPYFSF LPVLADDMLG ADASLAGLLM SVTGIGSMAA GLMLTFGDRL NAMALWPVAS
AFLLGVLLIG MGLSSSVTLT TALALPMGFA ILSQNLASNT LLQHFAPPGY RGRVMALYAM
MMLGTVPVGS LIAGALAARI GMPSVFILGG ALCTATALAA AWHRRRYPGP DLMAADAAPS
SASRA
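
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below illustrates that relationship on the first 60 bases only. The codon table is built from the standard NCBI layout (bases ordered T, C, A, G); the names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, shared by tables 1 and 11,
# listed in NCBI order with bases cycling T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(sequence: str) -> str:
    """Translate a CDS, reading an allowed start codon as M and stopping at '*'."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record (excerpt only).
excerpt = "TTGTCCGCCC CGAGCCCTGC CATCCAACGG TGGCGCCACG CCTCCCGCGC GCTGAACGAA"
print(translate(excerpt))
```

The printed residues can be compared with the first two 10-residue blocks of the protein sequence above.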