Gene Rsph17029_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1033 
Symbol 
ID4895573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1066608 
End bp1067777 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content74% 
IMG OID640111620 
Productmajor facilitator transporter 
Protein accessionYP_001042916 
Protein GI126461802 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.611435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCAGCG CCTACCGCAA TGCCCTTCTC CTGAGCCTCG GGCCCGCCGC GGGGATCGGG 
CTCGGCCGCT TCGCCTATGC GCTGCTTCTG CCGGCGATGC AGGCCGACCT CGGCTGGAGC
TATGCCGCGG CGGGCTGGAT CAACGCGGCC AATGCCGCGG GCTATCTCGG GGGCGCCATG
CTCGCGCCTG CGCTGGCGCA GCGGGTGGGC GCGGCGCGCG CCTTCGCCGC CGGGCTGGCG
ATGCTGCTGC CGGCGCTGGC CGCGGTGGCC CTGACGCGGG ACGTGGCGGC TCTGGCCGCG
CTCCGCCTGC TGGCGGGCGG CTCGGGTGGG GTCGTCTTCG TCTGCGGCGG CCTCCTTGCC
GTGGGCCTCA GCCTGCGGGC GGGGTCGGGC GGGCTCGTGC TCGGCACCTT CTACGCAGGC
ACCGGGCTCG GGATGATCCT GTCGGCGCTG GCGGTGGCGC CCCTCCTCGG GATCGCGGGC
GCCACCCACT GGCCGCAGGG CTGGCTGATC CTCGCGGGCC TCTCCGCCCT CTGCGCGGCG
CTGGCGCTCC TGCCGCTCAG GGACGGGCTC GGCGCCTCCG TGCGGCAGGC CGGCAGCCGC
GGCCCCACGC CTCTCCGCTT CTGGAGGATC CTCGCCGGCT ATCTCCTCTT CGGCCTGGGC
TCCATCGGCT ACATGACCTT CATCTACGGC CACCTCGCCG AGAGCGCGGG CGGCTGGCCG
CAGGCGATGC TCTTCTGGTG CGCTCTGGGT CTCGCGGCGG TGGCCGCGCC CTCGATCTGG
CGGCGGCTGA TCGGCGGCGC GAGCCCCGAG CGCAGCTTCG CGCTCCTCGT GGCCACCAAT
GCGCTAGGCT CGGTGCTGCC GTTCCTGATG CCGGGCGCGC TCGGCCTCTG GCTCTCGGCC
TTCCTGTTCG GCAGCACCTT CTTCAGCACG GTGGCGGCCA CCAGCGCCTT CGCCAGCGCC
CTGCCGCAGG CCTTCGATCG GGGCCGCGCG ATCCGCGCCT TCACCATCGC CTTCGCGCTG
GGGCAGTTCG GAGGCCCGGT CGTCCTCGGC TGGACGGCCG ATCTCACCGG GCGTCTCGAT
GCGCCGCTGA TGTTCGCAAG CCTCGTCGTG CTCGCAGGCG CGCTGCTGGG TGTTCTCGAG
CGTCGGCCGG ACGCCATCGA CGGGGCATGA
 
Protein sequence
MTSAYRNALL LSLGPAAGIG LGRFAYALLL PAMQADLGWS YAAAGWINAA NAAGYLGGAM 
LAPALAQRVG AARAFAAGLA MLLPALAAVA LTRDVAALAA LRLLAGGSGG VVFVCGGLLA
VGLSLRAGSG GLVLGTFYAG TGLGMILSAL AVAPLLGIAG ATHWPQGWLI LAGLSALCAA
LALLPLRDGL GASVRQAGSR GPTPLRFWRI LAGYLLFGLG SIGYMTFIYG HLAESAGGWP
QAMLFWCALG LAAVAAPSIW RRLIGGASPE RSFALLVATN ALGSVLPFLM PGALGLWLSA
FLFGSTFFST VAATSAFASA LPQAFDRGRA IRAFTIAFAL GQFGGPVVLG WTADLTGRLD
APLMFASLVV LAGALLGVLE RRPDAIDGA