Gene Rsph17025_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1849 
Symbol 
ID5084910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1890321 
End bp1891577 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content67% 
IMG OID640483408 
Productmajor facilitator transporter 
Protein accessionYP_001168045 
Protein GI146277886 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.513333 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAAGG TCCTTGCCGC CACCTGGCCG CTTCTGCTGG GCGTCATGCT GCTGATGGTC 
GGCAACGGGG TCCAGGCCTC GCTGCTGGGC ATCCGCGGCG CGATCGAGGG ATTCTCGACC
ACACAGCTGG CCGTCGTGAC CTCGGCCTAC TTCGCGGGAT TCCTCGTGGG CTCGCAGGTC
GTGCCCGACC TGATCCGCCG GGTGGGCCAT GTGCGGGTCT TTGCGGCGCT CGGGTCGATG
ATCTCGGCGG TGCTGGTGGT CTATCCGGTG ATGCCCGATT GGGCGGTCTG GACGCTGCTG
CGGGTGCTGA TCGGTTTCAG CTTCTCGGGC GTCTACATCA CCGCCGAGAG CTGGCTGAAC
AACACCGCGA CGAACGAGAC GCGGGGACAG GCGATGTCGG CCTACATGAT GGTGCAGATG
GTGGGCATCA TCACGAGTCA GGCGCTGCTG AACGCGGCCG ATCCGTCGGG CTTCACCCTC
TTCGTGATCC CCTCGGTGCT CGTGTCGCTG GCCTTCATGC CGATCCTTCT GACCGTGACG
CCCACGCCGA CCTTCGAGAG CACCCGGAGG CTCTCGGTGC GCGAGCTGTT TCGCGTGTCG
CCGCTGGGCA TCGTGGGGAT GCTGATGACC GGCGGGATCT TCTCGGCCAT GTTCGGCATG
GCCTCGGTCT GGGGCACGCT CGAGGGACTC TCGGTGCAGG AGATCTCGAT CTTCATCGGC
TCGATCTATG TCGGAGGCCT CGTGCTGCAA TATCCGATCG GCTGGGCCTC GGACCGGATG
GACCGGCGTC AGCTGATCCT CGGGCTTGCC GTGGTGGCGG GGCTGCTGAT GGGGGTGACC
GTCCTGTTCC AGCCGCCCTT CTGGGGGCTG ATCGCGGTCG CGCTGCTGCT CGGCGGGATC
ACCAACCCCG TCTATTCGCT GCTGATCGCC TATACCAACG ATTTCCTCGG CAAGGAGGAC
ATGGCGGCCG CCTCGGCGGG GCTCTTGTTC ATGAACGGGC TGGGGGCGGT CTGCGGGCCG
CTCGTGACGG GCTGGATCAT GGAACAGGCG GGGCCGCGCG GCTTCTTTCT CTTCATCGGC
CTGCTTTACG GGGCGATGGC GATCTATGCG GGCTGGCGGA TGACGCGGCG TGCGGCGCCC
GCGGTGGCCG ACACGGGCTC CTTTGCATCC GTCGCGCCGA CCGCCTCGTC GGTGGCCGTC
GGCGCCGTCA TGGAAGTGGT CACCGAGGCG CAGGAGGCGC AGCAGGCGGC CGAGTGA
 
Protein sequence
MFKVLAATWP LLLGVMLLMV GNGVQASLLG IRGAIEGFST TQLAVVTSAY FAGFLVGSQV 
VPDLIRRVGH VRVFAALGSM ISAVLVVYPV MPDWAVWTLL RVLIGFSFSG VYITAESWLN
NTATNETRGQ AMSAYMMVQM VGIITSQALL NAADPSGFTL FVIPSVLVSL AFMPILLTVT
PTPTFESTRR LSVRELFRVS PLGIVGMLMT GGIFSAMFGM ASVWGTLEGL SVQEISIFIG
SIYVGGLVLQ YPIGWASDRM DRRQLILGLA VVAGLLMGVT VLFQPPFWGL IAVALLLGGI
TNPVYSLLIA YTNDFLGKED MAAASAGLLF MNGLGAVCGP LVTGWIMEQA GPRGFFLFIG
LLYGAMAIYA GWRMTRRAAP AVADTGSFAS VAPTASSVAV GAVMEVVTEA QEAQQAAE