Gene Rsph17029_1616 details


Gene Information

Locus tag: Rsph17029_1616
Symbol:
ID: 4897169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1701192
End bp: 1702448
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 67%
IMG OID: 640112207
Product: major facilitator transporter
Protein accession: YP_001043498
Protein GI: 126462384
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
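
The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks that relationship in Python. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; the record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1701192
end_bp = 1702448
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not
# contribute a residue to the protein.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.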


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0546554
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAAGG TCCTCGCCGC CACCTGGCCG CTGCTGCTGG GCGTCATGCT GCTGATGGTG 
GGCAATGGCG TGCAGGCCTC GCTTCTGGGC ATCCGCGGCG CTCTGGAAGG CTTTTCCACC
ACGCAGCTCG CCATCGTGAC CTCGGCCTAT TTCGCAGGCT TCCTCGTGGG CTCTCAGGTG
GCCCCCGACA TGATCCGCCG CGTGGGCCAT GTGCGCGTCT TTGCGGCGCT GGGGTCGATG
ATATCGGCGG TGCTCGTGGT CTATCCGGTG CTGCCCGACT GGACGGCCTG GACGCTGCTG
CGGGTGCTGA TCGGCTTCAG CTTCTCGGGC GTCTATATCA CGGCCGAAAG CTGGCTGAAC
AACACCGCCA CCAACGAGAC CCGCGGGCAG GCGATGTCGG CCTACATGAT GGTGCAGATG
GTGGGCATCA TCACCAGTCA GGCGCTGCTG AATGCGGCCG ATCCGTCCGG CTTCACGCTC
TTCGTGATCC CTTCGGTGCT GGTGTCGCTG GCCTTCATGC CGATCCTGCT CACCGTCACG
CCCACGCCGA CCTTCGAGAC GACGCGGCGG CTGTCGGTGC GCGACCTGTT CCGCGTGTCT
CCCCTGGGCG TGGTGGGGAT GCTGATGACG GGCGGGATCT TCTCGGCCAT GTTCGGCATG
GCCTCGGTCT GGGGCACGCT CGACGGGCTC TCGGTGCAGG AGATCTCGAT CTTCATCGGC
TCGCTCTATG TGGGCGGGCT CGTGCTGCAA TATCCGATCG GCTGGGCCTC GGACCGAATG
GACCGGCGCC AGCTGATCCT CGGGCTTGCG GTGGTGGCGG GGCTGCTCAT GGCCCTGACC
GTGGCGCTGG CGCCGCCCTT CTGGGGGCTG ATCGGGGTCG CGCTGCTTCT GGGCGGGATC
ACCAACCCGA TCTATTCGCT GCTCATCGCC CATACGAACG ATTTTCTGGG CAAGGAGGAT
ATGGCGGCGG CCTCGGCCGC GCTCCTGTTC ATGAACGGGC TCGGCGCGAT CTGCGGCCCG
CTGGTGACGG GCTGGATCAT GGAGCAGGCG GGGCCGAGCG GCTTCTTCCT CTTCATCGGC
ATCCTCTATG GCGCGATGGC GGCCTATGCC GGATGGCGGA TGACGCGGCG CGCGGCGCCC
GCGGTGGCCG ACACCGGCTC GTTCGCGACC GTGGCGCCCA CGGCCTCGTC GGTTGCGGTC
GGAGCGGTCA TGGAAGTGGT GACCGAGGCG CAGGAGGCGC AGCAGGCGGC CGAGTGA
 
Protein sequence
MFKVLAATWP LLLGVMLLMV GNGVQASLLG IRGALEGFST TQLAIVTSAY FAGFLVGSQV 
APDMIRRVGH VRVFAALGSM ISAVLVVYPV LPDWTAWTLL RVLIGFSFSG VYITAESWLN
NTATNETRGQ AMSAYMMVQM VGIITSQALL NAADPSGFTL FVIPSVLVSL AFMPILLTVT
PTPTFETTRR LSVRDLFRVS PLGVVGMLMT GGIFSAMFGM ASVWGTLDGL SVQEISIFIG
SLYVGGLVLQ YPIGWASDRM DRRQLILGLA VVAGLLMALT VALAPPFWGL IGVALLLGGI
TNPIYSLLIA HTNDFLGKED MAAASAALLF MNGLGAICGP LVTGWIMEQA GPSGFFLFIG
ILYGAMAAYA GWRMTRRAAP AVADTGSFAT VAPTASSVAV GAVMEVVTEA QEAQQAAE
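
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal Python sketch, not the method used to generate this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons are permitted as starts), and it builds the codon table from the conventional TCAG ordering. The function name `translate` and the constants are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order.
BASES = "TCAG"

# Amino acids for each codon in that order; "*" marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code for elongation.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break  # stop codon: end of the coding sequence
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTTCAAGG TCCTCGCCGC CACCTGGCCG"
print(translate(first_blocks))  # MFKVLAATWP
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence, with or without the spaces between blocks, should apply the same procedure to the whole coding region.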