Gene RoseRS_3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3333 
Symbol 
ID5210310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4180855 
End bp4182075 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content63% 
IMG OID640596931 
Productmajor facilitator transporter 
Protein accessionYP_001277644 
Protein GI148657439 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.230737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTTCGT TTGCCGCCCG CGCGCGCGCG ACGATTGCGC GGATTTCGCC ACAGGTGTGG 
CGCGTGCTGA CGCACAGTTT GCTCTTCGGG CTGGCTGGCA GTATTGCCGA TCTGCTCTTC
AACTTCTATC TGGTAAGCCT GGGGTATGGC GCCGACACCG CCGGATTGAT GGCGACCGTC
TATCGCGGCG CAGGGGCGCT GCTCGGTCTG CCGCTCGGCA TCCTGATCGA CCGGTTTGGT
GCACGAGCGT TGCTGGTGGT GGGCGCTATC GGTTTTGGTA TCGCGTATGC GCTGGTGTTG
ATGGTATCGC AACTCTGGGC GCTGATCCTG TTCGTCTTCC TGGCTGGTGC GGCAAATGTG
CTGACGCTCA CGGCGGTTGT GCCGCTGCTC ACCGGGATCA CCGACGAGGA GGAACGGGCT
GCGGTGTTTG GCATGAATGC GTCAGCCGGA CTGATCATTG GTCTGGTCGG GAGCGGTGTG
GGCGGGTTGC TTCCCGGAAC GGCAGCACTC TTCCTGGGAG TGGCGACGAA TGATACTGCC
GCCTACCGGA TGGCGCTGTC AATCGTGGTT GTGCTGGGTT GTCTCTCAGC GCTGCCGGTG
CTGATCGGAT TCCGCGCCAG GCAACCGGTG TTCTCACCGG CGCCTCTTGT GGCAGCACCC
CAACGACACA TGCCGCCAAT GCGCCTGGTG CGCTTTGCAC TTCCCTCGCT CCTGCTCGGC
ATCGGCGGTG GGTTGTTCCT GCCATTTCAG AACCTCTTCT TTCGCACTGT CTTCGGACTG
AACGACGCGG TCGTCGGTGT GATGCTGGCG ATGGGCGCGC TGGGTATGGG GCTTGGCGCG
CTGATGGGTG CGCCAGTAGC CGCCCGTCTG GGGTTGCGCC GGGCTGCCAG CTCCCTGCGC
TTCGGGGCGG TATTCGCCGT GACACTGATG TTCGCGCCAG TTTTGCCGGT GGTGGTTGTG
GGATACATGT TGCGCGGCGC TTTTGTTGCA GCCAGTTATC CGTTGAATGA TGCGCTGGTG
ATGCAGTTGA CCCCGTTACG ACAACGCGGG ATCGCAATCA GTCTCATGAG TGTCCTCTGG
TCGCTCGGCT GGTCGGCAGC GGCGTGGATC AGCGGACACA TTCAGGTGCA CTACGGCTTT
ACCCCGGTGC TCGCCGCATC GCTCGTGGCG TATGCGCTCT CAGCGTGGGC GATCTGGACG
TTGCGGGAGG AGGGGCGGTG A
 
Protein sequence
MFSFAARARA TIARISPQVW RVLTHSLLFG LAGSIADLLF NFYLVSLGYG ADTAGLMATV 
YRGAGALLGL PLGILIDRFG ARALLVVGAI GFGIAYALVL MVSQLWALIL FVFLAGAANV
LTLTAVVPLL TGITDEEERA AVFGMNASAG LIIGLVGSGV GGLLPGTAAL FLGVATNDTA
AYRMALSIVV VLGCLSALPV LIGFRARQPV FSPAPLVAAP QRHMPPMRLV RFALPSLLLG
IGGGLFLPFQ NLFFRTVFGL NDAVVGVMLA MGALGMGLGA LMGAPVAARL GLRRAASSLR
FGAVFAVTLM FAPVLPVVVV GYMLRGAFVA ASYPLNDALV MQLTPLRQRG IAISLMSVLW
SLGWSAAAWI SGHIQVHYGF TPVLAASLVA YALSAWAIWT LREEGR