Gene RoseRS_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1778 
Symbol 
ID5208737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2195829 
End bp2197253 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content61% 
IMG OID640595386 
Productmajor facilitator transporter 
Protein accessionYP_001276118 
Protein GI148655913 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000929318 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0431251 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCA ACGAAGAACG TACCCGGAAC CGCATCCTGC TTGTGCTGTT TGTTGGTGTT 
CTGATGGCAG CGCTTGATAT TGCCATCGTT GGACCAGCGC TGCCGGCGTT GCGCGAGCAT
TTTGGAATCG ACGCGCGTGC GGCATCGTGG ATGTTCGTCG TCTACGTGCT CTGCAACCTG
GTTGGAACGC CGTTCATCGC CAAACTGTCG GACCGGCTGG GGCGGCGCAC GCTCTACACT
GCCTGTGTTG CCCTGTTTGG TCTGGGGTCA TTGATCGTCG TGGCGGCGCC GACGTATGCG
CTGGTGCTGG CGGGTCGCGC TATCCAGGGG TTGGGCGCGC GTGGCATTTT CCCGGTTGCC
AGTGCGGTGA TCGGTGATAC ATTTCCGCCG GAGAAGCGCG GCAGCGCGCT GGGACTGATC
GGCGCGGTGT TTGGTATTGC ATTCCTGATC GGACCGATCA TCGGCGGTGT GCTGCTCCTG
CTTGGATGGC AGTGGCTCTT TCTGATCAAT CTGCCGATTG CACTTGCGCT GATCGGGTTT
GGCGTGAAAT TATTGCCCGC CATCCGCGCG GCAACGCCGC GTCCCTTCGA CTGGGGCGGG
ACGGTCGTGC TTGGGGTGAT CCTGGCATCG CTGGCTGTGG CGCTGAGCGA TCTTGCCTAC
CTGCTCGACG AAGCGAGTGT GAGCGGTCTG GTGAATGCAA TCAGTGCATC ATCGACCTGG
TTCCTGCTGG TGCTGGCACT GGCGCTCATC CCGCTATTCT TGCAGATCGA GCGTCGTGCG
GATGACCCGG TGCTTGACCT GAATCTGTTC CGCAACTGGC AGATTGCGCT GGCTGGCGCA
CTCTCCTTTG GCGCAGGCTT GAGCGAGGCG GTAACGTTGT TCGTGCCATC GTTGCTTGTC
GCAGCGTTTG GCGTCACGCC ATCAACTGCG AGTTTTATGC TTGTGCCGAT GGTGCTGGCG
ATGGCGGTCG GTTCACCGCT GTCGGGGCGC ATGCTTGACC GGATTGGCTC GAAAATTGTG
GTGCTGACCG GTACGGCGTT GATAGCAACA GGTTTGCTGC TGGAAGGGAT GCTGGCAACC
TCTCTCGTCG CGTTCTACGG CTTTGCTGCG CTGTTTGGCA TTGGTATCGG CGTATTGCTC
GGCGCATCGC TCCGGTACAT CCTGTTGAAC GAAGCGCCAG CTGCGGAACG TGGCGCGACG
CAAGGGGTGC TGACGGTATT TATCAGCATT GGTCAGTTGA TCGGCGCGGT GGTGCTCGGC
GCGGTTGCAG CAGCGCGCGG TAGCGATGTC GGCGGATACG CAGCGGCGTT TCTGGTTGTC
GGCGTCGTGA TGCTGGCGCT CTTCATCGCT TCGTTCGGTT TGAAGAGTCG CGCCGAGGAA
CTGGCGACCC GCCAGCAATG GCAGAGCGGA GCTTCGGCGG CATGA
 
Protein sequence
MTRNEERTRN RILLVLFVGV LMAALDIAIV GPALPALREH FGIDARAASW MFVVYVLCNL 
VGTPFIAKLS DRLGRRTLYT ACVALFGLGS LIVVAAPTYA LVLAGRAIQG LGARGIFPVA
SAVIGDTFPP EKRGSALGLI GAVFGIAFLI GPIIGGVLLL LGWQWLFLIN LPIALALIGF
GVKLLPAIRA ATPRPFDWGG TVVLGVILAS LAVALSDLAY LLDEASVSGL VNAISASSTW
FLLVLALALI PLFLQIERRA DDPVLDLNLF RNWQIALAGA LSFGAGLSEA VTLFVPSLLV
AAFGVTPSTA SFMLVPMVLA MAVGSPLSGR MLDRIGSKIV VLTGTALIAT GLLLEGMLAT
SLVAFYGFAA LFGIGIGVLL GASLRYILLN EAPAAERGAT QGVLTVFISI GQLIGAVVLG
AVAAARGSDV GGYAAAFLVV GVVMLALFIA SFGLKSRAEE LATRQQWQSG ASAA