Gene RoseRS_1572 details

Gene Information

Locus tag: RoseRS_1572
Symbol:
ID: 5208527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1921592
End bp: 1922839
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 62%
IMG OID: 640595178
Product: major facilitator transporter
Protein accession: YP_001275914
Protein GI: 148655709
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
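
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function name `expected_protein_length` is hypothetical and not part of any database API.

```python
def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1921592, 1922839

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

print(gene_length)                           # 1248
print(expected_protein_length(gene_length))  # 415
```

Under these assumptions the result agrees with the listed gene length of 1248 bp and protein length of 415 aa.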


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.58248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCAT CCACGTCTTC GATGAACCCC TCCCGTTTCG TGGCGCTGCG CTATCGCGAC 
TTCCGGCTCC TCTGGCTTGG TCAGTTTGTG TCGATCACCG GCACGCAGAT GCGCAACGTG
GCTATCGCCT GGCAGATCTA TCAACTGGCG CGTGTTGACA GCAGCATTCA GCCTGAACTC
GCACTCGGTC TGATCGGTCT GGCGCGCGTC GTCCCACTGG TTGTCACGGC GCTCGTGAGC
GGTATGGTCG CAGATCGCGT CGAACGACGC AGCATGCTGA TCCTGACTTC GCTGGCAGCG
CTTGGGTGCT CAGTTGTACT CGCGTTCGCC GCCGAAACGG AGCGCCCGCC GCTGGCGCTG
ATCTACGCGA TGGTGGCGCT GGCATCGGTG GCAGGCGCGT TCGAGTTGCC AGCGCGTCAG
GCGATTATTC CCAATCTCGT GGCGCCGCAG CATCTGCCCA ATGCCCTGAG CCTGAATATC
GTCGCCTGGC AACTTGCGAC CGTGATCGGT CCGGCGTTGT CCGGCATCCT GATCGCGGCG
GTTGGGGTTG CACCGGTGTA CTGGATCGAT GCTGCCAGTT TTCTCGCAGT GGTTGTTGCA
GCGCTACTTA TGCGCACGCG CAATCTCCCC GCGCGCGCTG AACCGGTCTC GTTGCAGGCG
GCGCTGGCAG GGTTGCGCTT CGTCTTTTCG CATCGCCTGA TTGCAGCAAC GATGCTGCTT
GATTTCTTTG CCACGTTCTT TGGCGCTACC GGTGTGTTGC TGCCGGTCTT CGCCGATCAG
GTGCTGCGGG TTGGTCCAAC CGAACTGGGC TGGATGTATG CAGCGCCATC GGTCGGCGCG
GTGATCGCCG CCACGCTGCT GAGCGGCGTG CGCATCCCGC GTCAGGGGAT GACGCTCCTG
GCGGCAGTGC TGCTCTTTGG CGTATGCGTC GTGATCATCG GCGTGTCGCG CTGGCTGCCG
TTGACACTGC TGGCGCTGGC AGGCATGGGC GCGGCGGATA CGGTCAGCAT GGTGATCCGC
GGCACGATCC GTCAGTTGCT GACCCCCGAT GAGTTGCGCG GAAGAATGGT GGCGGTCACG
ATGATCTTTT TTGCTGGCGG TCCGCAACTG GGTGAAACCA ATGCCGGGTT TATCGCCAGT
CTCATCGGCG CGCCTGCAGC AGTGGCGATC GATGGTGCGG CGTGTATCGT CATAGTGATC
GGGACGGCGC TCAAGGTTCG TGAGTTGCGC CAGTATGACG GTTCGTGA
 
Protein sequence
MNPSTSSMNP SRFVALRYRD FRLLWLGQFV SITGTQMRNV AIAWQIYQLA RVDSSIQPEL 
ALGLIGLARV VPLVVTALVS GMVADRVERR SMLILTSLAA LGCSVVLAFA AETERPPLAL
IYAMVALASV AGAFELPARQ AIIPNLVAPQ HLPNALSLNI VAWQLATVIG PALSGILIAA
VGVAPVYWID AASFLAVVVA ALLMRTRNLP ARAEPVSLQA ALAGLRFVFS HRLIAATMLL
DFFATFFGAT GVLLPVFADQ VLRVGPTELG WMYAAPSVGA VIAATLLSGV RIPRQGMTLL
AAVLLFGVCV VIIGVSRWLP LTLLALAGMG AADTVSMVIR GTIRQLLTPD ELRGRMVAVT
MIFFAGGPQL GETNAGFIAS LIGAPAAVAI DGAACIVIVI GTALKVRELR QYDGS
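
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a rough sketch, using only the Python standard library, of how one might strip the grouping, compute GC content, and translate the gene sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 also allows alternative start codons, which this sketch ignores because the gene here begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
head = clean_sequence("ATGAATCCAT CCACGTCTTC GATGAACCCC")
print(translate(head))  # MNPSTSSMNP
```

Applying `clean_sequence` to the full gene sequence and then `translate` would be expected to reproduce the protein sequence listed above, and `gc_percent` could be compared against the reported GC content.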