Gene RoseRS_4478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4478 
Symbol 
ID5211463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5615401 
End bp5616687 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content60% 
IMG OID640598057 
Productmajor facilitator transporter 
Protein accessionYP_001278760 
Protein GI148658555 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000434229 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000716178 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACCTGC GTTCGCCAAA ACTCTTTTTG TTCCTGACCG TCTTGATCGA TCTGCTCGGT 
ATCGGCATTG TGTTGCCGCT GATGCCGTAC TATCTCAAGA TCGTCGAGCA GTCGAGCATT
CCATGGCTGG CAGCCAATCG CGCGATCATC GTCGGCGCAT TGATGGCGTC CTTTGCGCTG
ATGCAGTTTC TCTTCACACC GGTGCTCGGC GCTCTGTCCG ACCGGTATGG GCGCCGACCG
ATCCTGCTCA TCAGCGTCCT GGGCAGCGGG CTGTCGTATG TGCTGTTCGG GTTTGCCGAA
TACCTGTCGT TTCTTGGGGT CGAAACAGTC CTGGCAATCC TGTTTATCGG TCGGATGCTG
AGCGGAATTA CCGGCGCAAG TATTTCGACT GCGCAGGCAT ACATTGCCGA CACGACCACC
CCCGAAGAGC GCACGAAGGG CATGGGCATG ATCGGCGCAG CATTCGGTCT GGGTTTCATG
CTCGGTCCGG CGCTCGGCGG ATTGTTGAGC ACAATCAGCC TGGAAGCGCC AGCATTCGTT
GCCGCCGGTC TTGCATTCGC AAATGTGATC TTTGGTTACT TCAAGTTGCC GGAGTCGCTG
CCGCCTGAGC GACGCATGGT CACGCCGATG CGTGGGATGA ATCCGGTGTC GCGCCTGAGC
GCGCTGTTGC GGCGATCCAG CATTCGTCCG CTGCTGATCG GCATCTTCCT GCTCAATATG
GCATTTTCCG GCTTGCAGAG CAACTTTGCC GTGTTCAGCG ATGTGCGCTT CGGTTTCGGT
CCGCTCGATA ATGCGCTGAT CTTCACGCTG GTCGGGTTGC TGGCGGTGGT GATGCAGGGT
TTTCTGATCC GCCGTTTGGT GCTTGCCTTT GGTGAGACGC GACTGGCAAT CGCTGGCATG
ACGATGATGG CAGGCGCATT CATTGCGGTC GCCCTGGCGC CGGAGGCATG GATGCTCTTC
CCGGCGGTTG GCGCCATCGC TATTGGTGAT GGAATGGCAA CACCGGCGTT GACCGGTCTG
ATCTCGCGGC GGGTGGACGC GCACGAGCAG GGAGCGACGC TGGGCGGGAC GCAGGGGCTG
ATCAGCCTGA CGCGGATCGC TGCGCCGATC CTGGCAGGTA CGACGTTCGA TCTGATCAAC
GTGAGTGCGC CATATTACCT GGGCGGCGCG CTGATCGCCG TGGCCGTCGC AGTTGTCGGT
TCGGCGTTGT TGCCAGCATT GCGGAGCGGC GTTGGTCATG ATCAGCCGCA GGGTGCGGTG
ATGATCGGAA GCGCAAAAGC GGAATGA
 
Protein sequence
MNLRSPKLFL FLTVLIDLLG IGIVLPLMPY YLKIVEQSSI PWLAANRAII VGALMASFAL 
MQFLFTPVLG ALSDRYGRRP ILLISVLGSG LSYVLFGFAE YLSFLGVETV LAILFIGRML
SGITGASIST AQAYIADTTT PEERTKGMGM IGAAFGLGFM LGPALGGLLS TISLEAPAFV
AAGLAFANVI FGYFKLPESL PPERRMVTPM RGMNPVSRLS ALLRRSSIRP LLIGIFLLNM
AFSGLQSNFA VFSDVRFGFG PLDNALIFTL VGLLAVVMQG FLIRRLVLAF GETRLAIAGM
TMMAGAFIAV ALAPEAWMLF PAVGAIAIGD GMATPALTGL ISRRVDAHEQ GATLGGTQGL
ISLTRIAAPI LAGTTFDLIN VSAPYYLGGA LIAVAVAVVG SALLPALRSG VGHDQPQGAV
MIGSAKAE