Gene RoseRS_0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0803 
Symbol 
ID5207746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp995605 
End bp996912 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content61% 
IMG OID640594419 
Productextracellular solute-binding protein 
Protein accessionYP_001275167 
Protein GI148654962 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCCAA CGAAGTGGTT GAAACTCATC AGCGCATTTG CGATCGTTAC ACTCGCGCTG 
GCAGCCTGTG GCGCGCCGTC CACCGCGCAA CCGACTGCTG CCCCCGCACA GCCGACGACT
GCCCCCGCGC AGACCGGCGG CGGGAAACTC GAAATCTTCA GCTGGTGGAC GAACGGCGGT
GAAGCGGATG GGCTGAATGC GATGTTCAGC ATCTATCAGC AGAAGAATCC CGGTGTCGAA
ATTGTGAATG CCACGGTCGC CGGCGGCGCC GGCACGAACG CCAAGACGGT GCTGAAGACG
CGCCTCCAGG GTGGTCAACC GCCGGATAGC TGGCAGGTGC ACGCCGGTAA AGAACTGACC
GCATATGTAG ATGCCGGTCA GATGGAGCCG CTGACCCAGT TCTTCAAAGA GCAGGGTTTC
GACAAGGTGA TGCCGCCGAA ACTGCTCGAA CAGATCACCT ACAACGGCGA AATCTGGTCG
GTTCCGGTCA ACATTCACCG CTCGAACGTG CTCTGGTACA ACATCAAAAT CTTCCAGGAG
AATGGTCTGA CCCCGCCCAA GACCATCGAC GACTTCTTCA TGGTCGCCGA GGCGCTCAAG
GCAAAGGGGA TCATCCCCCT GGCAGTCGGC GGCAAGGACA AGTTTGAGAC GCCGCACCTG
TTCGAGAGCG TGTTGCTGGC AGTTTTCGGA CCGGACGACT ACCCCAAACT GTTCCAGCCC
GGCGCAGACT GGAGCGATCT GCGGGTGCGC CAGGCTGCCG AGATCGCCAA ACGCATGCTG
GAATACTCCA ACAGTGATCG CTCCTCCCTC GGTTGGGCTG ACGCAGCGCA ACTGGTGCTC
GACGGCAAGG CGGGCATGAC GATCATGGGC GACTGGGCGC ACGGCTACTT CATCAGCAAG
GGCGCGAAAG TCGGCGTGGA TTACGGCTAT GCCGCAGCCC CCGGCAACGA CGGCGTCTTT
ATGTGGCTGT CGGACAGTTT CGGGCTGGCG AAGGGTGCGC CGAACCCGGA GCAGGCAAAG
GCATGGCTGG CGCTCTGCGG TTCGCGTGAG GGGCAGGATG CCTTCAACCC GAAGAAGGGT
TCGATCCCGG CGCGCACTGA CGCCGATGTG AGCCTGTATG ATGAATATCT GAAGTACTCG
ATCAAAGCCT TCGGCAGCGA GAAACTGGTT CCCAGCGTCG TGCATGGCGC TGCCGCTCCC
GAAGCGTATA TGACCGAGTA CGGCAATGCG CTGAATGTCT TCGCCGGCGA TCTCGATGTT
GATGCCGTCG TGCAGGCATT GCAGGACGCC GCGAAAGACC TGAAGTAG
 
Protein sequence
MLPTKWLKLI SAFAIVTLAL AACGAPSTAQ PTAAPAQPTT APAQTGGGKL EIFSWWTNGG 
EADGLNAMFS IYQQKNPGVE IVNATVAGGA GTNAKTVLKT RLQGGQPPDS WQVHAGKELT
AYVDAGQMEP LTQFFKEQGF DKVMPPKLLE QITYNGEIWS VPVNIHRSNV LWYNIKIFQE
NGLTPPKTID DFFMVAEALK AKGIIPLAVG GKDKFETPHL FESVLLAVFG PDDYPKLFQP
GADWSDLRVR QAAEIAKRML EYSNSDRSSL GWADAAQLVL DGKAGMTIMG DWAHGYFISK
GAKVGVDYGY AAAPGNDGVF MWLSDSFGLA KGAPNPEQAK AWLALCGSRE GQDAFNPKKG
SIPARTDADV SLYDEYLKYS IKAFGSEKLV PSVVHGAAAP EAYMTEYGNA LNVFAGDLDV
DAVVQALQDA AKDLK