Gene RoseRS_2818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2818 
Symbol 
ID5209787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3511311 
End bp3512777 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content60% 
IMG OID640596417 
Productextracellular solute-binding protein 
Protein accessionYP_001277139 
Protein GI148656934 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.179577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCC ATCACGAGGA AGAACTCGCG CATGAGTTGA AACAACGCGG TTTGAGTCGA 
CGGGAAATCT TGCAGATCGG CGCGCGTCTT GGTTTGAGTG GCGCGGCAAT CGCAACAGTG
CTGGCAGCAT GCGGGCAGGC GCCGCAGCAA CCCGGAACGG GCGCTGCGCC GACCATTCCG
CCAGCAACTG AACCGACGCC AGCATCCCTC TATGATCCGG AGGAAGGCAA CAGCGGATGG
CCCACCAACG CCGTCTCCGA TCCGACAGAG CGTGTGGAGA TCAGCGTCGC TCATGCATGG
GACGCAGTCT TCTTCGAGCG CCAGAAGCAG TTCGATGCCC TTTTCATGCA GCGTCATCCG
AACATCGTCG TGAAAGCGGA AAATACGCCG TTTGGCGAGT ATCGACAGAA GTATGTCGCG
CAGGCTGCTG GCAACGCCCT GCCCGATATC ATGTACTGTC AGTTCTCCTG GGCGCAGGAG
TTCATCAAGA ATGGTCTGTT CCGCCCGCTC GACGACTACA TCGCCAGAGA GAAGGACTTC
AACCTCCAGG ACTTCACGCC GCAATCTCTC GTCTCCTACC AGCGCGACGG CAAACTGTGG
GGCATCCCAT ACGATGAGGG TCCTGCCAAT CTGTACTACA ACAAGGATAT CTTCGATGCC
GCTGGCATCC CGTATCCCGA TGAAACGTGG GACCTTGAGA AATTGAAGGA AGTTGCACTG
AAACTGACCC AGGGCGAAGG ACCGAACAAG ATTTTCGGTT TGGGTGAACT TCCAACCCTC
GGTGACTCGC TGGTTGCGCC GCCATACCTG ATGCCGTTCG GCGCCCAATA CCTGCGCGAG
CCGAAGGAGG ACGAGTGTCT GATCAATAAA CCGGAAGCGG TCGCAGCGCT CGAATGGTGG
CAGGAGTTGC GCGATAAGGG CGCCGTGCCC AGTCCGGCGG ATCTGCAGAA CGTCGCCTGG
CCCGCCTTCC AGTTCGGCAA GATCGCCATG ACCTTGCAGG GTTCGTGGGC GACCCCGCCG
ATCCGGGCCG GCGCAAAGTT CAACTGGGAC ATCGCCATGT GGCCCCGTGG TCCGAAGGCG
CATGTCACCT TCTCCGCCGG CAGCGCCTAC ATGATCACCC GCGACAGCAA GAACCCCGAC
GCGGCATGGA TCTACCTGAA CGAGTATCTC TCAACCGCCG GACAATCGTA TATGTGGGGG
ATTACCGGTC GCGGCAGCCC GGCGCGCCTC TCGGCGTGGC CCTCGTACCT CAACTCGAAG
TTCGCCCCTC CTGGCGCAAA ATATGTCGAA CAGGCGATGC GCACGATTGC CAGCCACGAC
ATCATCGATC AACCGACCGG TCCGCAGGTG ACGCAGGCGG CAGGACCGAT CTGGGACCTG
GTGGTGGCCG GTCAGTTGAG CGTGAAAGAA GCCTGTGACC AGGTGTTTGC CGCCGTTGAT
CCGATCATTG CCGTCAACCG GGCGTAA
 
Protein sequence
MATHHEEELA HELKQRGLSR REILQIGARL GLSGAAIATV LAACGQAPQQ PGTGAAPTIP 
PATEPTPASL YDPEEGNSGW PTNAVSDPTE RVEISVAHAW DAVFFERQKQ FDALFMQRHP
NIVVKAENTP FGEYRQKYVA QAAGNALPDI MYCQFSWAQE FIKNGLFRPL DDYIAREKDF
NLQDFTPQSL VSYQRDGKLW GIPYDEGPAN LYYNKDIFDA AGIPYPDETW DLEKLKEVAL
KLTQGEGPNK IFGLGELPTL GDSLVAPPYL MPFGAQYLRE PKEDECLINK PEAVAALEWW
QELRDKGAVP SPADLQNVAW PAFQFGKIAM TLQGSWATPP IRAGAKFNWD IAMWPRGPKA
HVTFSAGSAY MITRDSKNPD AAWIYLNEYL STAGQSYMWG ITGRGSPARL SAWPSYLNSK
FAPPGAKYVE QAMRTIASHD IIDQPTGPQV TQAAGPIWDL VVAGQLSVKE ACDQVFAAVD
PIIAVNRA