Gene RoseRS_0937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0937 
Symbol 
ID5207883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1155799 
End bp1157559 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content62% 
IMG OID640594551 
Productextracellular solute-binding protein 
Protein accessionYP_001275296 
Protein GI148655091 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATAAGT CATCGTCGTC AATCCATACC CCTCGTTCTC TCAGACAGTT GTTCCAGTTG 
CTCAGCGCCA GCGTCTGGGT CGCTGCGACG ATACTTGTGC TGCTTGCAGC TTGCGGCGCG
CCAGACCAGC CGGTTGCGAC CGGTGAAGCC TCGCCTACTT CTGCGCCCAC ATCAGCGCCC
CCGACTCCTA CCGCGCTGCC GCGTGGCGGA AATCTCACCA TCCGTCTGTC CGACGATAGT
GGCACATTGC AACCCTGGCA TCCGCGCACA CGCGGCGAAG AGCAGATCAT CGGGTTGATC
TACAGTGGTC TGATGCGCCT GGACGCAACC CTGGCGCCGC AACCGGACCT GGCGACGGGT
TGGGTAGCCT CAGCCGATGG ACGGGTCATC ACGTTCACCC TGCGCAGCGA TGCCGTCTGG
CACGATGGGC GTCCGGTCAC GGTTGATGAT GTTGTCTTTA CGCTCGATGC ATTGCGCGCA
CTGCCGCCGT CTACAGCGTT GCTCGCCGGG TTGCGGCGAA TTGTCGAAGT GACGGCGCCG
GAGAGCGATA CGATTGTCAT CCGCCTCGAT GAACGCTACG CACCCATCTT CAGCCTGCTT
ACAACGCCGG TGCTGCCGCG TCACCTGCTG ATCGGCAGAA ACCTGGCAGA ATTCAACGCC
TGGGATGTGC CGTTTGGCAG CGGTCCCTTC CGTTTTGAGC AGCGCGAGCC GGGCGTGGCA
ATCACACTGG CGGCGAATCA GGCATTCTAC CGGGGTGCGC CGCTCCTTGA TCGGGTGGTG
TTCGTCATTG CGCCCGATGC ACAGGTGGCA GCGTCGGCGT TACAGGATGA ACGTTTGCTG
CTGGCTGAAC TTCCCTGGAG CGTCGGCAGC GTCATAACCG AAACATCGCC GATGCTGCAA
TGGGGCGCAT ATGCTGAGAA CGGGTATTAT TTTCTCGCCT TCAACCTGCG TTCCGACCGA
ATCTTCAGCG ATCCGAGGCT GCGCGAGGCG CTGGCAGCGA CGATCGATCT GCCGCGCATT
GTGCAGGAGG TGACCGATGG GCAGGGGATG CTCATCGGCA GCAGCGCCGC GCCGGGTTCC
TGGGCAGATC TGACGCCGCC GCCAGCCGGA ACGGTCGATC TGGATCGCGC GCGGGCGCTG
CTGGACGAAG CCGGATGGCG TCTGCCGCCG GAAGGCGCCA TTCGCCAGCG CGACGGCGTG
CCGCTGACGG TGCAACTCTT CGTGCGCGCC GACGATCCGC GTCGGGTGCG CGCCGCCGAG
TTGATTGCCA GCGCTGCCGA ACAGATCGGG ATGGATATTG TGGTGCAGCC TGCCGATTTT
GCCACCGTCA TTCGCTCGAA ATATGCGCCG CCCTACGACT TCGATCTGCT GATCGGCAGT
TGGGTCAATG GCGTCGCCGA TCCGGATTTC GCCGACTACG CTTTCTACGA CCCGGACGAT
TTTGCGCTCT TTCATTCGAG TCAGATCAAC CAGGGGCTGG CGGATACACG TCCGACGCTC
AACTTTGTCG GGTTCAGCGA TCCGATCTAC GATAATCAGG CGGGTGCGGC ACGGCAGTTA
TACGATCTGA GCGAACGGGC GCAGGCGATC CAGCGTGCCC AGGAACGGGT GGCGCTTCTA
CGCCCATATC TGTTTCTCTG GGCGGATCGA ATAGCGGTAG TGTGCAATCC GCGGGTAAAA
ACGCCGGACG GACCGGTCAC CCTGATGACG CCCAACTATA TGTGGAATAT CGAGCGCTGG
TATGTCGAGG TGGAAGGTTG A
 
Protein sequence
MNKSSSSIHT PRSLRQLFQL LSASVWVAAT ILVLLAACGA PDQPVATGEA SPTSAPTSAP 
PTPTALPRGG NLTIRLSDDS GTLQPWHPRT RGEEQIIGLI YSGLMRLDAT LAPQPDLATG
WVASADGRVI TFTLRSDAVW HDGRPVTVDD VVFTLDALRA LPPSTALLAG LRRIVEVTAP
ESDTIVIRLD ERYAPIFSLL TTPVLPRHLL IGRNLAEFNA WDVPFGSGPF RFEQREPGVA
ITLAANQAFY RGAPLLDRVV FVIAPDAQVA ASALQDERLL LAELPWSVGS VITETSPMLQ
WGAYAENGYY FLAFNLRSDR IFSDPRLREA LAATIDLPRI VQEVTDGQGM LIGSSAAPGS
WADLTPPPAG TVDLDRARAL LDEAGWRLPP EGAIRQRDGV PLTVQLFVRA DDPRRVRAAE
LIASAAEQIG MDIVVQPADF ATVIRSKYAP PYDFDLLIGS WVNGVADPDF ADYAFYDPDD
FALFHSSQIN QGLADTRPTL NFVGFSDPIY DNQAGAARQL YDLSERAQAI QRAQERVALL
RPYLFLWADR IAVVCNPRVK TPDGPVTLMT PNYMWNIERW YVEVEG