Gene RoseRS_1645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1645 
Symbol 
ID5208600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2019189 
End bp2020379 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content57% 
IMG OID640595251 
Productextracellular solute-binding protein 
Protein accessionYP_001275987 
Protein GI148655782 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.222493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.608221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAA GACGAATGAG ATTGGGTACA CTGCTGCTCG TGGTGCTGCT GGCGCTGGCA 
GCGTGCGGCG GACAGCCGAC CGGAAGCCCT GGCAATGAGT ATGGCAGCGG CGGTGCAACC
GAGCCGACCA CGGCGCCCGG CGCAACGCAA CCGCCAGCTG GCGATGAGTT GCAGGTTGAT
CGCTCCAGAC TCTCGCGTGA ACTCAGATTC TTCAACTGGA CCGATTACAT CGATCCCTCG
ATCCTCGAAG ACTTCGAGAA AGAGTATGGC GTCAAAGTGA TCGTCGATCT GTTCGACGCC
AACGAAGACA TGCTCGCCAA AGTGCGCGCC GGTCGCTCCG GCTACGACAT CGTCACCCCC
TCGGATTACG CCGTCGAGAT CATGTGGCGC GATGGACTGA TCGCAAAACT CGACAAATCG
CTGCTGCCCA ATCTGAAGCA TATCGATCCC GATCTGCTCG ATAAATACTT CGATCCGGGG
AATGTCTACT CCGTACCATA CATGTACGGC ATTACCGGAA TCGCTTACAA TCGATCCTTC
TTCCCGAACG GCGTCGATAG TTGGGCGGCA CTATTCGACA CAGCCCAGAT CGAGAAGTAT
CGCGGGCAAT TCAGCATGCT CGACGATGAG CGCGAAACCC CTGGCGCTGC GCTGAGATTC
CTCGGCTACT CACTGAACGA AACCTCGCCA GAGGCGCTGA AGAAAGCGCA GGACCTGCTG
ATTGCGCAGA AGCCGTACCT GGCAGGGTAC AACAGCAGCG ACGTGAACCG GAAACTGGCG
AGCGGCGAGT ATGTCATCGC GCATGCGTGG AGCGGCTCGG CGTTACAGGC GCGCAATGGG
CTTGGAGACG AGTTCTCCGG CAACCCGGAT ATTGCCTTCG TCATCCCGAA GGAAGGCGGG
ATGATCTGGA TGGATAACAT GGTTATTCTG GCAGACTCAC CCAACGCCTA CACTGCGCAT
GTGTTTATGA ATTTTCTGAT GCGCCCCGAC ATCGCTGCAC GCAACGCTGA ATACATCGGC
TATCTCTCGC CGAACGTCGA AGGGATCAAA CTGTTGCCGC AGGAGATCAT CGACCTGTAC
AACGAAGGGT TCGCACCGAA CGACGAAGTG ATGAAACGCC TGGAATGGGC GATACGCAAC
GAGCAGACAG CGGCGTTCAC CGATCTGTGG ACGGCGGTCA AGGGGGAGTA G
 
Protein sequence
MLKRRMRLGT LLLVVLLALA ACGGQPTGSP GNEYGSGGAT EPTTAPGATQ PPAGDELQVD 
RSRLSRELRF FNWTDYIDPS ILEDFEKEYG VKVIVDLFDA NEDMLAKVRA GRSGYDIVTP
SDYAVEIMWR DGLIAKLDKS LLPNLKHIDP DLLDKYFDPG NVYSVPYMYG ITGIAYNRSF
FPNGVDSWAA LFDTAQIEKY RGQFSMLDDE RETPGAALRF LGYSLNETSP EALKKAQDLL
IAQKPYLAGY NSSDVNRKLA SGEYVIAHAW SGSALQARNG LGDEFSGNPD IAFVIPKEGG
MIWMDNMVIL ADSPNAYTAH VFMNFLMRPD IAARNAEYIG YLSPNVEGIK LLPQEIIDLY
NEGFAPNDEV MKRLEWAIRN EQTAAFTDLW TAVKGE