Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1645 |
Symbol | |
ID | 5208600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2019189 |
End bp | 2020379 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640595251 |
Product | extracellular solute-binding protein |
Protein accession | YP_001275987 |
Protein GI | 148655782 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.222493 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.608221 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAA GACGAATGAG ATTGGGTACA CTGCTGCTCG TGGTGCTGCT GGCGCTGGCA GCGTGCGGCG GACAGCCGAC CGGAAGCCCT GGCAATGAGT ATGGCAGCGG CGGTGCAACC GAGCCGACCA CGGCGCCCGG CGCAACGCAA CCGCCAGCTG GCGATGAGTT GCAGGTTGAT CGCTCCAGAC TCTCGCGTGA ACTCAGATTC TTCAACTGGA CCGATTACAT CGATCCCTCG ATCCTCGAAG ACTTCGAGAA AGAGTATGGC GTCAAAGTGA TCGTCGATCT GTTCGACGCC AACGAAGACA TGCTCGCCAA AGTGCGCGCC GGTCGCTCCG GCTACGACAT CGTCACCCCC TCGGATTACG CCGTCGAGAT CATGTGGCGC GATGGACTGA TCGCAAAACT CGACAAATCG CTGCTGCCCA ATCTGAAGCA TATCGATCCC GATCTGCTCG ATAAATACTT CGATCCGGGG AATGTCTACT CCGTACCATA CATGTACGGC ATTACCGGAA TCGCTTACAA TCGATCCTTC TTCCCGAACG GCGTCGATAG TTGGGCGGCA CTATTCGACA CAGCCCAGAT CGAGAAGTAT CGCGGGCAAT TCAGCATGCT CGACGATGAG CGCGAAACCC CTGGCGCTGC GCTGAGATTC CTCGGCTACT CACTGAACGA AACCTCGCCA GAGGCGCTGA AGAAAGCGCA GGACCTGCTG ATTGCGCAGA AGCCGTACCT GGCAGGGTAC AACAGCAGCG ACGTGAACCG GAAACTGGCG AGCGGCGAGT ATGTCATCGC GCATGCGTGG AGCGGCTCGG CGTTACAGGC GCGCAATGGG CTTGGAGACG AGTTCTCCGG CAACCCGGAT ATTGCCTTCG TCATCCCGAA GGAAGGCGGG ATGATCTGGA TGGATAACAT GGTTATTCTG GCAGACTCAC CCAACGCCTA CACTGCGCAT GTGTTTATGA ATTTTCTGAT GCGCCCCGAC ATCGCTGCAC GCAACGCTGA ATACATCGGC TATCTCTCGC CGAACGTCGA AGGGATCAAA CTGTTGCCGC AGGAGATCAT CGACCTGTAC AACGAAGGGT TCGCACCGAA CGACGAAGTG ATGAAACGCC TGGAATGGGC GATACGCAAC GAGCAGACAG CGGCGTTCAC CGATCTGTGG ACGGCGGTCA AGGGGGAGTA G
|
Protein sequence | MLKRRMRLGT LLLVVLLALA ACGGQPTGSP GNEYGSGGAT EPTTAPGATQ PPAGDELQVD RSRLSRELRF FNWTDYIDPS ILEDFEKEYG VKVIVDLFDA NEDMLAKVRA GRSGYDIVTP SDYAVEIMWR DGLIAKLDKS LLPNLKHIDP DLLDKYFDPG NVYSVPYMYG ITGIAYNRSF FPNGVDSWAA LFDTAQIEKY RGQFSMLDDE RETPGAALRF LGYSLNETSP EALKKAQDLL IAQKPYLAGY NSSDVNRKLA SGEYVIAHAW SGSALQARNG LGDEFSGNPD IAFVIPKEGG MIWMDNMVIL ADSPNAYTAH VFMNFLMRPD IAARNAEYIG YLSPNVEGIK LLPQEIIDLY NEGFAPNDEV MKRLEWAIRN EQTAAFTDLW TAVKGE
|
| |