Gene Information |
Locus tag | RoseRS_1303 |
Symbol | |
ID | 5208255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1605836 |
End bp | 1607482 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640594918 |
Product | hypothetical protein |
Protein accession | YP_001275657 |
Protein GI | 148655452 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
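The coordinate and length fields above are mutually consistent, which a short check can show. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1605836
end_bp = 1607482

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1647
print(protein_length)  # 548
```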
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.480323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGTC ATAGCCAACA GCAGATAGGG CTTATGGAGG CGCTCGATGT CGTTCTTGAT CGCCTGTTGC GCGGCGCCGA CATCGATGAG TGCCTGAGTC TGTACCCGCA TCTGGCGGTT GAGCTCGAAC CGCTGTTGCG TGTTGCGGGG ATGGTACGCG CAGAGGTGAC GCAACCGCTG CCGCCTGAAA TGGAACGCTG GCTGGCGACC GGGGCGCAGG AGTTTGCCGC AATTGCCGAT CAGATGCTTG CGCGTCGGCA TGCCCGGCGC AATCTGCTCA AACCGCTGCG CAAGGCTGCC GTTCAACGTG TCCTGGTCGG CGCCCTGGCA GTGACGGTTC TGCTTGCATC GGTTGACACG GCGTCGGCGC AAAGTCTGCC GGGCGACCCG CTCTATGTCT GGAAAGTGGC ACGGGAAGAT CTGACGCTTT CGATGACGTC CGATCCAGTC CAGCGGAGTA AACTGCACGT CACCTATGCC CGCCGCCGCC TTCTGGAAAT TAATGAGATG CTCGCCAGTG ATGCGGCAAT CGATCCACAG GCGCTCAGGG AGCCGCTTGC CCTTCTCAGC AGTCACATCC GCGGCGCTGT CATCGAAAGC CGCGACATGG ATGTTGTCGA TGTGTCGGTC GATATCACTG CACTCCTCGG TGAGGTGAGG ACTGCTCTGT CGCGGCTTGC GTCGAAAGTT CCAGATGCCT CTCCGCTGCT CGAAAACGTC CAGGAGCAGA TCGACTCGGT GATTGAACCG ACAGCGTCGC CGGTTCCGAT TGCGACCGCG TCGCCAGCAC CCTCATTGCC GCCGACAACG CCGGTCGAAG CGCCGACAAC TGTGGAAATC ACGGCTCCGG AAGCTGCACC TCAGACCGGG CGCGAGCCTG AACGGGTCGA TGCCGCCACC CCTATCGCAA CACCGCCGCC GACCAGTCGC CCGCCTCGCC CGTCGCCGAC GCCCGGTCAG ATTGAGCCAA CCGCAAGCAA CGCCCCGGCG CCGACTGCAA CGCCTGCGCC CCGATCACCA ACGGCGACGC CGCCCCCGCC ACCAACAATG ACGCCGACGG CGCCGCCAAC TGAGGCGTCG CCGACAAATA CGCCGCTGCC AACCAACACG CCATCACCAA CGGCAACGCC ACCGCCGACG GCGACTCGTG TTCCACCGAC CGAACCGCCG TCAGCCAGTT CGACGCCACA ACCGCCGCCA ACAGCGCGTC CGCCGCGTCC AACGGCGACA CCACGCCCAA CGATAACCCC GACGTCGGCG CCGACCGCAA CACCGACTGA TCCCCCGGCG CCGACTGCAA CACCGACTGA TCCCCCGGCG CCGACCGCAA CGCCGACGCC AGCGCCGACC GCAACGCCGA CGCCAGCGCC GACCGCAACG CCGACGCCAG CGCCGACCGC AACGCCGACG CCAGCGCCGA CCGCAACACC GACTGATCCC CCGGCGCCGA CCGCAACGCC GACTGAGACG CTGCCACCCA CCCCGTCGAT CACACCAACT GATGAGGCGG GACAACCAAC GGTGACGCCA GCGGGTTCAG AGCCGTCAGG TATGCCGACT CTAACGCCAA CACCAGCGGG TTCAGAGCCG TCAGGTATGC CGACCCTAAC GCCAGATGAC GCAGGCGCAC ATGGCGCCAA TCTGTAA
|
Protein sequence | MIRHSQQQIG LMEALDVVLD RLLRGADIDE CLSLYPHLAV ELEPLLRVAG MVRAEVTQPL PPEMERWLAT GAQEFAAIAD QMLARRHARR NLLKPLRKAA VQRVLVGALA VTVLLASVDT ASAQSLPGDP LYVWKVARED LTLSMTSDPV QRSKLHVTYA RRRLLEINEM LASDAAIDPQ ALREPLALLS SHIRGAVIES RDMDVVDVSV DITALLGEVR TALSRLASKV PDASPLLENV QEQIDSVIEP TASPVPIATA SPAPSLPPTT PVEAPTTVEI TAPEAAPQTG REPERVDAAT PIATPPPTSR PPRPSPTPGQ IEPTASNAPA PTATPAPRSP TATPPPPPTM TPTAPPTEAS PTNTPLPTNT PSPTATPPPT ATRVPPTEPP SASSTPQPPP TARPPRPTAT PRPTITPTSA PTATPTDPPA PTATPTDPPA PTATPTPAPT ATPTPAPTAT PTPAPTATPT PAPTATPTDP PAPTATPTET LPPTPSITPT DEAGQPTVTP AGSEPSGMPT LTPTPAGSEP SGMPTLTPDD AGAHGANL
|
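The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The following is a minimal sketch, not the tool the database itself uses: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ only in permitted start codons). It assumes the sequence is given in coding orientation, as the leading ATG suggests, and contains only A, C, G, and T.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGATCCGTC ATAGCCAACA GCAGATAGGG"
print(translate(prefix))  # MIRHSQQQIG
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_content` on the same input can be compared with the GC content field in the record.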
| |