Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2896 |
Symbol | |
ID | 5209865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 3614809 |
End bp | 3616167 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640596492 |
Product | extracellular solute-binding protein |
Protein accession | YP_001277214 |
Protein GI | 148657009 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.034507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA AACTCTCACG ACGCAGGTTC CTCAAGGTTG CTGCCGCGGG CGCAGGCGGC ATTGGTGCAA CGGCGCTGCT GGCGGCTTGT GGCGGCGCAG CGCCTCAGGG CGGTCAACCA ACGGGCGGGC AGGCGCAACC TGCAGCGCCG GTTCAGGGTG ACACAGTTGT CACGGAAATC ACCTTCTGGT GGTGGGATCA GGTCGGTGAG GTCTGGAAAG AACCGTTCGA GAAGGCGCAC CCCAACATCA AACTCAACTT CGTCAACACC CCCTTCGCCG ATGCGCACGA CAAACTTCTG ACCTCGTTCG CCGCCGGAAG TGGCGCTCCC GATGTCGCCT CGATCGAGAT TGGTCGCGTC GGCAATTTTA CCGCCAAGGG CGGGCTTGCC GATCTGCTGG CGCCGCCGTT CGATGCGGGC AGTCTCAAGA ATGATATGGT CGCCTATAAA TGGACCCAGG GATCGACTGC CGATGGTCGC CTGGTCTGCC TGCCGTGGGA TATCGGTCCT GCTGGGGTCT GGTATCGCAC GGATATTTTC GAGGCGCTCG GTTTGCCAAC CGAACCGGAA GCGGTAGAGG AGTTGATCGG CGGTCCGAAC CGCACGTGGG ACGATTTCTT CGCCTTTGCC AAACAACTCA AGGAAAAGAG CGGCGGGAAG ACGTCCCTCT TTGCCGATGC CGGCACTGAT ATTTATGGCG CCGTCTATCG CCAGCAGGGT GAGGGGTATG CCGATGGCAA CAAAGTGCTG ATCGAAGAGA AGGCGACCCG TCCGTTCCAG CTCGCGGCGC GCGCCCGCAA GGAGGGGATC GATGCCAACA TTCCCTGGTG GGGCGCCGAG TGGCAGACCG GCTTGAAGGA CAATGCCTTT GCCGGAATGG TGATTGCCTG CTGGATGCAG GGCGGTCTGA CACGCGAGCA GCCCGATCTG GTCGGGAAAT GGCGTGTCAT ACGCGCTCCA GAAGCCAATT ACAACTGGGG CGGTTCGTTC ATGGCGATCC CGGAGCAGAG CAAGAACAAG GAGGCGGCCT GGACGTTCGT CAAGTGGGCA TGCGCAACGG CGGAAGGGCA GAACATCATG TTCAAGGCGT CCGGCGTGTT TCCCGCATAC AAGCCAGCCT GGCAGGATCC ACTCTACGAC GAACCGGTGC CGTTCTTCGG CGGTCAGCGC GCCTATCGCT TGTGGACCGA AATCGGTGAC AATATCAAAG CTATCTTCCG TACACCGAAC GATCTCCAGC TCGATGACAT CGTTGGCGCA GAACTGACGA AGGTCTTGCA GGATGGCAAG GACCCCGTTC AGGCTGCGAA GGACGCCGAA GCAGAAGCGC TCAGGCGCAT CCCCGATCTG CAAGGATAG
|
Protein sequence | MTTKLSRRRF LKVAAAGAGG IGATALLAAC GGAAPQGGQP TGGQAQPAAP VQGDTVVTEI TFWWWDQVGE VWKEPFEKAH PNIKLNFVNT PFADAHDKLL TSFAAGSGAP DVASIEIGRV GNFTAKGGLA DLLAPPFDAG SLKNDMVAYK WTQGSTADGR LVCLPWDIGP AGVWYRTDIF EALGLPTEPE AVEELIGGPN RTWDDFFAFA KQLKEKSGGK TSLFADAGTD IYGAVYRQQG EGYADGNKVL IEEKATRPFQ LAARARKEGI DANIPWWGAE WQTGLKDNAF AGMVIACWMQ GGLTREQPDL VGKWRVIRAP EANYNWGGSF MAIPEQSKNK EAAWTFVKWA CATAEGQNIM FKASGVFPAY KPAWQDPLYD EPVPFFGGQR AYRLWTEIGD NIKAIFRTPN DLQLDDIVGA ELTKVLQDGK DPVQAAKDAE AEALRRIPDL QG
|
| |