Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1879 |
Symbol | |
ID | 5539357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2412968 |
End bp | 2414326 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894016 |
Product | extracellular solute-binding protein |
Protein accession | YP_001431987 |
Protein GI | 156741858 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.29051 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA AACTCTCACG ACGCAGGTTC CTCAAGGTTG CTGCCGCAGG CGCGGGCAGC ATTGGCGCAG CAGCGCTGCT GGCGGCGTGC GGCGGCGCGG CGCCGCAGGG CGGGCAACCG ACAGGCGGGC AGGCGCAACC TGCCGCGCCG GTTCAGGGTG ACGCCGTTGT CACCGAGATC ACCTTCTGGT GGTGGGATCA GGTCGGTGAG GTGTGGAAAG AACCGTTTGA GAAGGCGCAC CCCAACATCA AACTCAACTT CGTCAACACC CCCTTCGCCG ACGCGCACGA CAAACTCCTG ACCTCCTTCG CCGCCGGAAG CGGCGCTCCC GATGTCGCTT CAATTGAGAT CGGGCGTGTC GGCAATTTCA CCGCCAAAGG CGGCCTCGCC GATCTGCTGG CGCCGCCGTT TGATGCCGGC AGCCTGAAGA ACGATATGGT TGCCTACAAG TGGACACAAG GCTCCACTGC CGATGGCCGT CTCGTCTGCC TGCCGTGGGA CATCGGACCG GCTGGCGTTT GGTACCGCAC CGACATTTTC GAGGCGCTTG GGTTGCCAAC CGATCCAGAG GCGGTCGAGG AGTTGATCGG CGGTCCCAAC CGCACGTGGG ACGACTTCTT CGCGTTCGCA AAGCAACTGA AAGAGAAGAG CGGCGGCAAG ACCTCGCTCT TCGCCGATGC CGGCACCGAC ATCTATGGCG CCGTTTACCG CCAGCAGGGT GAGGGATATG CCGATGGCAA CAAAGTGCTG ATCGAAGAAA AGGCGACCCG TCCATTCCAG CTGGCTGCGC GCGCGCGCAA AGACGGGATC GATGCCAATA TTCCCTGGTG GGGCGCCGAG TGGCAGACCG GCTTGAAAGA CAACGCCTTT GCCGGGATGG TCATCGCATG CTGGATGCAG GGCGGGTTGA CGCGTGAGCA GCCCGATCTG GTTGGGAAGT GGCGCGTGAT ACGCGCTCCA GAAGCCAACT ATAACTGGGG CGGCTCGTTC ATGGCGATCC CGGAGCAGAG CAAGAATAAA GAAGCCGCCT GGACCTTCGT CAAATGGGCA TGCGCAACGG CAGAGGGGCA GAACATCATG TTCAAGGCGT CGGGTGTGTT TCCGGCATAT AAGCCCGCCT GGCAGGACCC GCTGTACGAT GAGCCGGTGC CGTTCTTCGG CGGTCAGCGC GCCTATCGTC TCTGGACGGA GATCGGCGAC AACATCAAGG CGATTTTCCG CACGCCGCAC GATCTCCAGC TCGATGACAT CGTTGGCGCG GAATTGACCA AAGTGCTGCA AGAAGGGAAA GACCCCGTCC AGGCGGCGAA AGACGCCGAG GCGGAAGCGA TTCGGCGCAT TCCCGATATG CAGGCGTGA
|
Protein sequence | MTTKLSRRRF LKVAAAGAGS IGAAALLAAC GGAAPQGGQP TGGQAQPAAP VQGDAVVTEI TFWWWDQVGE VWKEPFEKAH PNIKLNFVNT PFADAHDKLL TSFAAGSGAP DVASIEIGRV GNFTAKGGLA DLLAPPFDAG SLKNDMVAYK WTQGSTADGR LVCLPWDIGP AGVWYRTDIF EALGLPTDPE AVEELIGGPN RTWDDFFAFA KQLKEKSGGK TSLFADAGTD IYGAVYRQQG EGYADGNKVL IEEKATRPFQ LAARARKDGI DANIPWWGAE WQTGLKDNAF AGMVIACWMQ GGLTREQPDL VGKWRVIRAP EANYNWGGSF MAIPEQSKNK EAAWTFVKWA CATAEGQNIM FKASGVFPAY KPAWQDPLYD EPVPFFGGQR AYRLWTEIGD NIKAIFRTPH DLQLDDIVGA ELTKVLQEGK DPVQAAKDAE AEAIRRIPDM QA
|
| |