Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4566 |
Symbol | |
ID | 5211552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5723744 |
End bp | 5725105 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640598145 |
Product | extracellular solute-binding protein |
Protein accession | YP_001278847 |
Protein GI | 148658642 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGA TCAACCGACG ACAATTCCTG CGCGGTGTGG CGGCTGGCGC CGGTGCGCTG ACGCTGGCAG CCTGCGGCGG CGGCACGACG ACCGAACCCA CAGCGGCGCC AGCGCAACCC ACAGCGGCGC CACCCACTGC AATGCCCCAG GGCGCAATGG GTTCAACGGT CGAAATCACC TACTGGGGGT CATTCAGCGG CGTACTCGGT GAGGCGGAGC AGGCGACTGT CGAAGCCTTC AACAGCATGC AGCAGGATGT CAAAGTCAAC TACCAGTTCC AGGGCAACTA CGAAGAGACG GCGCAGAAGT TGACCGCTGC GGTTCAGGCG CGCCAGACGC CGGACGTCAG CCTGCTCTCG GATGTCTGGT GGTTCTCGTT CTATATCAAC GGTCAGTTGC AGCCGCTCGA CGATCTGATG GCAGCCGAAG GCGTCAAGCG CGAAGCCTAC GTGGATGTGC TGCTCAACGA AGGCATCCGC AAGAATACCG TCTACTGGAT CCCGTTCGCA CGCTCGACGC CGCTGTTCTA CTACAACAAG GATGCCTGGG CGGAGGCAGG GCTTGAGGAT CGTGCGCCGA AGACCTGGGA CGAATTCATG GAGTGGGCGC CGAAACTCAA CAAAGAGGGA CGCGCCGCAT TTGCCCACCC CGGCGCTGCA AGTTACATCG CCTGGCTCTT CCAGGGTGTG ATCTGGCAGT TTGGCGGACG CTACAGCGAC CCCGACTTCA CCATCCGCAT CCACGAAGAG GGCGGCATCA AAGCTGGCAA CTTCTACCGT GACACGACCC AGACCTACAA GTGGGCAACC ACGCCGAAGG ACGTGACCCA GGACTTCGTC ACCGGTCAGT CAGCCAGCGC GATGTTGAGC ACTGCTGCGC TGGCAGGCGT CGAGAAGAAT GCGCAGTTCC CGGTCGGCAC CGGCTTTCTG CCGGAAGGAC CGGCTGGCTT CGGGTGCTGC ACCGGCGGCG CGGGTATGGC CATCCTGGCG GGCCTGCCTG CTGAGAAGCA GCAGGCGGCG ATGAAGTGGA TCGCCTTTGC CACCGGTGAG GAGTGGACAA TTAACTGGGC GCAGCGCACA GGATATATGC CGGTGCGCAA GGCAGCGGTG CAGTCGGAGA GTATGCAGAA ATATTTTACC GAGCGCCCCA ACTTCCGCAC GGCGGTCGAA CAGTTGCCGA AGACCCGTCC GCAGGACTCG GCGCGCGTCT ATGTACGCGG CGGCGATCAG ATCATCGGCA AGGGGTTGGA GCGCATTACG GTTGCGGGCG AAGACCCGGC GAAGGTCTGG ATGGACGTCA AGAAAGAACT CGAAGAGGCT GCCGCGCCAA CCGTCGAACT GTTGAAGACG GTTGAGGGTT AG
|
Protein sequence | MATINRRQFL RGVAAGAGAL TLAACGGGTT TEPTAAPAQP TAAPPTAMPQ GAMGSTVEIT YWGSFSGVLG EAEQATVEAF NSMQQDVKVN YQFQGNYEET AQKLTAAVQA RQTPDVSLLS DVWWFSFYIN GQLQPLDDLM AAEGVKREAY VDVLLNEGIR KNTVYWIPFA RSTPLFYYNK DAWAEAGLED RAPKTWDEFM EWAPKLNKEG RAAFAHPGAA SYIAWLFQGV IWQFGGRYSD PDFTIRIHEE GGIKAGNFYR DTTQTYKWAT TPKDVTQDFV TGQSASAMLS TAALAGVEKN AQFPVGTGFL PEGPAGFGCC TGGAGMAILA GLPAEKQQAA MKWIAFATGE EWTINWAQRT GYMPVRKAAV QSESMQKYFT ERPNFRTAVE QLPKTRPQDS ARVYVRGGDQ IIGKGLERIT VAGEDPAKVW MDVKKELEEA AAPTVELLKT VEG
|
| |