Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1359 |
Symbol | |
ID | 5208311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1667499 |
End bp | 1668875 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640594970 |
Product | extracellular solute-binding protein |
Protein accession | YP_001275709 |
Protein GI | 148655504 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGC AAGCCGATCA CCTTGTCCAT CGTTTGAGTC GGCGGCAGTT CTTGCGAGGA GCGGCAATCG GAGGTGCGGC GCTGACATCC GGCATTCTGG CGGCCTGTGG CTCATCGCCG ACTGCGCCAG CCGGTGATGC TCCGACAGCC GCGCCAACGC AGGTCGCCAG CGAGCCAACG AAGATTCGCG CGCTCATGTG GAGCAATGGA CCTGTCATCG ACGAGAACTT CAAAGTCCGC GCGCAGATGT TCAACGAGAC GTTCAAGGGA CAGTACGATC TCGATCTGCA ACTGCTGCCC TACGATCAAT ACTGGCCCCG CATCGATCTG GCGTATGGAG CGAAGAACCC CTATGATCTC TACTTCTTCG ACGTACAGGC GTATGGGCAT TACCGTGCTG GCCTGCTCTC GAACATCCAG CCCTACGTCG ATCTGGCGCC GGAACTGTTG AACGCCGAAG AGTATCCGGT CGCGCTCTAC GACGCCTGGC GCTTCGATGG CAGCAATCTG TACGGACTTC CCGAAAATAT CCAGGTGCTG GCGCTCTACT ACAACCGTGA TCTCTTCGAT GCCGAAGGTC TGGCGTATCC CGATGACACC TGGACATGGG ACGATGTGCT CGACGCGGCA ACGAAACTGA CGAAGCGCAA CGGTGACGAA ACCACCCAGT GGGGGCTGGA TGTGGGTGTG ATGGATATCT GGTGGGGCGC GCAGACGCTG GCGTGGGCGA TGGGCGGCGG GTTCTTCGAC AAGATCGTCG AGCCGACAAA GTTCCAGGTG AGCGATGAAG TCAATGTGCA GGCGTTGACA TTCCTGCGCG ACCTGATCTT TGTTCACAAG GTCGCGCCCA CCAAGACGCA ACGTTCGGCG GCTGCCCAGG ATATCGGCAT TTTCCAGACC GGTAAGGTGG CGATGTTCTT CGATGGCAGC TGGGCGATCA GCGGGTTCCA GGATGTGCCG TTCAAGTGGG ATATGGCGCC GTTGCCGATG TGGAAGGATA AGCGCGTATC CGCCTACTGG CTGGGCGGTC AGGTCATCCC GAAAGACTCG AAAGTCATCG ACGCCGCCTT CGCCTTTGCC CGCTGGTCGG CGACGACCTA TCAGAAGACC ATGGCAGGCA ATCACGACTG GATCCCGATT GCGCGTTCGG CGCGCGAGTC GGAGGAGATG TACGTCGGGC AACCGGCCGG TTTGCGGTCG GTGCTGGGGA CGATCGAGGG CGCGCGGTTG GGTGATTTCT ATTCCCGGAA CAATCAGCAG ATCTTCAGTG AGGTGCTGCT GCCGACATTC GATCTGATGT TCCTCGGAAC CATCACGCCG GAAGAGGCGG CAAAGAAGAT CGACGAAGAA GCAAATGCTC TCCTGGCGAA AGGATAA
|
Protein sequence | MAKQADHLVH RLSRRQFLRG AAIGGAALTS GILAACGSSP TAPAGDAPTA APTQVASEPT KIRALMWSNG PVIDENFKVR AQMFNETFKG QYDLDLQLLP YDQYWPRIDL AYGAKNPYDL YFFDVQAYGH YRAGLLSNIQ PYVDLAPELL NAEEYPVALY DAWRFDGSNL YGLPENIQVL ALYYNRDLFD AEGLAYPDDT WTWDDVLDAA TKLTKRNGDE TTQWGLDVGV MDIWWGAQTL AWAMGGGFFD KIVEPTKFQV SDEVNVQALT FLRDLIFVHK VAPTKTQRSA AAQDIGIFQT GKVAMFFDGS WAISGFQDVP FKWDMAPLPM WKDKRVSAYW LGGQVIPKDS KVIDAAFAFA RWSATTYQKT MAGNHDWIPI ARSARESEEM YVGQPAGLRS VLGTIEGARL GDFYSRNNQQ IFSEVLLPTF DLMFLGTITP EEAAKKIDEE ANALLAKG
|
| |