Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0747 |
Symbol | |
ID | 5207686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 924275 |
End bp | 925582 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640594361 |
Product | extracellular solute-binding protein |
Protein accession | YP_001275113 |
Protein GI | 148654908 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00118998 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGAGAG TATTGCGCCG GATCATGCGA TGCCGTCTGA TTGCCATTCT GCTCATTGCT GCGCTGCTCG CCGCGTGTGG GTCGAATGGC GCGCCCGGAA CCTCTTCCGC TCCGACGATT GAGGCAAACT CTTCCAGTGT CGGACCGCTC CTGCTCTGGC ATGGATGGTC GGGCAGCGAG CGGCAGGCGC TTGGGCGACT GGTTGAGCAG TACAACCGCC AGCAGCGTGA TGGGCGCATC GTGCTGCAAG CGGTGCCGCT TGCCAGTTTC GCCGCCGAAC TGCGCACGGC AGTTGCGGCC GGGAGCGGTC CGCATCTGAT ACTCATCCCC AACACCTGGA TCGGTCCGCT CGCAGAGGCG GGCGTTATGC TCCCGCTCGA TGATCTGATC CTGTCGCAGG AGACCAATAC GCTTCTGCCG GTTGCGCTGG CAGGCGCCCG GCTGCGCGAC GCTGCCGGGA CGCAGCGACT GTATGGCGTC CCGATCCGTT TTGATACAAT TGCACTCTAC TACAATACCG CCAATCTAAC CCAACCCCCG GCGGATACCG CAACCATGAT CGCCGTTGGG CGCGGATTGA GCGATCCGGA GGCGCAACCG CCCATCTGGG GACTGGCGCT CAACCTGTCG TATGACAACA CTATTGGCTA TCTGTACGCT TTCGATGGGC GTGTCTTCGA TGACGACGGA CGGGTCGCAC TTGGCAGGGA AGGGCGCGCT GGCGCCGAGC AATGGCTGGC ATGGCTCACC CAGTTGCACA ACGATCCGCG TATTCTGGCA CGCAGCGACA GCAGCATCCT GGTTGATCGC GAATTGAAAG ATGGTCGCGC GATCATGACG TTCGACTGGG CACACCAGAT CGGCGTCTAC CGTGAGCTTT GGGGTAATCA ACTCGGTCTT GCACCGCTGC CGCGCTTGAG TGAAACCGGT CAGATGCCGC GCCCCTATGT ACGCACCGAT GTCCTGGCGA TCAACAGCCT GGTCGGGGCA AACGAACGTG ATGCAGCGGT ACGTTTCCTT CGCTTCATGA TCGGCGAAGA AGCTCAGGCA GCGTTGCTCC AGAGCGATAT GCAACCGGCA TCGCGCACCC TCGCATTGAC CGGAGATTCG CCCCAGATCG CAGCTGCGCA GGTCTTTCGC GCCCAGGCGG AACAGGGGCT GCCGATGCCA AATGCAAACA CACGCGCGTT CGTCGAGCAG GAAATCAGGC GCATGCAACG CCAGGCGTTG CTTGGACTCG CCACACCCGC CGATGCCGTC GCTGAAGCCG ACCGTCGCCT GCGTGAACGA CTCGAACCCG CGCCGTGA
|
Protein sequence | MMRVLRRIMR CRLIAILLIA ALLAACGSNG APGTSSAPTI EANSSSVGPL LLWHGWSGSE RQALGRLVEQ YNRQQRDGRI VLQAVPLASF AAELRTAVAA GSGPHLILIP NTWIGPLAEA GVMLPLDDLI LSQETNTLLP VALAGARLRD AAGTQRLYGV PIRFDTIALY YNTANLTQPP ADTATMIAVG RGLSDPEAQP PIWGLALNLS YDNTIGYLYA FDGRVFDDDG RVALGREGRA GAEQWLAWLT QLHNDPRILA RSDSSILVDR ELKDGRAIMT FDWAHQIGVY RELWGNQLGL APLPRLSETG QMPRPYVRTD VLAINSLVGA NERDAAVRFL RFMIGEEAQA ALLQSDMQPA SRTLALTGDS PQIAAAQVFR AQAEQGLPMP NANTRAFVEQ EIRRMQRQAL LGLATPADAV AEADRRLRER LEPAP
|
| |