Gene Information |
Locus tag | RoseRS_3619 |
Symbol | |
ID | 5210597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4522488 |
End bp | 4523954 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640597212 |
Product | extracellular solute-binding protein |
Protein accession | YP_001277924 |
Protein GI | 148657719 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
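The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is only an illustrative sketch: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for RoseRS_3619.
length_bp = gene_length(4522488, 4523954)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the result matches the listed Gene Length (1467 bp) and Protein Length (488 aa).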
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTGA GACAATACCC CTGGATTCTC TTGCTCGGTG CGTTGATGCT GGTGCTGGCT GCCTGTGGCG GACAACAGAC AGCTTCACCG ACCGCCGCGC CGGGGCAGGC GCCAACCACT GCGCCAGTTA CCGACCTGCC GACTCCCACG CCGGTTCTTG AGTTTGCGCA GCAACCTCAA CCCGGTCAGA AGGTGCTGGT ATGGATGGTG CGCATCAACG CCACCGAGAA CCGCTGGGAG CGCGATGTCG TCCTTCCTGC CTACCAGCAG GTCGCCCCGG ATGTATTCGT AAAGGTGCTC AATATCAACC AGGACGATAT TGCGGTCAAG CGTGAGGCAA TGATCGCCGC GAAGGAGCCG CTGCACGTCT GGTCGTCCAA CTGGGGCGGT GATGGCTTCG CCAGCGACCG TTTCCGCGGG CTGCTGGCAG ATCTGACCCC CTTGATCGAA CGTGATAAGT GGGACACCAG CGACTTCATT CCTGAAGTGT TCGCCATCTA CAATGTCGAG GGCAAGCAGT ACGGCATTCC GTTCCTCACG ACTGGCAGTT ATGTGTACTA CAACATGAAA CTGTTCGACG AGGCTGGCGT GCCCTATCCT CCGAGCGACT GGAACGACAA GTCGTGGACA TGGGATGCGT TCCTCGAAAC CGCCAAAAAA CTGACAAAGA ACCCGGATGA TCCGAGCACG GCTGTGTATG GCGGCGTCAA CGGGCTGTGG CCGCCATTCG ATAGCATTCC CATGATCTGG GGGAAGGATC CGTTCACGAA GGAAGCGCTG GAGAGCGGGT TCTCCGATCC GATCAAACTC GATGAACAGA CAGCGGCAGC CTTCCAGGCA ATCCACGACC TGGTCTACGT CCATAAGGTC GCTCCCGACC AGGCAGCTTC CCAGGCGCTT GATCAACTTG GCGGCGCGTT CCTCTCCGGT CGCGTGGCCA TGTTCATGAC CGGCGGTTGG GGACACTGGA ACTACAAGGA AATTATCGAT GATCCGAATG GGTTCTGCTG GGGCGCAGCG CCAATTCCCT GGGGCTCCCC TGATGCGAAC ATCCGCGCAA CGATCTTCAC CGACCCATGG GTCATCACTG CTGGAATGGA CGCTGAAAAT ACCGATCTTG CCTGGAACTT CGTGAAGTTC CTGGCGTCGG CGGAACAGCA GCGCGCCTAC ACGCTGGCAA CCGGCACCCC GCCTGTGCGT CAGAGCCTGC TCAACGACTA CTACAAGCAG TATGAGAAGT GTGTCCCGGC GGAAAAAACC AAAGAGTCCT TCCAGGGCGC CTTCTCTCAC GGGCGCGAGT CATCGAACCA CCTGCTGGTC AAGTTTGATG AACTCAGCCA GACGTGGGAT AACCTGCTGA GTCCGTTCTG GAATGATCCA AATGCAAAGG CGACCGACCT CATGCCGATC CTTGAAGCGG ATGTGAATGC CGCGTTGGAG CGCATCCGCA AAGAAGCAGG CAGGTAA
|
Protein sequence | MRLRQYPWIL LLGALMLVLA ACGGQQTASP TAAPGQAPTT APVTDLPTPT PVLEFAQQPQ PGQKVLVWMV RINATENRWE RDVVLPAYQQ VAPDVFVKVL NINQDDIAVK REAMIAAKEP LHVWSSNWGG DGFASDRFRG LLADLTPLIE RDKWDTSDFI PEVFAIYNVE GKQYGIPFLT TGSYVYYNMK LFDEAGVPYP PSDWNDKSWT WDAFLETAKK LTKNPDDPST AVYGGVNGLW PPFDSIPMIW GKDPFTKEAL ESGFSDPIKL DEQTAAAFQA IHDLVYVHKV APDQAASQAL DQLGGAFLSG RVAMFMTGGW GHWNYKEIID DPNGFCWGAA PIPWGSPDAN IRATIFTDPW VITAGMDAEN TDLAWNFVKF LASAEQQRAY TLATGTPPVR QSLLNDYYKQ YEKCVPAEKT KESFQGAFSH GRESSNHLLV KFDELSQTWD NLLSPFWNDP NAKATDLMPI LEADVNAALE RIRKEAGR
|
| |
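The gene sequence above is listed in coding orientation (it begins with ATG and ends with the stop codon TAA), so it can be translated directly even though the gene lies on the minus strand. The following is a minimal sketch of that translation, not the database's own pipeline. The function name `translate` is hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons), and it assumes the sequence contains only A, C, G and T plus the spaces used for display.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Amino-acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAGACTGA GACAATACCC CTGGATTCTC"))
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the listed protein sequence (MRLRQYPWIL); the same call on the full gene sequence can be compared against the full protein sequence.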