Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3480 |
Symbol | |
ID | 5210457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4362883 |
End bp | 4365630 |
Gene Length | 2748 bp |
Protein Length | 915 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640597075 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001277788 |
Protein GI | 148657583 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.409586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA CCGGAGTCGC TCTCACCAGT GAGCGAGAAA AGCGCATCGA CGCGCTCCTG CAACAGATGA CGCTGGCGGA GAAAGTCGCG CTGCTGGCGG GATCGAGTAT GTGGACCACC ACCCCCATCG AGCGCCTGGG CATTCCCGCC ATCAAAGTCA CAGATGGACC GAACGGCGCG CGCGGTGCGG GCGGGTTTGT TGGCGGCGCC GTCACTGCGG CGTGCTTCCC GGTCGGCATT GCGCTGGCGG CGACATGGAA CACCACACTG CTGGAAGCGG TGGGTGAAGC GCTGGCGGAA GAGGCGCAAT CCAAAGGTGC GCATCTCCTG CTGGCGCCAA CCGTCAACAT TCATCGCTCG CCGCTCAACG GACGCAACTT CGAGTGCTAT TCCGAAGACC CCTATCTCTC CGCACGCATG GCGGTCGCCT ATATTACCGG GCTGCAACGG CGGGGCGTCG GCGCAACGGT CAAGCACTAT GTCTGCAACG ACTCAGAGTT CGAGCGCAAC ACCATCAGTT CCGAGGTCGA TGAGCGCACC CTGCGCGAAA TCTACCTGCC GCCCTTTCGC GCGGCGGTGC AGGAAGCCGG CACATGGGCG GTTATGGCAG CGTACAATCG CGTCAACGGC ATCTATGCCA GTGAGCATCC GATGCTCCTG ATCGATATCC TCAAGCGAGA GTGGGAGTTC GATGGCATCG TTATGTCCGA CTGGTTCGGT ACGAAGAGCG TCGTTGAAGC CGCCGCCAAT GGTCTGGACC TCGAAATGCC GGGACCGACG CGCTGGCGTG GTGAACGATT GCTCGCCGCC GTGGAACAGG GACAGGTAAG CCTGGCAGCC ATCGATGAGT CCGCGCGACG GATGTTGCGC ACCATTGCCC GCGCAGGCGC CTTTGAGAAC CCGGATATTC GCCCCGAACA GGCAATCGAC CGCCCTGCGC ATCGCGCGCT CATCCGTCGC GCCGCCGCCG AGGGGATGGT GCTGCTCAAG AACGAGGGCG GCATCCTGCC GCTCAACCTT GCCAGCCTCT CGTCCATCGC CATTATCGGA CCCAACGCAA AAACGGCGCA GATCATGGGC GGCGGCAGCG CCCAGGTCAA TGCACACTAT GCGATCTCGC CATACGACGG CATTGCGGCG CGGGTTGGCA ATCGGGCGAC TCTGGGATAC GAGATCGGGT GCACCAACCA CCGACAACTT CCCCGCCTGG ATGGAAAACT GGTCGAGGCG GGCGACGGCA GCGGACGCGG GTTCACGATT GTCTACTACA ACTCGCCAGA CCTGACCGGC GATCCGGTTC ATCGGGCGAC GGTCGAGACC GGCGAGCAAG TCTGGCTCGG CGAGGTCGCT CCTGGCGTCG ATCCACGCCA GTTCTCGGCG CGCCTTACCG CGCGCTTCAC ACCGACTGAA AGCGGCGTCC ACACGTTCAG TCTGATCAGC GCCGGCGTGA GCCGCCTGTT TGTAAATGAC ACCCTGCTGG TGGACAACTG GACGAACCCG GTGCGCGGCG ACACCTATTT TGGCGCTGGC AGCACCGAAG TCACCGCATC GATCGCACTC GAAGCCGGTC GCACCTGCGA TCTGCGTCTG GAATACAGCA CCCAGGGCGC AGCCATGCTT GCCGCAGTGC GTCTGGGATA CCTGCCGCCG GTCGCGGAAG ATGCAATTGA ACGCGCGGCG GCGCTGGCGG CGCGTTCCGA TGTTGCGCTC GTCTTCGTCG GGTTGAACGC TGATTGGGAG AGCGAAGGGC ATGATCGCCC CCATATGGAT CTGGTCGGCA GGCAGGCTGA ACTGATCGAG CGCGTGGCGA ACGCCAACCC ACGCACCGTG GTGGTATTGC AAACCGGCTC ACCGGTGACG ATGCCCTGGC TGGACAAGGT GGCCGGGGTT ATCCAGGCAT GGTACCCCGG TCAGGAATGC GGCAACGCCA TCGCTGATGT CCTGTTTGGC GATGTCAACC CGTCGGGTCG GCTCCCGCAA ACCTTCCCGG TGCGGTTGGA AGACAACCCG GCATACATCA ACTATCCGGG TGAGAATGGG CGGGTGCGCT ACGGCGAAGG TATTTTCGTC GGCTACCGCT ACTACGAGAA GAAAAAAGTC GCGCCGCTCT TCCCATTCGG CTTCGGTCTG TCGTATACCA CCTTCCGCTA TGACAATCTT CGCCTGAGCG CCGACACCCT TGCGCCTGAC GAGCGTCTCA CCGTTCAGGT GGACGTCACC AACACCGGAA ACGTTGCCGG GCAGGAGGTC GTCCAGCTCT ACGTTCGTGA CAGCGCGGCG CGTGTCGCGC GACCGGAGAA GGAACTGAAA GGATTTGCCA AAGTCGCGCT GCAACCGGGC GAAACGCAAA CCGTGACGCT GCACCTCGAC CGCGAAGCGC TGGCATACTG GGACGACGCG CAGCATGCCT GGGTGGCGGA AGCAGGCGAG TTCGAGGTGC TGATCGGCAG TTCCTCGCAG GACATCCGGG CACGCGCAAC ATTCCGCCTG AGCGAAACGG TGTCCTTCGG CGGACCGGCA AAACCTCCGG TGAAACTGAG CATCGATTCG CCCGTCAAAG CATTGCTCGA ACATGAAGGA GCGCGTCATA TACTGGAACG CTATCTGCCG GGATTTGCCG AACAGGCGGG CGTTGGAGTT ATGATGGGGT TGACGCTGGC GCAGATGGCA GGGTTTGCAC CGGACAGAAT CACGCCGGAA CACCTGGCAG CGATCGCCGC CGATCTGGCG CAACTGCGTG ACCAGTAA
|
Protein sequence | MTDTGVALTS EREKRIDALL QQMTLAEKVA LLAGSSMWTT TPIERLGIPA IKVTDGPNGA RGAGGFVGGA VTAACFPVGI ALAATWNTTL LEAVGEALAE EAQSKGAHLL LAPTVNIHRS PLNGRNFECY SEDPYLSARM AVAYITGLQR RGVGATVKHY VCNDSEFERN TISSEVDERT LREIYLPPFR AAVQEAGTWA VMAAYNRVNG IYASEHPMLL IDILKREWEF DGIVMSDWFG TKSVVEAAAN GLDLEMPGPT RWRGERLLAA VEQGQVSLAA IDESARRMLR TIARAGAFEN PDIRPEQAID RPAHRALIRR AAAEGMVLLK NEGGILPLNL ASLSSIAIIG PNAKTAQIMG GGSAQVNAHY AISPYDGIAA RVGNRATLGY EIGCTNHRQL PRLDGKLVEA GDGSGRGFTI VYYNSPDLTG DPVHRATVET GEQVWLGEVA PGVDPRQFSA RLTARFTPTE SGVHTFSLIS AGVSRLFVND TLLVDNWTNP VRGDTYFGAG STEVTASIAL EAGRTCDLRL EYSTQGAAML AAVRLGYLPP VAEDAIERAA ALAARSDVAL VFVGLNADWE SEGHDRPHMD LVGRQAELIE RVANANPRTV VVLQTGSPVT MPWLDKVAGV IQAWYPGQEC GNAIADVLFG DVNPSGRLPQ TFPVRLEDNP AYINYPGENG RVRYGEGIFV GYRYYEKKKV APLFPFGFGL SYTTFRYDNL RLSADTLAPD ERLTVQVDVT NTGNVAGQEV VQLYVRDSAA RVARPEKELK GFAKVALQPG ETQTVTLHLD REALAYWDDA QHAWVAEAGE FEVLIGSSSQ DIRARATFRL SETVSFGGPA KPPVKLSIDS PVKALLEHEG ARHILERYLP GFAEQAGVGV MMGLTLAQMA GFAPDRITPE HLAAIAADLA QLRDQ
|
| |