Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_2492 |
Symbol | |
ID | 8665778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 2717943 |
End bp | 2720879 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | endo-1,4-beta-glucanase/xyloglucanase, putative, gly74A |
Protein accession | YP_003338211 |
Protein GI | 271964015 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0351052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.027518 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGAA GGCTGTTCAG CGTGCTCGCC GCCGTGGCGG TGGCGTGCGC GGCGGTCGGC ACGGCCGTCG GGGGCGGGGC CGCGTCGGTG TCCCCCTCCT CTGCCGCCGC CGAGCCGTAC GAGTGGAAGA ACGTCCGGAT CGACGGCGGC GGGTTCGTGC CCGGGATCGT CTTCAACCAG ACCGAGAAGG ACCTGATCTA CGCCCGCACC GACATCGGCG GCGCCTACCG CTGGGAGCAG GCGAGCAGGA GCTGGACCCC GCTGCTCGAC TGGGTGGGCT GGGACAAGTG GGGCTACAAC GGCGTGGTCA GCCTCGCCAC CGACCCGGTG GAGACCGGCC GGGTCTACGT CGCCGCCGGG ATGTACACCA ACGACTGGGA CCCCAACAAC GGCGCGGTCC TGCGCTCCGC GGACAAGGGG AGGACCTGGC AGGCCACCCC GCTGCCGTTC AAGCTCGGCG GCAACATGCC GGGCCGGGGG ATGGGCGAGC GGCTGGCCGT CGACCCCAAC GACAACGGCG TGGTGTATCT GGGGGCGCCC GAGGGCAACG GCCTGTGGCG CAGCACCGAC AAGGGCGTGA CCTGGGCCAA GGTCACGAGC TTCCCCAACC CGGGGAACTA CGCCCAGGAT CCCTCCGACC CGAACGGATA CCTCAGCCAC CGGCCGGGCG TGGTCTGGGT GACCTTCGAC CCGTCCTCGG CGACCGCGGG CCAGAGGACC CAGAGGATCT ACGTCGGCGT CGCGGACAAG GAGAACACGG TCTACAGCTC CGCCGACGCC GGCGTCACCT GGACCAGGGT TGCCGGGCAG CCGACCGGCT ACATCGCGCA CAAGGGCGTC CTCGACCACG CCGGCCACGC GCTCTACATC GCCACCAGCG ACACCGGCGG GCCTTACGAC GGGGGCAAGG GCGACGTGTG GAAGCTCGAC ACGGTCACCG GCGCGTGGAC CCGGATCAGC CCGATCCCGT CCAGCTCGGC CGACGACTAC TTCGGCTACA GCGGGCTCAC GATCGACCGC CTGCACCCCG GCACGCTGAT GGTCGCCACC CAGGTCTCCT GGTGGCCTGA CGTGATCTTC TTCCGCTCCA CCGACGCCGG CGCCACCTGG ACCCGGATCT GGGACTGGGC CTCGTACCCC AACAGAACCT TCCGCTACAA GATGGACATC TCCTCCTCGC CCTGGCTGAC CTTCGGGGCG AACCCGCAGC CACCGGAGAC GACCCCCAAG CTCGGCTGGA TGACCGAATC GCTGGAGATC GACCCGTTCG ACTCCAACCG CATGATGTAC GGCACCGGCG CCACGCTCTA CGGCACCGAG GACCTGCTCA AGTGGGACAC CGGTGGCCAG TTCACCATCA AGCCGATGGT GAGGGGGCTG GAGGAGACCG CCGTCCTCGA CCTGATCAGC CCGCCCAGCG GGGCCCCGCT GGTCAGCGGT CTCGGCGACA TCGGCGGCTT CCGCCACACC GACCTCGCCG CCGTGCCGCC GATGATGTTC ACCTCGCCCG TTTTCACCAG CACGACCAGC CTCGACTACG CCGAGACCAA GCCCGCCGTG ATGGTCCGCG CGGGCAACTT CACCGACGCC GACCGCCCCT CCGACAGCCA CGTCGCCTTC TCCACCGACG GCGGTGCCAA CTGGTTCCAG GGCACCGAAC CGGGCGGGAT CAACGAGGGC GGCACGGTCG CGGCGGCGGC CGACGGCTCC CGGTTCGTCT GGGCGCCCAA GGGCGTGGCC GTCCACCGCT CGACCGGGTT CGGCACCTCC TGGACCGCGT CCACCGGCAT CCCCGCCGGG GCCGTGGTGG AGTCGGACCG GGTGAACCCG GCCAGGTTCT ACGGGTTCGG CGCCGGGCGG TTCTACGCCA GCACCGACGG CGGCGCCTCC TTCGCCGCCA CCGCCGCGAC CGGGCTGCCG GCCACGGGCA ACGTCAAGTT CAAGGCCGTC CCCGGCCGCG AGGGAGACAT CTGGCTGGCG GGCGGCGACA CCGCGGCCGG CGGCACGTCC GGGATCTGGC ACTCCACCGA CGGCGGCGCG TCCTTCACCA AGCTGTCCGG CGTCACGAGC GCGGTCAACA TCGGCTTCGG CAAGGCCGCG CCGGGCAGGG CCTACCAGGC GCTGTACGCG GTCGCCACAG TCGGCGGGGT GAACGGCGTC TTCCGCTCCG ACGACACGGG CGCGAGCTGG GTCAGGATCA ACGACGACCG GCACCGGTAC GGCAACATGG GCGAGGCCGT CACCGGCGAC CCCCGGGTCT ACGGCCGGGT CTACCTCGGC ACCAACGGCC GGGGCATCCT CTACGCCGAC AACGGCGGCA CCGTGCCGCC CGACGACACC ACCCCGCCGT CCAGGCCCGG CACGCCGGCC GCCTCGGCGA TCACCTCGTC GGGCGCGACG CTGACCTGGA CGGCGTCCAC CGACGACACC GCGGTGACCG GCTACGACGT CTACCGGGAG GCCGGGGCGA CGGACGTGAA GGCCGGCTCG TCCTCCTCGG CGTCGTTCGC GCTGACCGGA CTGGCCGCCG ACACCTCCCA CACCTACTAC GTGGTCGCCC GCGACGGGGC GGGCAACAGC TCCACCGCCT CCGGCCCGGT CACCTTCAGG ACGGCCGCCG GGCCGGTGGG CGGCGGGTGC GCCGCCGCCT ACAAGGTGAC CAACTCCTGG CCCGGCGGCT TCCAGGGCGA GGTGACGGTG AAGAACACCG GAACCTCGGC GATCAGCGCC TGGACCGTCA AGTGGTCGTT CCCGGACGGT CAGACGATCA CCCAGCTCTG GAGCGGCGTC CACACCCAGA CCGGCGCCGA CGTCACCGTC GGGAACGCGG GCTGGAACGG CGGCCTGGGA GGCGGGGCGT CCACGGCGTT CGGCTTCGGC GGCAGCTGGA CCGGCGCCAA CGGCGTCCCC GCCACCGTCA CCTGCACGGC CGGATAG
|
Protein sequence | MRRRLFSVLA AVAVACAAVG TAVGGGAASV SPSSAAAEPY EWKNVRIDGG GFVPGIVFNQ TEKDLIYART DIGGAYRWEQ ASRSWTPLLD WVGWDKWGYN GVVSLATDPV ETGRVYVAAG MYTNDWDPNN GAVLRSADKG RTWQATPLPF KLGGNMPGRG MGERLAVDPN DNGVVYLGAP EGNGLWRSTD KGVTWAKVTS FPNPGNYAQD PSDPNGYLSH RPGVVWVTFD PSSATAGQRT QRIYVGVADK ENTVYSSADA GVTWTRVAGQ PTGYIAHKGV LDHAGHALYI ATSDTGGPYD GGKGDVWKLD TVTGAWTRIS PIPSSSADDY FGYSGLTIDR LHPGTLMVAT QVSWWPDVIF FRSTDAGATW TRIWDWASYP NRTFRYKMDI SSSPWLTFGA NPQPPETTPK LGWMTESLEI DPFDSNRMMY GTGATLYGTE DLLKWDTGGQ FTIKPMVRGL EETAVLDLIS PPSGAPLVSG LGDIGGFRHT DLAAVPPMMF TSPVFTSTTS LDYAETKPAV MVRAGNFTDA DRPSDSHVAF STDGGANWFQ GTEPGGINEG GTVAAAADGS RFVWAPKGVA VHRSTGFGTS WTASTGIPAG AVVESDRVNP ARFYGFGAGR FYASTDGGAS FAATAATGLP ATGNVKFKAV PGREGDIWLA GGDTAAGGTS GIWHSTDGGA SFTKLSGVTS AVNIGFGKAA PGRAYQALYA VATVGGVNGV FRSDDTGASW VRINDDRHRY GNMGEAVTGD PRVYGRVYLG TNGRGILYAD NGGTVPPDDT TPPSRPGTPA ASAITSSGAT LTWTASTDDT AVTGYDVYRE AGATDVKAGS SSSASFALTG LAADTSHTYY VVARDGAGNS STASGPVTFR TAAGPVGGGC AAAYKVTNSW PGGFQGEVTV KNTGTSAISA WTVKWSFPDG QTITQLWSGV HTQTGADVTV GNAGWNGGLG GGASTAFGFG GSWTGANGVP ATVTCTAG
|
| |