Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3559 |
Symbol | |
ID | 8666847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3944630 |
End bp | 3947623 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | chitinase |
Protein accession | YP_003339236 |
Protein GI | 271965040 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00432534 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.147083 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGTCG TCTGCGCGGC CGCCACGGTG GGCATGACCA CGGTCGGCAC ACCGGTCGCG CATGCCGCGG CGTCGCTGTC CGCCAAGTTC ACCGCCGCCG ACCGCGGCAC GTGGTGGCAG GGCGGTTACG AGGTGAAGAA CACCGGTGAC ACCGCCGCCA CCACCTGGAC CCTCGAATTC GACCTCAACT CCGGCCAGTC CATCGGCAAC TGGTGGAACG GCACGCCGAC CGTGAGCGGC GGCCACGTGA CCGTGAGGCC GTCGAGCACC AACGCCAACG TGCCCGCGGG CGGCACCACC GGCGGCAACA GCTTCGGGTT CGTCGGGATG GGACCGAGCA CCCCGCCCGC CAACTGCAAG ATCAACGGCA ACCCCTGCGA GGGCGGGCCC CCGCCGGACA CCCCGCCGAC CGCGCCGGGC AGGCCCGTCG CGACCCCGGA CATCACCAGC GTGGCCCTGA CCTGGACCGC CTCCACCGAC GACAAGCAGG TCGCCGGGTA CGACGTCTAC CAGGGCGGCG CCAAGGTCAG GTCGGTCACG ACCAGCGCGG CCACGGTGGA CGGCCTCGCG GCCGACACCG AGTACACCTT CACCGTCAAG GCGAGGGACA GCGCGAACCA GGAATCGGCG TCCTCCCCGG CGACGACCGT GCGCACGCTC AAGGACCCGA CCCCGGACAC CGAGGCGCCG ACCACGCCGG GCACGCCGGT CGTCACCGGC AAGTCGGCCA CCGGCGTGAC GCTGCAGTGG GGAGCGTCGA CCGACAACCG GGTTGTCACC TCCTACGAGG TCCACAACGG CGACACGCTC GCCACCACGG TGACCGGGAC CCCGCCGGGC ACCACCACGA CGGTCGGCGG CCTCACCGAG GACACCGAGT ACACCTTCAA GGTCCGCGCC GGGGACGGGG CGGGCAACCG GTCGGCCTTC TCCGGGACGG TCACCGTCCG GACCGACAGG CAGCCCACCG ACCAGTCGGC CTGCCGGCCC GACGGGCTCT ACCGGTCGCC CGGCGTGACC GGGGTCCCCT ACTGCTCCGT CTACGACACC GACGGGCGCG AGAAGATGGG CGCCGACCAC CCGCGCCGGG TGATCGGATA CTTCACGAGC TGGCGGACCG GTCGGAACGG CGCCCCCGCC TACCTGGCGA ACGACATCCC GTGGGACAAG ATCACCCACA TCAACTACGC CTTCGCGCAC ATCGACGGCC AGGGCAAGGT CTCCGTCGGC ACCCCGGGAC CGGACAACCC CGCCCTCGGC ATGCAGTGGC CGGGCGTCGC CGGAGCCGAG ATGGACCCGT CGTACTCGTA CAAGGGCCAT TTCAACCTGC TCAACAAGTT CAAGAAGCAG CACCCCGGCG TCAAGACGAT CCTGTCGATC GGCGGCTGGG CCGAGACCGG CGGCTACCTC GACGACAACG GCGTACGGCA GGCCACCGGC GGCTTCTACA CGATGGCGGG CTCGCAGGCG GGCATCAACA CCTTCGCCGA CTCGGCCGTG AAGTTCATCA GGGATTACGG CTTCGACGGC GTGGACATCG ACTACGAGTA CGCGACGTCC GCCCCGAGCG CCGGCAACCC GGACGACTTC GCCTTCTCCA ACCCGCGCCG CGCCACGCTG ATGGCCGGCT ACGTCAACCT GATGAGGACG CTGCGCCAGA AGCTGGACGC GGCCGCCGCG GCCGACGGCA AGTACTACCT GCTGACCGCC GCCACCAGCG CGTCCGGCTG GATCCTGCGC GGCAGCGAGA GCTACCAGGT CACCCCGTAC CTCGACTACG CCAACATGAT GACCTACGAC CTGCACGGGG CGTGGAACCA GTTCGTCGGC CCGCTGCAGG CGCTGTACGA CGACGGCACC GACGCCGAGA TGAAGCACTG GAACGTGTAC GGCACCTACA GCGGCATCGG CTACCTGAAC GGCGACTGGG CCTACCACCA CCTGCGCGGC GCGATCCAGT CCGGCCGGAT CAACCTGGGC CTCGGCTACT ACAGCCGCGG CTGGAAGGGC GTCACCGGTG GCACCGACGG GCTCTGGGGA ACCTCCGCGC TGCCCAACCA GAACGACTGC CCGGAGGGCA CCGGAGGAAA GATCGGCTCC ACGGTGCCGT GCGGCGACGG CGCGATCGGC ATCGACAACC TCTGGCACGA CCTGGATAAA ACAGGCAAGG AGGTGCCCGC GGGCGTCAAC CCGGTCTGGC ACTTCAAGAA CCTGCAGGAG GGCAGGCAGG GCAGCTACAT CACCCAGTAC GGGCTCACCC CCGACACCGA CCCGGCCGAC CGCCTCTCCG GCACATACAC CCGCAAGTAC TCCTCCTCCA TGGCCGCCCC CTGGCTGTGG AACAACGACA AGAAGGTCTA CCTGTCCACC GAGGACGACC AGTCCGTCCA GGCCAAGGCC GACTACGTGG TCGGCAAAGG CCTCGGTGGC ATCATGATCT GGGAGCTGGC CGGCGACTAC GCCTACCACC CGGGGCGCGA CGGAGGCAGG GGCGAGTACT TCATCGGCGA CACGCTCACC ACGAAGATCT ACAACACCTT CAGGACGGCC GCGCCGTACG GGAACCTGAA GGCCGGCACC AGGGCGATGC CCGCCCAGAC CCTGAACGTC CAGGCCGAGC TGTACGGCTT CGCGGTCGGT GACGCCAACT ACCCGATCTC GCCCAAGCTC AAGTTCACCA ACAACTCCAC GACCACCATC CCCGGCGGTG CCACGATCGA GTTCGACTAC GGCACCTCCG CGCCGGCGAC CATGACCCAG CAGACCGGCT GGACGCTCTC CATCGTGTCC ACCGGCCATA CCGGGCCGAA CACCGGCGGC CTCAAGGGTG ACTTCCACCG GGTCCGGCTG ACGGTCCCGA CCTGGGAGAG CATCGCCCCC GGGCAGTCCA AGGAGGTCCA GCTCCGCTAC GACCTGCCCA TCGCCAGCCC GTCCAACTTC ACCGTGACGT TCGGCGGCCA GTCCTACCGG ATGGCCTTCG ACAACCCGCG CTGA
|
Protein sequence | MAVVCAAATV GMTTVGTPVA HAAASLSAKF TAADRGTWWQ GGYEVKNTGD TAATTWTLEF DLNSGQSIGN WWNGTPTVSG GHVTVRPSST NANVPAGGTT GGNSFGFVGM GPSTPPANCK INGNPCEGGP PPDTPPTAPG RPVATPDITS VALTWTASTD DKQVAGYDVY QGGAKVRSVT TSAATVDGLA ADTEYTFTVK ARDSANQESA SSPATTVRTL KDPTPDTEAP TTPGTPVVTG KSATGVTLQW GASTDNRVVT SYEVHNGDTL ATTVTGTPPG TTTTVGGLTE DTEYTFKVRA GDGAGNRSAF SGTVTVRTDR QPTDQSACRP DGLYRSPGVT GVPYCSVYDT DGREKMGADH PRRVIGYFTS WRTGRNGAPA YLANDIPWDK ITHINYAFAH IDGQGKVSVG TPGPDNPALG MQWPGVAGAE MDPSYSYKGH FNLLNKFKKQ HPGVKTILSI GGWAETGGYL DDNGVRQATG GFYTMAGSQA GINTFADSAV KFIRDYGFDG VDIDYEYATS APSAGNPDDF AFSNPRRATL MAGYVNLMRT LRQKLDAAAA ADGKYYLLTA ATSASGWILR GSESYQVTPY LDYANMMTYD LHGAWNQFVG PLQALYDDGT DAEMKHWNVY GTYSGIGYLN GDWAYHHLRG AIQSGRINLG LGYYSRGWKG VTGGTDGLWG TSALPNQNDC PEGTGGKIGS TVPCGDGAIG IDNLWHDLDK TGKEVPAGVN PVWHFKNLQE GRQGSYITQY GLTPDTDPAD RLSGTYTRKY SSSMAAPWLW NNDKKVYLST EDDQSVQAKA DYVVGKGLGG IMIWELAGDY AYHPGRDGGR GEYFIGDTLT TKIYNTFRTA APYGNLKAGT RAMPAQTLNV QAELYGFAVG DANYPISPKL KFTNNSTTTI PGGATIEFDY GTSAPATMTQ QTGWTLSIVS TGHTGPNTGG LKGDFHRVRL TVPTWESIAP GQSKEVQLRY DLPIASPSNF TVTFGGQSYR MAFDNPR
|
| |