Gene Sros_3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3559 
Symbol 
ID8666847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3944630 
End bp3947623 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content69% 
IMG OID 
Productchitinase 
Protein accessionYP_003339236 
Protein GI271965040 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00432534 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.147083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGTCG TCTGCGCGGC CGCCACGGTG GGCATGACCA CGGTCGGCAC ACCGGTCGCG 
CATGCCGCGG CGTCGCTGTC CGCCAAGTTC ACCGCCGCCG ACCGCGGCAC GTGGTGGCAG
GGCGGTTACG AGGTGAAGAA CACCGGTGAC ACCGCCGCCA CCACCTGGAC CCTCGAATTC
GACCTCAACT CCGGCCAGTC CATCGGCAAC TGGTGGAACG GCACGCCGAC CGTGAGCGGC
GGCCACGTGA CCGTGAGGCC GTCGAGCACC AACGCCAACG TGCCCGCGGG CGGCACCACC
GGCGGCAACA GCTTCGGGTT CGTCGGGATG GGACCGAGCA CCCCGCCCGC CAACTGCAAG
ATCAACGGCA ACCCCTGCGA GGGCGGGCCC CCGCCGGACA CCCCGCCGAC CGCGCCGGGC
AGGCCCGTCG CGACCCCGGA CATCACCAGC GTGGCCCTGA CCTGGACCGC CTCCACCGAC
GACAAGCAGG TCGCCGGGTA CGACGTCTAC CAGGGCGGCG CCAAGGTCAG GTCGGTCACG
ACCAGCGCGG CCACGGTGGA CGGCCTCGCG GCCGACACCG AGTACACCTT CACCGTCAAG
GCGAGGGACA GCGCGAACCA GGAATCGGCG TCCTCCCCGG CGACGACCGT GCGCACGCTC
AAGGACCCGA CCCCGGACAC CGAGGCGCCG ACCACGCCGG GCACGCCGGT CGTCACCGGC
AAGTCGGCCA CCGGCGTGAC GCTGCAGTGG GGAGCGTCGA CCGACAACCG GGTTGTCACC
TCCTACGAGG TCCACAACGG CGACACGCTC GCCACCACGG TGACCGGGAC CCCGCCGGGC
ACCACCACGA CGGTCGGCGG CCTCACCGAG GACACCGAGT ACACCTTCAA GGTCCGCGCC
GGGGACGGGG CGGGCAACCG GTCGGCCTTC TCCGGGACGG TCACCGTCCG GACCGACAGG
CAGCCCACCG ACCAGTCGGC CTGCCGGCCC GACGGGCTCT ACCGGTCGCC CGGCGTGACC
GGGGTCCCCT ACTGCTCCGT CTACGACACC GACGGGCGCG AGAAGATGGG CGCCGACCAC
CCGCGCCGGG TGATCGGATA CTTCACGAGC TGGCGGACCG GTCGGAACGG CGCCCCCGCC
TACCTGGCGA ACGACATCCC GTGGGACAAG ATCACCCACA TCAACTACGC CTTCGCGCAC
ATCGACGGCC AGGGCAAGGT CTCCGTCGGC ACCCCGGGAC CGGACAACCC CGCCCTCGGC
ATGCAGTGGC CGGGCGTCGC CGGAGCCGAG ATGGACCCGT CGTACTCGTA CAAGGGCCAT
TTCAACCTGC TCAACAAGTT CAAGAAGCAG CACCCCGGCG TCAAGACGAT CCTGTCGATC
GGCGGCTGGG CCGAGACCGG CGGCTACCTC GACGACAACG GCGTACGGCA GGCCACCGGC
GGCTTCTACA CGATGGCGGG CTCGCAGGCG GGCATCAACA CCTTCGCCGA CTCGGCCGTG
AAGTTCATCA GGGATTACGG CTTCGACGGC GTGGACATCG ACTACGAGTA CGCGACGTCC
GCCCCGAGCG CCGGCAACCC GGACGACTTC GCCTTCTCCA ACCCGCGCCG CGCCACGCTG
ATGGCCGGCT ACGTCAACCT GATGAGGACG CTGCGCCAGA AGCTGGACGC GGCCGCCGCG
GCCGACGGCA AGTACTACCT GCTGACCGCC GCCACCAGCG CGTCCGGCTG GATCCTGCGC
GGCAGCGAGA GCTACCAGGT CACCCCGTAC CTCGACTACG CCAACATGAT GACCTACGAC
CTGCACGGGG CGTGGAACCA GTTCGTCGGC CCGCTGCAGG CGCTGTACGA CGACGGCACC
GACGCCGAGA TGAAGCACTG GAACGTGTAC GGCACCTACA GCGGCATCGG CTACCTGAAC
GGCGACTGGG CCTACCACCA CCTGCGCGGC GCGATCCAGT CCGGCCGGAT CAACCTGGGC
CTCGGCTACT ACAGCCGCGG CTGGAAGGGC GTCACCGGTG GCACCGACGG GCTCTGGGGA
ACCTCCGCGC TGCCCAACCA GAACGACTGC CCGGAGGGCA CCGGAGGAAA GATCGGCTCC
ACGGTGCCGT GCGGCGACGG CGCGATCGGC ATCGACAACC TCTGGCACGA CCTGGATAAA
ACAGGCAAGG AGGTGCCCGC GGGCGTCAAC CCGGTCTGGC ACTTCAAGAA CCTGCAGGAG
GGCAGGCAGG GCAGCTACAT CACCCAGTAC GGGCTCACCC CCGACACCGA CCCGGCCGAC
CGCCTCTCCG GCACATACAC CCGCAAGTAC TCCTCCTCCA TGGCCGCCCC CTGGCTGTGG
AACAACGACA AGAAGGTCTA CCTGTCCACC GAGGACGACC AGTCCGTCCA GGCCAAGGCC
GACTACGTGG TCGGCAAAGG CCTCGGTGGC ATCATGATCT GGGAGCTGGC CGGCGACTAC
GCCTACCACC CGGGGCGCGA CGGAGGCAGG GGCGAGTACT TCATCGGCGA CACGCTCACC
ACGAAGATCT ACAACACCTT CAGGACGGCC GCGCCGTACG GGAACCTGAA GGCCGGCACC
AGGGCGATGC CCGCCCAGAC CCTGAACGTC CAGGCCGAGC TGTACGGCTT CGCGGTCGGT
GACGCCAACT ACCCGATCTC GCCCAAGCTC AAGTTCACCA ACAACTCCAC GACCACCATC
CCCGGCGGTG CCACGATCGA GTTCGACTAC GGCACCTCCG CGCCGGCGAC CATGACCCAG
CAGACCGGCT GGACGCTCTC CATCGTGTCC ACCGGCCATA CCGGGCCGAA CACCGGCGGC
CTCAAGGGTG ACTTCCACCG GGTCCGGCTG ACGGTCCCGA CCTGGGAGAG CATCGCCCCC
GGGCAGTCCA AGGAGGTCCA GCTCCGCTAC GACCTGCCCA TCGCCAGCCC GTCCAACTTC
ACCGTGACGT TCGGCGGCCA GTCCTACCGG ATGGCCTTCG ACAACCCGCG CTGA
 
Protein sequence
MAVVCAAATV GMTTVGTPVA HAAASLSAKF TAADRGTWWQ GGYEVKNTGD TAATTWTLEF 
DLNSGQSIGN WWNGTPTVSG GHVTVRPSST NANVPAGGTT GGNSFGFVGM GPSTPPANCK
INGNPCEGGP PPDTPPTAPG RPVATPDITS VALTWTASTD DKQVAGYDVY QGGAKVRSVT
TSAATVDGLA ADTEYTFTVK ARDSANQESA SSPATTVRTL KDPTPDTEAP TTPGTPVVTG
KSATGVTLQW GASTDNRVVT SYEVHNGDTL ATTVTGTPPG TTTTVGGLTE DTEYTFKVRA
GDGAGNRSAF SGTVTVRTDR QPTDQSACRP DGLYRSPGVT GVPYCSVYDT DGREKMGADH
PRRVIGYFTS WRTGRNGAPA YLANDIPWDK ITHINYAFAH IDGQGKVSVG TPGPDNPALG
MQWPGVAGAE MDPSYSYKGH FNLLNKFKKQ HPGVKTILSI GGWAETGGYL DDNGVRQATG
GFYTMAGSQA GINTFADSAV KFIRDYGFDG VDIDYEYATS APSAGNPDDF AFSNPRRATL
MAGYVNLMRT LRQKLDAAAA ADGKYYLLTA ATSASGWILR GSESYQVTPY LDYANMMTYD
LHGAWNQFVG PLQALYDDGT DAEMKHWNVY GTYSGIGYLN GDWAYHHLRG AIQSGRINLG
LGYYSRGWKG VTGGTDGLWG TSALPNQNDC PEGTGGKIGS TVPCGDGAIG IDNLWHDLDK
TGKEVPAGVN PVWHFKNLQE GRQGSYITQY GLTPDTDPAD RLSGTYTRKY SSSMAAPWLW
NNDKKVYLST EDDQSVQAKA DYVVGKGLGG IMIWELAGDY AYHPGRDGGR GEYFIGDTLT
TKIYNTFRTA APYGNLKAGT RAMPAQTLNV QAELYGFAVG DANYPISPKL KFTNNSTTTI
PGGATIEFDY GTSAPATMTQ QTGWTLSIVS TGHTGPNTGG LKGDFHRVRL TVPTWESIAP
GQSKEVQLRY DLPIASPSNF TVTFGGQSYR MAFDNPR