Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_7891 |
Symbol | |
ID | 8671214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 8698920 |
End bp | 8701835 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Endo-1,3(4)-beta-glucanase |
Protein accession | YP_003343292 |
Protein GI | 271969096 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.798967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCCCT CCATCCGGAC CACGGGTCCG GACAGTGCGC AGCCCCGTGA AAGGCGACTC ATCCACCGTC CGTGGGCCCG CAGGCGCGCC CTCGTCGCCG GCGTGGCCGT CGCGATGGTG GCCGGCATGT TCGGCCTCGG TCTCGCCCGG TCCGCCGAGG CGGCCACGGT CGGCGCCGGC AGCTATGCCG ACACGCTGCC CGCCGGCCGC TCGCTGCCGA CCGGCTGCGG TTCCATCTCC ACCAACCCGC GCCAGTGGGT GACGGCCAAC GCTCCCGCGG GAGCCGTTCC GACCAACGAC TGGTGGTCGT CGATCCTCTA CAAGAAGACC GACTGCTCCT ACGGCGAGCC GCTGCACGCC CACCCGATCT CCTACGACAC CTCCGGCACC GGCCTCAGCT TCTCCTACAC CACCACGCCC GCGATCAGCG GGACGGCCAC CGGGGTAGGG GAGTTCCACT ACCCCTACGT CCAGGACATC CAGGTCGGCG TCGCCGGGCT GAACTCACCC GACGTCAAGG TCGACGGCTG GAGCGACTGG ACCGTCACGC CGTACTGGAG CGGCGGCGGC CGGACCATGA AGGCCACCAT CGGCCACGGC CTGCCGTTCG GCTACTTCCA GATCACCGGC GGGGACGCGC GGATCACCGC GGCCGGCACC CCGTCCGTCT GGTCCAACAG CGGCGCCACC ATCGGTTTCA CTGTCAACGG GCACGACTAC GTCGCCTACG CGCCGACGGG CGCCACCTGG GCGGTCAGCG GGACGACAAT CAGCTCCTCC CTCGCGGGCC GGGGCTTCTT CTCGGTGGCG GTCCTGCCCG CGGGCGGCGA CCGCGCGGCC CTCGCGAACA CCTACGGCCA GTACGCGCAC GCGCACGTCA CCGGCACCCG GGTCTCCTAC TCGTACAACC CGGCGAGCAG CACGCTGAGC ACGACGTACG CCTTCACCAC CACGGCCCGG CAGGGCGGCG CGACCGGCAC GGTCACGGCC CTCTACCCGC ACCAGTGGAA CCATCTGACC GGCTCGACCC CGCTCGCGCA GACCTACGTC TCCGCGCGCG GCCAGATGAA GATCGTCACC GGGACGCAGT TCACGACGTC CATGAAGTAC ACCGGCGTGC TGCCCGAGGT GCCCGCCGTC GGCGACGGCA CCGGAGCCGA CCTGGCCACG GTCACCGGCC TCCTCAACGC CGAGCTCGGC AACCCGATGG ACAACCGGGG CGACGACACC TACTGGACCG GCAAGGGACT GGGACGCGCC GCGCGCATCG CCGAGATCGC CGACCAGCTC AACCTGACCT CGGTGCGCGA CGCGGCCCTG GGCGCCATCC GCACCCGCCT CAACGACTGG TTCACCGCAT CGCCGGGCAA GACCTCCCGG GTCTTCTACC TCGACCCCGC CTGGGGGACG CTGATCGGCT ACCCGGCCTC CTACGGCTCC GACCAGGAGC TCAACGACCA TCACTTCCAC TACGGCTACT ACGTCGCGGC CGCCGCGACC CTGGCCAAGT ACGACCCGAA CTGGGCGAAG ACCAGCCAGT ACGGCGGCAT GGTCGACCTG CTGATCCGCG ACGCCAACAA CTACGACCGC GGCGACACCC GCTTCCCCTA CCTGCGTGAC TTCGACATCT ACGCCGGCCA CGACTGGGCG TCGGGCCACG GCGCGTTCGG CGCGGGCAAC AACCAGGAGT CCTCGTCCGA GGGTATGAAC TTCGCCAACG CGCTGATCCA GTGGGGGCAG GCCACCGGGA ACACCGCGGT CCGCGACGCC GGCGTCTACA TCTACACCAC GCAGGCGGCG GCGATCCAGG AGTACTGGTT CGACGTGCGC GACCAGAACT TCCCGGCGGC CTTCGGTCAC AGCACGGTCG GCATGGTCTG GGGCGACGGC GGCGCCTACG CCACCTGGTT CAGCGCCGAG CCGGAGATGA TCCAGGGCAT CAACATGCTC CCGATCACCG GCGGCCACTT CTACCTGGGC GACAACCCCG CCTACGTGAC CACCAACTAC AACGAGCTGA CCAGGAACAA CGGCGGGCCG CCCACGGTGT GGCGGGACAT CCTCTGGGAG TTCCTCGCGC TCGGCAACGG AGACGCCGCC CTGGCGAACT TCCGCGCCAA CAGCGGCCTC ACCTCCGAGG AGGGGGAGAG CAAGGCCCAC ACCTTCCACT GGATCCGGAA CCTGGCCGCG CTGGGCACGG TGGACACCTC CGTCACCGCC AACCACCCGC TGGCCAAGGT GTTCAGCAAG AACGGCGCCC GGACCTACGT GGCTGCCAAC ATCACCGGCG CCGCGATCAC GGTGACCTTC TCCAACGGCA CCACGCTCAA CGTGCCGGCG GGCAAGACGG TCACCTCGGG CGCCCACACC TGGAGCGGCG GCAACGCCGG GGGCGGCACC GGCCCGAGCC CCCAGCCCAC CCCGACGGTC ACGCCGACCG TGGGCCCCTT CGCCGCGACG CGCTATCCGC AGGCGGGCGG GGGCCTGCCG GGCGCGGCGG GTACGGCGGG CACCGTCACG CTCGCCGCCG CCAACGGCAA CCACGACGGC ACGCCCGTGA ACGCGCAGAT CTTCACGGCC ACCGGCCTGA CCGCCGCCCA CAACGGCGGG GCCACCGCGT TCGACCTCTT CGTGGACGCG GGCACCACCG TCGCCAACGG CGTCCAGGCC CGGGTCTCCT ACGACCTGAC CGGCGACGGA AGCTGGGACC GGGTGGAGAC TTACCGCTAC TTCGCCACCG ACCCGGTGGC CGGCTGGGAG CACTACACCC AGGCAGCAGG CCTGCAGTCC TCCTCCGGCA CGCTCGGCAA CCTGTCCGGC GGCCGGGTGA GGGTGGAGGT CTGGAACGCC ATCGGCGGCG GGGTGACCAC CCTCGGCACC GGCGACCGTT CCGTGGTGCG GCTGCCGTTC GGCTGA
|
Protein sequence | MHPSIRTTGP DSAQPRERRL IHRPWARRRA LVAGVAVAMV AGMFGLGLAR SAEAATVGAG SYADTLPAGR SLPTGCGSIS TNPRQWVTAN APAGAVPTND WWSSILYKKT DCSYGEPLHA HPISYDTSGT GLSFSYTTTP AISGTATGVG EFHYPYVQDI QVGVAGLNSP DVKVDGWSDW TVTPYWSGGG RTMKATIGHG LPFGYFQITG GDARITAAGT PSVWSNSGAT IGFTVNGHDY VAYAPTGATW AVSGTTISSS LAGRGFFSVA VLPAGGDRAA LANTYGQYAH AHVTGTRVSY SYNPASSTLS TTYAFTTTAR QGGATGTVTA LYPHQWNHLT GSTPLAQTYV SARGQMKIVT GTQFTTSMKY TGVLPEVPAV GDGTGADLAT VTGLLNAELG NPMDNRGDDT YWTGKGLGRA ARIAEIADQL NLTSVRDAAL GAIRTRLNDW FTASPGKTSR VFYLDPAWGT LIGYPASYGS DQELNDHHFH YGYYVAAAAT LAKYDPNWAK TSQYGGMVDL LIRDANNYDR GDTRFPYLRD FDIYAGHDWA SGHGAFGAGN NQESSSEGMN FANALIQWGQ ATGNTAVRDA GVYIYTTQAA AIQEYWFDVR DQNFPAAFGH STVGMVWGDG GAYATWFSAE PEMIQGINML PITGGHFYLG DNPAYVTTNY NELTRNNGGP PTVWRDILWE FLALGNGDAA LANFRANSGL TSEEGESKAH TFHWIRNLAA LGTVDTSVTA NHPLAKVFSK NGARTYVAAN ITGAAITVTF SNGTTLNVPA GKTVTSGAHT WSGGNAGGGT GPSPQPTPTV TPTVGPFAAT RYPQAGGGLP GAAGTAGTVT LAAANGNHDG TPVNAQIFTA TGLTAAHNGG ATAFDLFVDA GTTVANGVQA RVSYDLTGDG SWDRVETYRY FATDPVAGWE HYTQAAGLQS SSGTLGNLSG GRVRVEVWNA IGGGVTTLGT GDRSVVRLPF G
|
| |