Gene Information |
Locus tag | Sros_0936 |
Symbol | |
ID | 8664209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 954873 |
End bp | 957812 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_003336683 |
Protein GI | 271962487 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
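The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Gene Information table for Sros_0936.
gene_len = gene_length_bp(954873, 957812)
print(gene_len)                     # 2940
print(protein_length_aa(gene_len))  # 979
```

Under these assumptions the computed values agree with the reported Gene Length (2940 bp) and Protein Length (979 aa).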
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCGAT TCCCCATCAG AGCGGGAGCG CTCCCACGCT TCCGGAGACG GGTCGCCGCG CTGGCGCTGC TCAGCCTGGC GGCCGGGACG ACCACCGCGC TGGCAGCCTC CCCCGCCGTC GCGGCGATCT CCTGCGAGGT GACCTACGCG ACCAACGACT GGCAGGGCGG CTTCACCGCC AACGTGTCGA TCAAGAACCT GGGCGACCCG CTGAACGGCT GGACACTCGG CTTCACCTTC CCCGACGCCG CGCAGAAGGT CACCCAGGGG TGGAGCGCCA CCTGGTCGCA GAGCGGCCAG GCCGTCACCG CCAGGAACCT CGACTGGAAC GGCAACCTGG CCACGGGCGC CTCGACCAGC ATCGGCTTCA ACGGCAGCTG GTCGGGGGCC AACCCCAAGC CGGCCGCCTT CACGATCAAC GGCACCACCT GCGGCGGGAC CGGGCCGGTC AACCAGCCCC CCACCGCGAA GATCACCAAG CCGGTCGCGG GGGCCACCTT CACCGCGCCC GCCACGGTGG ACATCACCGC TGACGCCGCC GACGGCGACG GTACGGTCGC CAAGGTCGAG TTCTTCAACG GCACCACGCT GCTGGGCACC GACACCACCG CGCCCTACGC CTACAGCTGG GCGGCCGTAC CCGCCGGTGA CTACTCCCTC ACCGCGAAGG CGACCGACGA CAAGGGCTCG GCCACGACCT CGGCGCCGGT CGGGATCAGC GTCCAGGCGA ACTCCGCCCC CGCCGTGCTT CTCACCCCGG CCACGCTCGC CGTGCCCGAG GGCGGCGGCG CGGACCTGTC GGTCAAGCTC TCCAAGGCCC CGTCCGCGAA CGTGACCGTG ACGACCGCCA GGACGAGCGG CGACGCCGAC CTGACCGTCG CCTCGGGCGG GTCGCTGACC TTCACCCCCG CCAACTGGAA CACCGCCCAG ACGGTCAGGG TCGCCGCCGC CGAGGACACC GACCAGACCT CGGGCAGCGC GGAGTTCACC TCGACCGCCA CCGGCCACAC CCCGGCCAAG GCGACCGCGA CCGAGGTGGA CAACGACACC CCGGGCGGCG ACAACGAGTA CGTCAAGCGG TTCACCACGA TGTACAACAA GCTCAAGGAC CCGGCGAACG GCTACTTCTC GCCGCAGGGC GTGCCGTACC ACTCGGTGGA GACCTTCATG GTCGAGGCGC CGGACCACGG GCACGAGACC ACCTCCGAGG CCTACAGCTA CTACCTGTGG CTGGAGGCGG CCTACGGCAA GGTGACCGGC GACTGGAGCA GGTTCAACGA CGCCTGGGCC TCGATGGAGA AATACATCAT CCCGGCCACC GCGGACCAGC CGACCAACTC CTTCTACAAC CCGTCCAAGC CCGCCACCTA CGCCGGCGAG TGGGACGACA TCAAGCAGTA CCCCTCCAAG CTGGACGGCG GCGTCTCGGT CGGCAGCGAC CCGATCGCGA ACGAGCTGAA GACCGCCTAC GGCACGAACG ACGTGTACGG CATGCACTGG CTGCTCGACG TGGACAACAC CTACGGCTTC GGCCGCTGCG GCGACGGCAC CACCAAGCCC GCCTACATCA ACACCTACCA GCGCGGCCCG GAGGAGTCGG TCTTCGAGAC CATTCCCCAG CCTTCCTGCG ACACCTTCAA GCACGGCGGC AAGAACGGCT ACCTGGACCT GTTCACCGGG GACAGCAGCT ACGCCAAGCA GTGGAAGTAC ACCAACGCCC CCGACGCCGA CGCCCGCGCC GTCCAGGTGG CCTACTGGGC GCACACCTGG GCCAAGGAGC AGGGCAAGGA GGCGCAGGTC 
GCCTCCTCGG TCACCAAGGC CGCCAAGATG GGCGACTACC TGCGCTACGC GATGTACGAC AAGTACTTCA AGAAGCAGGG CTGCACGAGC ACCACGTGCC CGGCCGGCAC CGGCAAGGAC AGCTCGGCCT ACCTGCTGAG CTGGTACTAC GCCTGGGGCG GCGCCAACGA CACCTCCGCC GGCTGGGCCT GGCGGATCGG CTCCAGCCAC AACCACTCCG GCTACCAGAA CCCGATGGCC GCCTGGGCGC TGTCGAGCGT GGACGCGCTC AAGCCCAAGG GGGCGACCGC CGTACAGGAC TGGAGCACCA GCCTGAAGCG GCAGCTTGAG TTCTACCGCT GGCTGCAGTC GAGCGAGGGC GCGATCGCCG GCGGCGCCAC CAACAGCTGG CAGGGCCACT ACGCGGCGCC GCCGTCCACG CTGCCCACCT TCTACGGCAT GGCCTACGAC TGGCAGCCGG TCTACCACGA CCCTCCGTCC AACCAGTGGT TCGGCTTCCA GGCCTGGTCG ATGGAGCGGG TCGCGGAGCT CTACTACGCG ACCGGCAACG CCGACGCCAA GCTGGTGCTG GACAAGTGGG TCAAGTGGGC GACCGACAAC ACCACGGTCA ACGCCGACGG GACCTTCCGG ATCCCCTCGA CCCTGGTGTG GACCGGCCAG CCCGACACCT GGAACTCGGG CAACCCGGGG CCCAACGCCG GGCTGCACGT CAGCATCCGG GACTACACCA GCGACGTCGG CGTGGCGGGC TCCTACGCCA AGGTGCTGAC CTACTACGCC GCCAAGTCGG GCAACGCCAC GGCCAAGGCC GTCGCCAAGG GCCTGCTCGA CGGCCTCTGG AAGAACAACC AGGACGCCAA GGGCGTCTCG GTGCCCGAGA CCAAGGCCGA CTACAACCGC CTCAACGACC CGGTCTACGT CCCGCCCGGC TGGACCGGCA AGATGCCCAA CGGCGACGTG ATCGACTCCA GCTCCACCTT CATGTCGATC CGGTCCTTCT ACAAGAACGA CCCGGACTGG CCGAAGGTCG ACGCCTACCT GAAGGGCACC GGGCCCGTGC CGTCCTTCAA CTACCACCGG TTCTGGGCCC AGGTCGACGT GGCCGTCGCC CTGGCCGAGT ACGGCCGGCT CTTCCCCTGA
|
Protein sequence | MPRFPIRAGA LPRFRRRVAA LALLSLAAGT TTALAASPAV AAISCEVTYA TNDWQGGFTA NVSIKNLGDP LNGWTLGFTF PDAAQKVTQG WSATWSQSGQ AVTARNLDWN GNLATGASTS IGFNGSWSGA NPKPAAFTIN GTTCGGTGPV NQPPTAKITK PVAGATFTAP ATVDITADAA DGDGTVAKVE FFNGTTLLGT DTTAPYAYSW AAVPAGDYSL TAKATDDKGS ATTSAPVGIS VQANSAPAVL LTPATLAVPE GGGADLSVKL SKAPSANVTV TTARTSGDAD LTVASGGSLT FTPANWNTAQ TVRVAAAEDT DQTSGSAEFT STATGHTPAK ATATEVDNDT PGGDNEYVKR FTTMYNKLKD PANGYFSPQG VPYHSVETFM VEAPDHGHET TSEAYSYYLW LEAAYGKVTG DWSRFNDAWA SMEKYIIPAT ADQPTNSFYN PSKPATYAGE WDDIKQYPSK LDGGVSVGSD PIANELKTAY GTNDVYGMHW LLDVDNTYGF GRCGDGTTKP AYINTYQRGP EESVFETIPQ PSCDTFKHGG KNGYLDLFTG DSSYAKQWKY TNAPDADARA VQVAYWAHTW AKEQGKEAQV ASSVTKAAKM GDYLRYAMYD KYFKKQGCTS TTCPAGTGKD SSAYLLSWYY AWGGANDTSA GWAWRIGSSH NHSGYQNPMA AWALSSVDAL KPKGATAVQD WSTSLKRQLE FYRWLQSSEG AIAGGATNSW QGHYAAPPST LPTFYGMAYD WQPVYHDPPS NQWFGFQAWS MERVAELYYA TGNADAKLVL DKWVKWATDN TTVNADGTFR IPSTLVWTGQ PDTWNSGNPG PNAGLHVSIR DYTSDVGVAG SYAKVLTYYA AKSGNATAKA VAKGLLDGLW KNNQDAKGVS VPETKADYNR LNDPVYVPPG WTGKMPNGDV IDSSSTFMSI RSFYKNDPDW PKVDAYLKGT GPVPSFNYHR FWAQVDVAVA LAEYGRLFP
|
| |
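As a rough consistency check between the Gene sequence and Protein sequence fields, the sketch below translates short fragments of the gene sequence using the codon assignments of translation table 11, which are the same as those of the standard genetic code. It uses only the Python standard library. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the fragments are the first 30 and last 15 bases copied from the record rather than the full sequence.

```python
from itertools import product

# Codon assignments in NCBI base order T, C, A, G. Translation table 11
# shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 15 bases of the gene sequence, as printed in the record.
head = "ATGCCTCGAT TCCCCATCAG AGCGGGAGCG"
tail = "CGGCTCTTCCCCTGA"
print(translate(head))  # MPRFPIRAGA
print(translate(tail))  # RLFP
```

The two fragments translate to the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would allow a comparison with the reported GC content.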