Gene Sros_0936 details

Gene Information

Locus tag: Sros_0936
Symbol:
ID: 8664209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 954873
End bp: 957812
Gene Length: 2940 bp
Protein Length: 979 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: cellulose 1,4-beta-cellobiosidase
Protein accession: YP_003336683
Protein GI: 271962487
COG category:
COG ID:
TIGRFAM ID:
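
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 954873
END_BP = 957812
GENE_LENGTH_BP = 2940
PROTEIN_LENGTH_AA = 979


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the last codon is assumed to be the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 957812 - 954873 + 1 = 2940 bp, and 2940 / 3 - 1 = 979 aa.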


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCTCGAT TCCCCATCAG AGCGGGAGCG CTCCCACGCT TCCGGAGACG GGTCGCCGCG 
CTGGCGCTGC TCAGCCTGGC GGCCGGGACG ACCACCGCGC TGGCAGCCTC CCCCGCCGTC
GCGGCGATCT CCTGCGAGGT GACCTACGCG ACCAACGACT GGCAGGGCGG CTTCACCGCC
AACGTGTCGA TCAAGAACCT GGGCGACCCG CTGAACGGCT GGACACTCGG CTTCACCTTC
CCCGACGCCG CGCAGAAGGT CACCCAGGGG TGGAGCGCCA CCTGGTCGCA GAGCGGCCAG
GCCGTCACCG CCAGGAACCT CGACTGGAAC GGCAACCTGG CCACGGGCGC CTCGACCAGC
ATCGGCTTCA ACGGCAGCTG GTCGGGGGCC AACCCCAAGC CGGCCGCCTT CACGATCAAC
GGCACCACCT GCGGCGGGAC CGGGCCGGTC AACCAGCCCC CCACCGCGAA GATCACCAAG
CCGGTCGCGG GGGCCACCTT CACCGCGCCC GCCACGGTGG ACATCACCGC TGACGCCGCC
GACGGCGACG GTACGGTCGC CAAGGTCGAG TTCTTCAACG GCACCACGCT GCTGGGCACC
GACACCACCG CGCCCTACGC CTACAGCTGG GCGGCCGTAC CCGCCGGTGA CTACTCCCTC
ACCGCGAAGG CGACCGACGA CAAGGGCTCG GCCACGACCT CGGCGCCGGT CGGGATCAGC
GTCCAGGCGA ACTCCGCCCC CGCCGTGCTT CTCACCCCGG CCACGCTCGC CGTGCCCGAG
GGCGGCGGCG CGGACCTGTC GGTCAAGCTC TCCAAGGCCC CGTCCGCGAA CGTGACCGTG
ACGACCGCCA GGACGAGCGG CGACGCCGAC CTGACCGTCG CCTCGGGCGG GTCGCTGACC
TTCACCCCCG CCAACTGGAA CACCGCCCAG ACGGTCAGGG TCGCCGCCGC CGAGGACACC
GACCAGACCT CGGGCAGCGC GGAGTTCACC TCGACCGCCA CCGGCCACAC CCCGGCCAAG
GCGACCGCGA CCGAGGTGGA CAACGACACC CCGGGCGGCG ACAACGAGTA CGTCAAGCGG
TTCACCACGA TGTACAACAA GCTCAAGGAC CCGGCGAACG GCTACTTCTC GCCGCAGGGC
GTGCCGTACC ACTCGGTGGA GACCTTCATG GTCGAGGCGC CGGACCACGG GCACGAGACC
ACCTCCGAGG CCTACAGCTA CTACCTGTGG CTGGAGGCGG CCTACGGCAA GGTGACCGGC
GACTGGAGCA GGTTCAACGA CGCCTGGGCC TCGATGGAGA AATACATCAT CCCGGCCACC
GCGGACCAGC CGACCAACTC CTTCTACAAC CCGTCCAAGC CCGCCACCTA CGCCGGCGAG
TGGGACGACA TCAAGCAGTA CCCCTCCAAG CTGGACGGCG GCGTCTCGGT CGGCAGCGAC
CCGATCGCGA ACGAGCTGAA GACCGCCTAC GGCACGAACG ACGTGTACGG CATGCACTGG
CTGCTCGACG TGGACAACAC CTACGGCTTC GGCCGCTGCG GCGACGGCAC CACCAAGCCC
GCCTACATCA ACACCTACCA GCGCGGCCCG GAGGAGTCGG TCTTCGAGAC CATTCCCCAG
CCTTCCTGCG ACACCTTCAA GCACGGCGGC AAGAACGGCT ACCTGGACCT GTTCACCGGG
GACAGCAGCT ACGCCAAGCA GTGGAAGTAC ACCAACGCCC CCGACGCCGA CGCCCGCGCC
GTCCAGGTGG CCTACTGGGC GCACACCTGG GCCAAGGAGC AGGGCAAGGA GGCGCAGGTC
GCCTCCTCGG TCACCAAGGC CGCCAAGATG GGCGACTACC TGCGCTACGC GATGTACGAC
AAGTACTTCA AGAAGCAGGG CTGCACGAGC ACCACGTGCC CGGCCGGCAC CGGCAAGGAC
AGCTCGGCCT ACCTGCTGAG CTGGTACTAC GCCTGGGGCG GCGCCAACGA CACCTCCGCC
GGCTGGGCCT GGCGGATCGG CTCCAGCCAC AACCACTCCG GCTACCAGAA CCCGATGGCC
GCCTGGGCGC TGTCGAGCGT GGACGCGCTC AAGCCCAAGG GGGCGACCGC CGTACAGGAC
TGGAGCACCA GCCTGAAGCG GCAGCTTGAG TTCTACCGCT GGCTGCAGTC GAGCGAGGGC
GCGATCGCCG GCGGCGCCAC CAACAGCTGG CAGGGCCACT ACGCGGCGCC GCCGTCCACG
CTGCCCACCT TCTACGGCAT GGCCTACGAC TGGCAGCCGG TCTACCACGA CCCTCCGTCC
AACCAGTGGT TCGGCTTCCA GGCCTGGTCG ATGGAGCGGG TCGCGGAGCT CTACTACGCG
ACCGGCAACG CCGACGCCAA GCTGGTGCTG GACAAGTGGG TCAAGTGGGC GACCGACAAC
ACCACGGTCA ACGCCGACGG GACCTTCCGG ATCCCCTCGA CCCTGGTGTG GACCGGCCAG
CCCGACACCT GGAACTCGGG CAACCCGGGG CCCAACGCCG GGCTGCACGT CAGCATCCGG
GACTACACCA GCGACGTCGG CGTGGCGGGC TCCTACGCCA AGGTGCTGAC CTACTACGCC
GCCAAGTCGG GCAACGCCAC GGCCAAGGCC GTCGCCAAGG GCCTGCTCGA CGGCCTCTGG
AAGAACAACC AGGACGCCAA GGGCGTCTCG GTGCCCGAGA CCAAGGCCGA CTACAACCGC
CTCAACGACC CGGTCTACGT CCCGCCCGGC TGGACCGGCA AGATGCCCAA CGGCGACGTG
ATCGACTCCA GCTCCACCTT CATGTCGATC CGGTCCTTCT ACAAGAACGA CCCGGACTGG
CCGAAGGTCG ACGCCTACCT GAAGGGCACC GGGCCCGTGC CGTCCTTCAA CTACCACCGG
TTCTGGGCCC AGGTCGACGT GGCCGTCGCC CTGGCCGAGT ACGGCCGGCT CTTCCCCTGA
 
Protein sequence
MPRFPIRAGA LPRFRRRVAA LALLSLAAGT TTALAASPAV AAISCEVTYA TNDWQGGFTA 
NVSIKNLGDP LNGWTLGFTF PDAAQKVTQG WSATWSQSGQ AVTARNLDWN GNLATGASTS
IGFNGSWSGA NPKPAAFTIN GTTCGGTGPV NQPPTAKITK PVAGATFTAP ATVDITADAA
DGDGTVAKVE FFNGTTLLGT DTTAPYAYSW AAVPAGDYSL TAKATDDKGS ATTSAPVGIS
VQANSAPAVL LTPATLAVPE GGGADLSVKL SKAPSANVTV TTARTSGDAD LTVASGGSLT
FTPANWNTAQ TVRVAAAEDT DQTSGSAEFT STATGHTPAK ATATEVDNDT PGGDNEYVKR
FTTMYNKLKD PANGYFSPQG VPYHSVETFM VEAPDHGHET TSEAYSYYLW LEAAYGKVTG
DWSRFNDAWA SMEKYIIPAT ADQPTNSFYN PSKPATYAGE WDDIKQYPSK LDGGVSVGSD
PIANELKTAY GTNDVYGMHW LLDVDNTYGF GRCGDGTTKP AYINTYQRGP EESVFETIPQ
PSCDTFKHGG KNGYLDLFTG DSSYAKQWKY TNAPDADARA VQVAYWAHTW AKEQGKEAQV
ASSVTKAAKM GDYLRYAMYD KYFKKQGCTS TTCPAGTGKD SSAYLLSWYY AWGGANDTSA
GWAWRIGSSH NHSGYQNPMA AWALSSVDAL KPKGATAVQD WSTSLKRQLE FYRWLQSSEG
AIAGGATNSW QGHYAAPPST LPTFYGMAYD WQPVYHDPPS NQWFGFQAWS MERVAELYYA
TGNADAKLVL DKWVKWATDN TTVNADGTFR IPSTLVWTGQ PDTWNSGNPG PNAGLHVSIR
DYTSDVGVAG SYAKVLTYYA AKSGNATAKA VAKGLLDGLW KNNQDAKGVS VPETKADYNR
LNDPVYVPPG WTGKMPNGDV IDSSSTFMSI RSFYKNDPDW PKVDAYLKGT GPVPSFNYHR
FWAQVDVAVA LAEYGRLFP
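
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in codon meaning). The helper name `translate` is hypothetical, and the code is an illustration rather than the method used to produce this record.

```python
from itertools import product

# Codon table built in the conventional TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored so that sequences grouped in blocks of ten
    bases, as in this record, can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGCCTCGAT TCCCCATCAG AGCGGGAGCG"))
```

Applied to the first 30 bases of the gene sequence, this yields MPRFPIRAGA, which matches the first ten residues of the protein sequence.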