Gene Sros_7891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7891 
Symbol 
ID8671214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8698920 
End bp8701835 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content71% 
IMG OID 
ProductEndo-1,3(4)-beta-glucanase 
Protein accessionYP_003343292 
Protein GI271969096 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.798967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCCCT CCATCCGGAC CACGGGTCCG GACAGTGCGC AGCCCCGTGA AAGGCGACTC 
ATCCACCGTC CGTGGGCCCG CAGGCGCGCC CTCGTCGCCG GCGTGGCCGT CGCGATGGTG
GCCGGCATGT TCGGCCTCGG TCTCGCCCGG TCCGCCGAGG CGGCCACGGT CGGCGCCGGC
AGCTATGCCG ACACGCTGCC CGCCGGCCGC TCGCTGCCGA CCGGCTGCGG TTCCATCTCC
ACCAACCCGC GCCAGTGGGT GACGGCCAAC GCTCCCGCGG GAGCCGTTCC GACCAACGAC
TGGTGGTCGT CGATCCTCTA CAAGAAGACC GACTGCTCCT ACGGCGAGCC GCTGCACGCC
CACCCGATCT CCTACGACAC CTCCGGCACC GGCCTCAGCT TCTCCTACAC CACCACGCCC
GCGATCAGCG GGACGGCCAC CGGGGTAGGG GAGTTCCACT ACCCCTACGT CCAGGACATC
CAGGTCGGCG TCGCCGGGCT GAACTCACCC GACGTCAAGG TCGACGGCTG GAGCGACTGG
ACCGTCACGC CGTACTGGAG CGGCGGCGGC CGGACCATGA AGGCCACCAT CGGCCACGGC
CTGCCGTTCG GCTACTTCCA GATCACCGGC GGGGACGCGC GGATCACCGC GGCCGGCACC
CCGTCCGTCT GGTCCAACAG CGGCGCCACC ATCGGTTTCA CTGTCAACGG GCACGACTAC
GTCGCCTACG CGCCGACGGG CGCCACCTGG GCGGTCAGCG GGACGACAAT CAGCTCCTCC
CTCGCGGGCC GGGGCTTCTT CTCGGTGGCG GTCCTGCCCG CGGGCGGCGA CCGCGCGGCC
CTCGCGAACA CCTACGGCCA GTACGCGCAC GCGCACGTCA CCGGCACCCG GGTCTCCTAC
TCGTACAACC CGGCGAGCAG CACGCTGAGC ACGACGTACG CCTTCACCAC CACGGCCCGG
CAGGGCGGCG CGACCGGCAC GGTCACGGCC CTCTACCCGC ACCAGTGGAA CCATCTGACC
GGCTCGACCC CGCTCGCGCA GACCTACGTC TCCGCGCGCG GCCAGATGAA GATCGTCACC
GGGACGCAGT TCACGACGTC CATGAAGTAC ACCGGCGTGC TGCCCGAGGT GCCCGCCGTC
GGCGACGGCA CCGGAGCCGA CCTGGCCACG GTCACCGGCC TCCTCAACGC CGAGCTCGGC
AACCCGATGG ACAACCGGGG CGACGACACC TACTGGACCG GCAAGGGACT GGGACGCGCC
GCGCGCATCG CCGAGATCGC CGACCAGCTC AACCTGACCT CGGTGCGCGA CGCGGCCCTG
GGCGCCATCC GCACCCGCCT CAACGACTGG TTCACCGCAT CGCCGGGCAA GACCTCCCGG
GTCTTCTACC TCGACCCCGC CTGGGGGACG CTGATCGGCT ACCCGGCCTC CTACGGCTCC
GACCAGGAGC TCAACGACCA TCACTTCCAC TACGGCTACT ACGTCGCGGC CGCCGCGACC
CTGGCCAAGT ACGACCCGAA CTGGGCGAAG ACCAGCCAGT ACGGCGGCAT GGTCGACCTG
CTGATCCGCG ACGCCAACAA CTACGACCGC GGCGACACCC GCTTCCCCTA CCTGCGTGAC
TTCGACATCT ACGCCGGCCA CGACTGGGCG TCGGGCCACG GCGCGTTCGG CGCGGGCAAC
AACCAGGAGT CCTCGTCCGA GGGTATGAAC TTCGCCAACG CGCTGATCCA GTGGGGGCAG
GCCACCGGGA ACACCGCGGT CCGCGACGCC GGCGTCTACA TCTACACCAC GCAGGCGGCG
GCGATCCAGG AGTACTGGTT CGACGTGCGC GACCAGAACT TCCCGGCGGC CTTCGGTCAC
AGCACGGTCG GCATGGTCTG GGGCGACGGC GGCGCCTACG CCACCTGGTT CAGCGCCGAG
CCGGAGATGA TCCAGGGCAT CAACATGCTC CCGATCACCG GCGGCCACTT CTACCTGGGC
GACAACCCCG CCTACGTGAC CACCAACTAC AACGAGCTGA CCAGGAACAA CGGCGGGCCG
CCCACGGTGT GGCGGGACAT CCTCTGGGAG TTCCTCGCGC TCGGCAACGG AGACGCCGCC
CTGGCGAACT TCCGCGCCAA CAGCGGCCTC ACCTCCGAGG AGGGGGAGAG CAAGGCCCAC
ACCTTCCACT GGATCCGGAA CCTGGCCGCG CTGGGCACGG TGGACACCTC CGTCACCGCC
AACCACCCGC TGGCCAAGGT GTTCAGCAAG AACGGCGCCC GGACCTACGT GGCTGCCAAC
ATCACCGGCG CCGCGATCAC GGTGACCTTC TCCAACGGCA CCACGCTCAA CGTGCCGGCG
GGCAAGACGG TCACCTCGGG CGCCCACACC TGGAGCGGCG GCAACGCCGG GGGCGGCACC
GGCCCGAGCC CCCAGCCCAC CCCGACGGTC ACGCCGACCG TGGGCCCCTT CGCCGCGACG
CGCTATCCGC AGGCGGGCGG GGGCCTGCCG GGCGCGGCGG GTACGGCGGG CACCGTCACG
CTCGCCGCCG CCAACGGCAA CCACGACGGC ACGCCCGTGA ACGCGCAGAT CTTCACGGCC
ACCGGCCTGA CCGCCGCCCA CAACGGCGGG GCCACCGCGT TCGACCTCTT CGTGGACGCG
GGCACCACCG TCGCCAACGG CGTCCAGGCC CGGGTCTCCT ACGACCTGAC CGGCGACGGA
AGCTGGGACC GGGTGGAGAC TTACCGCTAC TTCGCCACCG ACCCGGTGGC CGGCTGGGAG
CACTACACCC AGGCAGCAGG CCTGCAGTCC TCCTCCGGCA CGCTCGGCAA CCTGTCCGGC
GGCCGGGTGA GGGTGGAGGT CTGGAACGCC ATCGGCGGCG GGGTGACCAC CCTCGGCACC
GGCGACCGTT CCGTGGTGCG GCTGCCGTTC GGCTGA
 
Protein sequence
MHPSIRTTGP DSAQPRERRL IHRPWARRRA LVAGVAVAMV AGMFGLGLAR SAEAATVGAG 
SYADTLPAGR SLPTGCGSIS TNPRQWVTAN APAGAVPTND WWSSILYKKT DCSYGEPLHA
HPISYDTSGT GLSFSYTTTP AISGTATGVG EFHYPYVQDI QVGVAGLNSP DVKVDGWSDW
TVTPYWSGGG RTMKATIGHG LPFGYFQITG GDARITAAGT PSVWSNSGAT IGFTVNGHDY
VAYAPTGATW AVSGTTISSS LAGRGFFSVA VLPAGGDRAA LANTYGQYAH AHVTGTRVSY
SYNPASSTLS TTYAFTTTAR QGGATGTVTA LYPHQWNHLT GSTPLAQTYV SARGQMKIVT
GTQFTTSMKY TGVLPEVPAV GDGTGADLAT VTGLLNAELG NPMDNRGDDT YWTGKGLGRA
ARIAEIADQL NLTSVRDAAL GAIRTRLNDW FTASPGKTSR VFYLDPAWGT LIGYPASYGS
DQELNDHHFH YGYYVAAAAT LAKYDPNWAK TSQYGGMVDL LIRDANNYDR GDTRFPYLRD
FDIYAGHDWA SGHGAFGAGN NQESSSEGMN FANALIQWGQ ATGNTAVRDA GVYIYTTQAA
AIQEYWFDVR DQNFPAAFGH STVGMVWGDG GAYATWFSAE PEMIQGINML PITGGHFYLG
DNPAYVTTNY NELTRNNGGP PTVWRDILWE FLALGNGDAA LANFRANSGL TSEEGESKAH
TFHWIRNLAA LGTVDTSVTA NHPLAKVFSK NGARTYVAAN ITGAAITVTF SNGTTLNVPA
GKTVTSGAHT WSGGNAGGGT GPSPQPTPTV TPTVGPFAAT RYPQAGGGLP GAAGTAGTVT
LAAANGNHDG TPVNAQIFTA TGLTAAHNGG ATAFDLFVDA GTTVANGVQA RVSYDLTGDG
SWDRVETYRY FATDPVAGWE HYTQAAGLQS SSGTLGNLSG GRVRVEVWNA IGGGVTTLGT
GDRSVVRLPF G