Gene Sros_9017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9017 
Symbol 
ID8672359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9961421 
End bp9963340 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content69% 
IMG OID 
Productglycoside hydrolase 
Protein accessionYP_003344391 
Protein GI271970195 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGG ATCCACCGGC GGGCGTGCTC TGCGACGCCT CACCGTTCCC GCCCATCGCG 
GACTACGGCT TCCTGTCCGA CTGCGAGGTC ACCGCTCTGG TCGCGCCGAG CGGAAACGTG
GAGTGGCTGT GCCTGCCCCG GATGGACTCT CCCAGCGTGT TCGCCGCCAT CCTGGACCGG
GACGCCGGAG GGTTCCGGAT CGGTCCCGCG GACCTCAGGG TGCCGGCCGC GCGCCGCTAC
CTGCCGGGCA CCATGGTGCT GGAGACGAGC TGGGGCACGC CGACCGGCTG GATCATCGTC
CGTGACCTGC TGCTCATAGG GCCCTGGCAT CACGACGAGG AGCTGTCGAA CACGCACAGG
CGGGCCCCGA CCGACTACGA CGCCAGCCAC GTGCTGCTGC GCACGGTGCG CTGCGTCAGC
GGGGAGGTGC AGATCGCCCT CGACTGCGAG CCGGTGTTCG ACTACGGGCG CAGGCCGGCC
CACTGGATCT ACACGGACCG GGGCTACCAC CAGGGCGCCG CCCGGGGAGA CGGGGTGAAT
CTGGAGCTCA AGCTCACCAC CGACATGCGA CTGGGGTTCG AGGGGTCGCG GGCGACCGCG
CGCACCCTGA TGAGAGAGGG GGACACCCGC TTCTGCGCGC TGTCGTGGAC CGTTCACGAG
CCGCCCTACA CCTTCGCGGA GGCATACGGA CGGCTGGTGT GGACCGCCCA CCACTGGCAG
CACTGGCTTG CCAGAGGCGA CTTCCCCGAC CACCCCTGGC GCAGCTTCCT GGAGCGCAGC
GCCCTCACCC TCAAGGGCCT GACCTTCGCC CCGACCGGAG CGCTGATCGC GGCGGCGACC
ACCTCCCTGC CCGAGACACC GGGCGGGGAC CGGAACTGGG ACTACCGCTA CAGCTGGATC
CGCGACTCGA CCTTCGCCCT CTGGGGCCTC TACACCCTCG GATTCGACTG GGAGGCCGAC
GACTTCTTCT GGTTCATCGC CGACGTCGCG GAGAGGGACG AGGAGCTTCA GGTCATGTAC
GGCGTCGACG GCGAGCGTGA CCTGGAGGAG CACATCCTCG ACTATCTGGA CGGCTACGAG
GGGGCCAGAC CCGTCCGGGT CGGCAACGCC GCCTACAAGC ACCACCAGCA CGACGTCTGG
GGGGCGGTCC TCGACTCCTT CTACATCCAC ACCCGGTCCC GGGACCGCCT GGACGAGCGG
ATCTGGCCCA TCCTGAAGCG CCAGGTCGAC GCCGCCATCA AACACTGGCG GGAGCCCGAC
CGCGGTATAT GGGAGGTGCG CGGCGAGCCC AAGCACTTCA CCTCGTCGAA GGTCATGTGC
TGGGTGGCGG CCGACCGGGG CGCGCGGCTC GCCCGGCTCC GCCAGGACTT CCACCTGGCC
GCGCGCTGGC AGGCGTCCGC CGACGAGATC CACGCCGACG TCTGCGCCAA CGGAGTCACC
GAGCGGGGGG TCTTCAGGCA GCACTACGGC ACCGATGCCC TCGACGCGTC GCTCCTGCTC
CTCCCGCTGG TGCACTTCCT GCCGCCCACC GACCCCAGGA TCCGCGACAC CGTCCTGGCC
ATCGCCGACG AGCTGACCGT CGACGACCTG GTCATGCGCT ACCTGCCGAA GGAGACCGAC
GACGGCCAGG CCGGGGACGA GGGGACCTTC ACGATCTGCT CGTTCTGGCT CGTGTCGGCG
TTCACGGAGA TCAGGGAGGA GAGCCGTGCC CGCAGGCTCT GCGAGAAGGT CCTCTCCTTC
GCCAGCCCCC TCGGTCTCTA CGCCGAGGAG ATCGACCCCG TCACCGGACG CCACCTGGGG
AACTTCCCCC AGGCCTTCAC CCATCTCGCC CTCATCAACG CGGTCATCCA CATCATCCGG
GCCGAACGCG GCTCGCTGAC CTCGAAGACA TGGCGAACGC CCGCGACGGG ATCACTGTGA
 
Protein sequence
MTLDPPAGVL CDASPFPPIA DYGFLSDCEV TALVAPSGNV EWLCLPRMDS PSVFAAILDR 
DAGGFRIGPA DLRVPAARRY LPGTMVLETS WGTPTGWIIV RDLLLIGPWH HDEELSNTHR
RAPTDYDASH VLLRTVRCVS GEVQIALDCE PVFDYGRRPA HWIYTDRGYH QGAARGDGVN
LELKLTTDMR LGFEGSRATA RTLMREGDTR FCALSWTVHE PPYTFAEAYG RLVWTAHHWQ
HWLARGDFPD HPWRSFLERS ALTLKGLTFA PTGALIAAAT TSLPETPGGD RNWDYRYSWI
RDSTFALWGL YTLGFDWEAD DFFWFIADVA ERDEELQVMY GVDGERDLEE HILDYLDGYE
GARPVRVGNA AYKHHQHDVW GAVLDSFYIH TRSRDRLDER IWPILKRQVD AAIKHWREPD
RGIWEVRGEP KHFTSSKVMC WVAADRGARL ARLRQDFHLA ARWQASADEI HADVCANGVT
ERGVFRQHYG TDALDASLLL LPLVHFLPPT DPRIRDTVLA IADELTVDDL VMRYLPKETD
DGQAGDEGTF TICSFWLVSA FTEIREESRA RRLCEKVLSF ASPLGLYAEE IDPVTGRHLG
NFPQAFTHLA LINAVIHIIR AERGSLTSKT WRTPATGSL