Gene Sros_3305 details


Gene Information

Locus tag: Sros_3305
Symbol:
ID: 8666593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3599515
End bp: 3600939
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Beta-glucosidase
Protein accession: YP_003338987
Protein GI: 271964791
COG category:
COG ID:
TIGRFAM ID:
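
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3599515
end_bp = 3600939
gene_length_bp = 1425
protein_length_aa = 474

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not appear in the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1425 475 474
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.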


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.40854
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGC AAGAGACGCG GATTCAGACC CCGGATCTGG TGTTCCCGAC AGGCTTCGTC 
TGGGGCGCGG CCACCTCCGC CTACCAGATC GAAGGCGCGG TCTCCGAGGA CGGCCGCGGC
CGATCCATCT GGGACACCTT CGTCCAGCAG CCCGGCCGGG TGGTCAACGG CGAGAACGCC
GACGTCGCCA TCGACCACTA CCACCGTTAC CGCGACGACG TCCGGATGAT GGCCGACCTC
GGCCTGGGCG CCTACCGGTT CTCCGTCTCC TGGCCCCGGA TCCAGCCCGA CGGCAGCGGC
GCGATCAACT CCAAGGGCCT CGACTTCTAC AGCCGGCTGG TCGACGAGCT GCTGGCGAGC
GGCGTCGACC CGTGGGTGAC GCTCTATCAC TGGGACCTGC CGCAGGCCCT GGAGGACGCG
GGCGGCTGGC CGTCACGGGA AACGTCGAAG CGCTTCGCCG ACTACGCGGC GGCCGTACAC
GACGCGCTCG GCGACCGGGT CCGCAACTGG AGCACGATCA ACGAGCCGTG GTGCGCGGCG
TTCCTGGGAT ACGCCTCCGG TGAGCACGCC CCCGGGCGGC GCGAGCCGGC GCAGGCGGTG
CGCGCCGCCC ACCACCTCCT CCTCGCGCAC GGCCTGGCCA CCTCGGCCAT GCGCGCCCAG
CGGGCCGACA GCAGGATCGG CGGCAGCGTC AACCTCTACG CGATCTCGCC GCAGACCGGC
TCCGAGGCCG ACCAGGACGC CGCCCGCCGC ATCGACGGCC TGCAGAACCG CTTCTTCCTG
GACGCGCTGC TGAAGGGCGA GTACCCGGCC GACGTCCTGG AGGACCTCGC CGAGGTGCCT
GGGTTCGTCC AGGACGGGGA CATGAAGGTC ATCTCCGCCC CGCTGGACAT GCTGCTGATC
AACTACTACA GCCGCTTCAC CGTCTCGGGC ACCCCCGGCG GCGCGGCGTC GGCCGCGGCG
GCCCCCACCG GCACCGGGTC GCCGTGGGTC GGCAGCGAGG ACGTGTCGTT CGTCGAGGGC
GGGCGGCCGG TCACCGCGAT GGGCTGGGAG ATCGACGACA GCGGGCTGCA CGAGATCCTG
CTGCGGCTGG CCCGGGAGTA CCCGCGGATC CCGCTGGTCA TCTCCGAGAA CGGCGCGGCC
TTCGACGACG TCGTGGGCGC CGACGGCGTC GTGCACGACC ACGATCGCCT GAACTACATC
GACGCCCACC TGCGCACCTG CCACGCCGCG ATCGAGGCCG GGGTGCCGCT GGAGGGCTAC
TTCGCCTGGT CGCTGATGGA CAACTTCGAG TGGGCGTGGG GGTACGGCAA GCGTTTCGGG
CTGGTGCGCG TCGACTACGA GTCGCAGCTG AGAGTTCCCA AAGAGAGCGC TCTCTGGTAT
GCCGGGACAA TCAGGCGTGG AGGCCTGAGC GGTCCGGCAG AATAA
 
Protein sequence
MTTQETRIQT PDLVFPTGFV WGAATSAYQI EGAVSEDGRG RSIWDTFVQQ PGRVVNGENA 
DVAIDHYHRY RDDVRMMADL GLGAYRFSVS WPRIQPDGSG AINSKGLDFY SRLVDELLAS
GVDPWVTLYH WDLPQALEDA GGWPSRETSK RFADYAAAVH DALGDRVRNW STINEPWCAA
FLGYASGEHA PGRREPAQAV RAAHHLLLAH GLATSAMRAQ RADSRIGGSV NLYAISPQTG
SEADQDAARR IDGLQNRFFL DALLKGEYPA DVLEDLAEVP GFVQDGDMKV ISAPLDMLLI
NYYSRFTVSG TPGGAASAAA APTGTGSPWV GSEDVSFVEG GRPVTAMGWE IDDSGLHEIL
LRLAREYPRI PLVISENGAA FDDVVGADGV VHDHDRLNYI DAHLRTCHAA IEAGVPLEGY
FAWSLMDNFE WAWGYGKRFG LVRVDYESQL RVPKESALWY AGTIRRGGLS GPAE
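
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs only in the start codons it allows). The helper name `translate` and the choice to translate only the first and last lines of the gene sequence are illustrative, not part of the record.

```python
# Codon table in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence. Each full line holds 60 bases,
# so every line starts in frame.
first_line = "ATGACCACGC AAGAGACGCG GATTCAGACC CCGGATCTGG TGTTCCCGAC AGGCTTCGTC"
last_line = "GCCGGGACAA TCAGGCGTGG AGGCCTGAGC GGTCCGGCAG AATAA"

print(translate(first_line))  # MTTQETRIQTPDLVFPTGFV
print(translate(last_line))   # AGTIRRGGLSGPAE
```

The two outputs correspond to the first 20 and the last 14 residues of the protein sequence listed above. The final TAA codon is the stop codon and is not translated.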