Gene Sros_2061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2061 
Symbol 
ID8665343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2214756 
End bp2216267 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content75% 
IMG OID 
ProductBeta-glucosidase-related glycosidase-like protein 
Protein accessionYP_003337789 
Protein GI271963593 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCG GCCCAGAGCT GCTTCGTCTG GCGGACACGG TGATCCTTCC GGGGTTCGAG 
GGCAAGACCC CGCCCGACTG GCTCCGTCGG CGTCTCGCCG GCGGTCTGAC CGGGGTGGTG
CTCTTCTCCC GGAACATCGC GACCCCGTCC CAGGTCGCCG ACCTGACGGC CGCGCTCCGC
GCGGAGAACC CGCAGGTCCT CGTCGGCATC GACGAGGAGT CGGGGGAGGT CACGCGCCTG
GAGGCGGCGG CCGGGAGCAC CCGGCCCGGC AGCTTCGCTC TCGGCGTCGT CGACGACCTC
GACCTCACCG AGGAGATCGC CAGGGACCTC GGCCGCGACC TCGCCGCGGC GGGCGTCAAC
CTCGACTTCG CCCCCTCGGC GGACGTCAAC TCCAACCCGG ACAATCCGGT CATCGGCCTG
CGTTCCTTCG GCGCCGACCC CGACCTGGTC TCCCGGCACA CCGCCGCCTG GGTCCGGGGC
ATGCAGTCCT CCGGCGTCGC CGCCTGCGCC AAGCACTTCC CCGGCCACGG TGACACCTCC
GTCGACTCCC ACCACGGCGT CCCGCTGGTC GCGGCCACCG CCGAGGAGCT CCACGAGGTG
GCGCTGCGGC CCTTCCGCGC GGCCATCGCC GAGGGCGTGC GCACGGTCAT GACCGGTCAC
CTGCTGGTCC CCGCCTTCGA CGCCGCGATG CCCGCCACGC TCAGCGGCCG GGTGCTGCAC
GACCTGCTCC GGGTCGAGCT GGGCTTCGAC GGCGTCATCG TCACCGACGG CATCGAGATG
GCGGCCGTCT CCGGCACCTA CGGCATCGGC GGCGCCTCGG CGCGGGCCAT CGCCGGGGGC
GCCGACGCGA TCTGCGTGGG CGGCGAGCAC GCCGACGAGC ACACCGCCAT CGCGGTCCGC
GACGCCATCG TGGACGCCGT GATCGAGGGG TGGCTCCCCG AGGAGCGGCT GGCCGACGCG
GCCCGCCGGG TCTGTGAGCT GGCCCTGTGG GGGGCGTCCA CGGGGCACGT CCGCCGCTCC
GCGCCCGCCC GCCCCGGTGA CGTCCCGATC GGCCTGGTCG CCGCCCGGCG CGCCCTGCGG
ATCACCCGCC GCTCCGCGTC GGCGGTCCTC CCGCTGCCCG CTCCGCCCCA CGTGGTCGAG
CTCGCCCCGG AGATGAACCT CGCCATCGAG AAGGACACCC CCTGGGGCGT CGGAGAGCCG
CTCGGCAAGC TCCTCCCCGG CACCACGGTG ACCCGCCTGA GCGCCTCGAC CGCCACCGGG
GGCGCGATCG AGTCCGCGCT CGCCTCCGCG GTGGACCGCC CCCTGGTCCT CGTCGTCCGC
GACGCCCACC GCCACCCCTG GCAGACGGAC GCCCTGAACC ACCTCCTGGC CGCCCGGCCG
GACACCGTCG TGGTCGAGAT GGGCCTGCCC GGCCGCTCCG ACCTCGGCGC CGTCCACATC
GCCACCCACG GGTCGGCCAA GGTCTGCGGC CAGGCCGCCG CCGAGATCCT CGCCGGGGAC
CTCCGGCTCT GA
 
Protein sequence
MKRGPELLRL ADTVILPGFE GKTPPDWLRR RLAGGLTGVV LFSRNIATPS QVADLTAALR 
AENPQVLVGI DEESGEVTRL EAAAGSTRPG SFALGVVDDL DLTEEIARDL GRDLAAAGVN
LDFAPSADVN SNPDNPVIGL RSFGADPDLV SRHTAAWVRG MQSSGVAACA KHFPGHGDTS
VDSHHGVPLV AATAEELHEV ALRPFRAAIA EGVRTVMTGH LLVPAFDAAM PATLSGRVLH
DLLRVELGFD GVIVTDGIEM AAVSGTYGIG GASARAIAGG ADAICVGGEH ADEHTAIAVR
DAIVDAVIEG WLPEERLADA ARRVCELALW GASTGHVRRS APARPGDVPI GLVAARRALR
ITRRSASAVL PLPAPPHVVE LAPEMNLAIE KDTPWGVGEP LGKLLPGTTV TRLSASTATG
GAIESALASA VDRPLVLVVR DAHRHPWQTD ALNHLLAARP DTVVVEMGLP GRSDLGAVHI
ATHGSAKVCG QAAAEILAGD LRL