Gene Sros_5683 details

Gene Information

Locus tag: Sros_5683
Symbol:
ID: 8668977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6215947
End bp: 6217509
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: Beta-glucosidase-related glycosidase-like protein
Protein accession: YP_003341174
Protein GI: 271966978
COG category:
COG ID:
TIGRFAM ID:
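
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6215947
end_bp = 6217509
gene_length_bp = 1563
protein_length_aa = 520

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last being a stop.
codons = span // 3

print(span == gene_length_bp)           # span matches the reported gene length
print(span % 3 == 0)                    # span is a multiple of three
print(codons - 1 == protein_length_aa)  # codons minus the stop match the protein length
```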


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0956393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.544245
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCACC ACAGCCCCAC CCTGGCCCGG CTGGCCATGA CCGTGCTCCA GCCCGGCTTC 
GACGGCACCG CACCGCCCGA CTGGCTGCGC CGGGCGCTCT CCGAAGGGCT CGGCGGCGCC
GTGCTGTTCG CCCGGAATCT CGCCGGCCCG GCGCAGACGG CCGAGCTCGT CGCCGGCCTG
CGCCGGGAGA ACCCGGCCGT GGTGGTCGCG GTGGACGAGG AAGGCGGAGC GGTCACCCGG
CTGGAGGCCA GGACCGGCAG CTCCTGGCCG GGCAACCGGG CGCTCGGCGT GGCCGACGAC
GCCGAGCGCA CCGAGCGGGT CGCCCGCGAG ATCGGCCGCC TCCTCGCCTC GGCCGACATC
ACCCTCGACT ACGCCCCCGT GGTGGACGTC AACGCCAACC CGGCCAACCC GGTGATCGGC
ATCCGTTCCT TCGGCCCCGA CCCCGAGCTG GTGTCCCGGC AGACCACCGC CTGGATCACG
GGGTTGCAGG GCGCCGGGGT GGCCGCCTGC GCCAAGCACT TCCCCGGTCA CGGCGACACC
GTCACCGACT CCCACCACGC GCTGCCGACC GTCCACGCCG ACCTCGAGCT CCTCCAGGAG
CGCGACCTGC CTCCGTTCCG CGCGGCCGTC AAGGCCGGGG TCCAGGCCGT GATGTGCGGC
CACCTGCTGG TGCCCGCACT TGATCCCGGC AACCCCGCCA CGCTGAGCAG GCGGATCCTG
ACCGGCCTGC TCCGCGAGGA GATGGGCTTC GGCGGCATGC TGGTCACCGA CGCGATCGAG
ATGGGAGCCG TCGCCGCCCT GCACCCCCCG GGCGAGATAG CGGTCCGCGC GCTGGCCGCG
GGGGTGGACG CGATCTGCGT CGGCGTGTCC TCGCCCGGCG GGGAGAGCGT CTACGCGCTG
CGGGACGCGA TCGTGCGGGC CGTACACGAC GGCAGGCTGC CCGAGGAGCG GCTGGCCGAG
GCGGCGGGAC GCGTGCTGGC CCTGGCCGGC TGGTACGCCG AGAACGCCGC CGCGCGGGCA
CGGGACACGG AGCGGACGCG GGAGGCGCCG GACGCGGAGG CCCCGGAGGC CCCGGAGGGG
CGGGGAGGAC GGGATCCGCG AGGGGGAGGC GAGGAGCTCG GCCTGCAGGC CGCCCGCGCG
GCCATGCGCG TGACCGTCGC GGGCGATCGG ACCGCGCCTC CGCCCGTCCT CTCCCGCCCC
CCGCTGGTCG TCGACATCGC CCCGCGCCTG AACCTGGCGA TCGACCCCTC CACCCCCACC
GGCCTCGTCG GCGCCATGAC CGAGCTGCTG CCGGGCACCA CCGGGCACAC CGTCGCCGCC
GAGACCGCCG ACCTCCCCGA CCTCTCCGAC CACCGGCGCC CGCTTGTCCT GGTGGCGCAC
GACGCCCCCC GCCACGCGTG GGTCCGGGAC CTGCTGGCCC GCGCCGTCGG GCTGCGCCCC
GACGCGATCG TGATCGAGAC CGGGCTGCCC GGCGAACCCA CCGGGGCGGT GCACATCGCC
ACACACGGTA TTTCCCGGGT TTCGGCCCGC GCCGCCGCCC TGTGGCTGAC CGGCGGCCAA
TAG
 
Protein sequence
MPHHSPTLAR LAMTVLQPGF DGTAPPDWLR RALSEGLGGA VLFARNLAGP AQTAELVAGL 
RRENPAVVVA VDEEGGAVTR LEARTGSSWP GNRALGVADD AERTERVARE IGRLLASADI
TLDYAPVVDV NANPANPVIG IRSFGPDPEL VSRQTTAWIT GLQGAGVAAC AKHFPGHGDT
VTDSHHALPT VHADLELLQE RDLPPFRAAV KAGVQAVMCG HLLVPALDPG NPATLSRRIL
TGLLREEMGF GGMLVTDAIE MGAVAALHPP GEIAVRALAA GVDAICVGVS SPGGESVYAL
RDAIVRAVHD GRLPEERLAE AAGRVLALAG WYAENAAARA RDTERTREAP DAEAPEAPEG
RGGRDPRGGG EELGLQAARA AMRVTVAGDR TAPPPVLSRP PLVVDIAPRL NLAIDPSTPT
GLVGAMTELL PGTTGHTVAA ETADLPDLSD HRRPLVLVAH DAPRHAWVRD LLARAVGLRP
DAIVIETGLP GEPTGAVHIA THGISRVSAR AAALWLTGGQ
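
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own method. The helper names `clean` and `translate` are hypothetical. It assumes the amino-acid assignments of translation table 11 match the standard code for internal codons, and it does not handle alternative start codons. The example translates only the first line of the gene sequence.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = "ATGCCCCACC ACAGCCCCAC CCTGGCCCGG CTGGCCATGA CCGTGCTCCA GCCCGGCTTC"
print(translate(first_line))  # MPHHSPTLARLAMTVLQPGF
```

The printed residues correspond to the first two blocks of the protein sequence above (MPHHSPTLAR LAMTVLQPGF).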