Gene Sros_1603 details

Gene Information

Locus tag: Sros_1603
Symbol:
ID: 8664880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1708321
End bp: 1710030
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: 4-alpha-D-((1->4)-alpha-D-glucano)trehalose trehalohydrolase
Protein accession: YP_003337338
Protein GI: 271963142
COG category:
COG ID:
TIGRFAM ID:
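
The reported gene and protein lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 1708321
END_BP = 1710030


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1710 569
```

Both values agree with the Gene Length and Protein Length fields in the record.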


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGAGG TCTGGGCGCC GCAGGCGACG GCGGTCGATG TGGAGATCGG GGAGATCCGC 
CATCCGATGG CCGCCGGGAC GGGCGGCTGG TGGTCGGCGG AGGTGCCGGG GGCAGGTCAC
GGGACCGACT ACAGGTTCCG GGTGGACGGC GGCGAGCCGC TGCCGGACCC CCGGACCCGG
TGGCAGCCCG AGGGGATCTT CGGGCCGAGC CGGGTCTACG AGCACGACCG GTACGTCTGG
GGTGACGGGC TGTGGCGCGG GCGCGACCTG CCGGGGTCGG TGATCTACGA GCTGCACGTC
GGCACCTTCA CCCCCGCCGG GACCTTCGAC GCGGCGATCG GGAAGCTTGA GCACCTCGCC
GGGCTCGGCG TCGACTTCGT CGAGGTCATG CCGGTGCCGC CGGTGCCGGG GGAGCGCAAC
TGGGGCTACG ACGGGGTGGA CCTGTGGGCC GTCACCGAGA ACTACGACGG CCCCGACGGT
CTCAAGCGCT TCGTGGACGC CTGCCACCGG CAGGGCATCG GCGTCATCCT CGACGTCGTC
TACAACCATC TCGGCCCGTC GGGGAACTTC CTGGCCCCGT TCGGCCCCTA CTTCCACTCC
AGCGCCTCCT CCTTCTGGGG GCAGGCGGTC AACCTGGACG GCCCCGGCTC CGACGAGGTG
CGCCGCTACT TCATCGGCAA CGCGCTGCAG TGGCTGCGCG ACTACCACAT CGACGGCCTG
CGGCTGGACG CCGTGCACGC GCTGCACGAC AGGCGGGCCG TGCACCTGCT GGAGGAGATG
GCCGCGGAGG TGGAGGCGCT GTCGGCGGCC GTGGGCAGGC CGCTGACGCT CATCGCCGAG
TCCGACCTCA ACGACCCCCG GCTGGTGACC CCGCGCGAGG CCGGCGGGTA CGGCCTGGCC
GCGGCCTGGA ACGACGACGT CCACCACGCC CTGCACGTGG CGGTGACCGG GGAGCGGCAC
GGCTACTACG ACGACTTCGC CGGAGCGCTG CCCAAGGTCC TCGCCTCGGC CTACTACCAC
GACGGCACCT ACTCGGCCTT CCGGGGGCGC TCCCACGGCC GTCCGGCCCG TCACGTGCCC
GGCTACCGGT TCGTCTGCGC CGCGCAGAAC CACGACCAGA TCGGCAACCG TGCTGAGGGC
GACCGGATGG CGCCGGAGGC GCTGCGGCTG GCCGCCGGGC TGCTGCTCAC CTCGCCGTTC
ACCCCCATGC TGTTCATGGG GGAGGAGTGG GGAGCGCGCA CGCCGTTCCT GTTCTTCACC
GACCACGTCG AGCCGCAGCT CCGCGAGGGC GAGGCGGACC GGCGGCGGCG GGAGTTCGTC
GGCTTCGGCT ACGACGACTG GGCCGAGAAG GCGCCGGACC CGGGGGAGGA GCTGACGTTC
CTGCGCTCCA AGCTCGACTG GAGCGAGCTC GACGACGACG CCCACCGGGT CCATCTCGAC
TGGTACCGGG CCCTGATCGC GCTGCGCCGC GCGCATCCGG ACCTGTCGGA CCCGAGGCTC
GACCGGGTCC GGGCCGAGCA CGACGGCTCC TGGCTGGTGG TCCACCGGGG GGCGTTCCGG
GTGGCCGTCA ACTTCGGCGC CACGCCGGTC TCCCTCGACC TCACGGCCCC GGCCCAGGTC
GTGCTCGCCT CCGACCCCGG CGTCCACCTG GACTCCGGCC TCACCCTCCC GGCCCGTTCC
CTGGCGGTGC TCCGCCTCGC CGGGACCTGA
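
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. A minimal sketch of that cleanup, together with a GC-fraction helper like the one that would reproduce a GC content figure, is given below. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = (
    "ATGTTCGAGG TCTGGGCGCC GCAGGCGACG "
    "GCGGTCGATG TGGAGATCGG GGAGATCCGC"
)
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.0f}% GC")
```

Passing the full 1710 bp listing through `clean_sequence` and `gc_fraction` would give the value to compare against the GC content field.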
 
Protein sequence
MFEVWAPQAT AVDVEIGEIR HPMAAGTGGW WSAEVPGAGH GTDYRFRVDG GEPLPDPRTR 
WQPEGIFGPS RVYEHDRYVW GDGLWRGRDL PGSVIYELHV GTFTPAGTFD AAIGKLEHLA
GLGVDFVEVM PVPPVPGERN WGYDGVDLWA VTENYDGPDG LKRFVDACHR QGIGVILDVV
YNHLGPSGNF LAPFGPYFHS SASSFWGQAV NLDGPGSDEV RRYFIGNALQ WLRDYHIDGL
RLDAVHALHD RRAVHLLEEM AAEVEALSAA VGRPLTLIAE SDLNDPRLVT PREAGGYGLA
AAWNDDVHHA LHVAVTGERH GYYDDFAGAL PKVLASAYYH DGTYSAFRGR SHGRPARHVP
GYRFVCAAQN HDQIGNRAEG DRMAPEALRL AAGLLLTSPF TPMLFMGEEW GARTPFLFFT
DHVEPQLREG EADRRRREFV GFGYDDWAEK APDPGEELTF LRSKLDWSEL DDDAHRVHLD
WYRALIALRR AHPDLSDPRL DRVRAEHDGS WLVVHRGAFR VAVNFGATPV SLDLTAPAQV
VLASDPGVHL DSGLTLPARS LAVLRLAGT
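
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons it permits, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = (
    "ATGTTCGAGG TCTGGGCGCC GCAGGCGACG "
    "GCGGTCGATG TGGAGATCGG GGAGATCCGC"
)
last_line = "CTGGCGGTGC TCCGCCTCGC CGGGACCTGA"

print(translate(first_line))  # MFEVWAPQATAVDVEIGEIR
print(translate(last_line))   # LAVLRLAGT
```

The two outputs match the first 20 and the last 9 residues of the protein sequence listed above.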