Gene Sros_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3597 
Symbol 
ID8666885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3993044 
End bp3994885 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content70% 
IMG OID 
Productglycosyl hydrolase 
Protein accessionYP_003339273 
Protein GI271965077 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0143916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCAG CGGGGTCCGC TCGTCGCCCG GATCCGGCGG GCGCCTCTCG TTACCCGCCG 
ATCGCCGACC ACGGACTGAT CGGTGACCTG CGCAGCGTGG CGCTCGTGGA CACCGGCGGC
ACGATCGGCT GGTACTGCTG CCCGCGCTTC GACTCGCCGA GCGTGTTCGC CTCGATCCTG
GACGCCGACC GGGGCGGGTC GTTCGAGCTG GCCGCCGACG TGCCCGCGCG GACCAAGCAG
TTCTACTTCC CCGACACCAA CGTGCTGATC ACCCGGTTCT TCGCCGCCGA CGGGGTGGGC
GAGATCCAGG ACTTCATGCC GGTCGTCGCC GGCTCGGCCG AGGACGGCAG GCACCGGCTG
ATCCGGCGGG TGATGTGCGT CCGGGGGGCG CTGCCGTTCC GGGCGCGGGT GGCGCCCCGC
TTCGACTACG GCAGGCAGTC GCACGACCTG AGCATGCGGG ACGGCCGGGC GATCTTCGAG
TCGCCGTCGC TCTCCCTGTC CCTGAGCGCC AGCACGTCCA TCGAGACCGA CGGCCCGGAC
GTCTGGTCGG AGTTCAAACT CACCGAGGGC GAGTCTGCGG TCTTCGCCCT CGACAAGCTC
GGCGACGGCG TGGAACCCCG CTCGTGCCCG ATCGCCGAGG CCGAGGAGGA GTTCGCCGCC
ACGGTCGCGT ACTGGCGGCG CTGGCTGTCC GCGTCGCGCT ACCGCGGCCG CTGGCGGGAG
ATGGTGCACC GCTCCGCGCT GACGCTGAAG CTGCTCACCT ACGCGCCGAC CGGCGGGATC
GTGGCGGCGG CGACGACCAG CCTCCCCGAG CAGATCGGCG GCGAGCGCAA CTGGGACTAC
CGCTACGTCT GGATCCGCGA CTCGGCGTTC TGCGTGTACG CCCTGCTCAG GCTGGGGTTC
AGCGAGGAGG CGGCGGCGTT CATGAACTTC CTGTCCGAGC ACGTCAGCCG GGACGGCACC
GGCCCGTCGG GCCCGCTGCA GATCATGTAC GGCATCGACG GGCGCACCGA GCTGCCCGAG
GTCGAGCTGT CCCACTTCGA GGGCTACCGG GGCTCCGCCC CGGTGCGCAC CGGGAACGGC
GCCGTCCACC AGCTCCAGCT GGACATCTAC GGAGCTCTCA TCGACTCCGT CTACCTCTAC
GACAAGATGT GCCAGCCGAT CTCCAGTGAC CACTGGGAAG AGGTCCACAC GCTGGTGGAC
TGGGTCTGCG ACAACTGGGA CCAGCCCGAC GAGGGCGTCT GGGAGACGCG TGGCGGGCGC
AAGGACTTCG TCTACTCGCG GCTGATGTGC TGGGTGGCGA TCGAACGGGC CATGCGGATG
GCCGGCCACC GCGGGCTGCC CGCCGACACG CCGCGCTGGC AACGGGCCCG CGACGCGATC
TACCGGCAGA TCATGCAGCG CGGCTGGTCG GCCCGGCTGC AGGCGTTCGT CCAGCATTTC
GACGAGGACG TCCTCGACGC CTCCATCCTG ATGATGCCGC TGGCCAAGTT CGTCTCCCCC
ACCGACCCGA AGTGGCTGTC GACTCTCGAC GCCCTCGGCA GGACCCTGGT GTCCGACTCC
CTGGTGTACC GCTACGACCC CCTGACCAGC CCGGACGGGT TGCGGGGACA GGACGGCACG
TTCTCGATCT GCTCGTTCTG GTACGTCGAG GCGCTCACCC GCGCCGGCCG CCTGGACGAG
GCGCGGCTGG CCTTCGAGAA GATGCTCACC TACGCCAATC ACCTCGGCCT GTACGCCGAG
GAGATCGGGC AGACCGGTGA GCAGCTCGGC AACTTTCCGC AGGCCTTCAC CCATCTCGCG
CTGATCAGTG CCGCGTTCAA CCTCGACCGC GCCCTCGGGT AG
 
Protein sequence
MAAAGSARRP DPAGASRYPP IADHGLIGDL RSVALVDTGG TIGWYCCPRF DSPSVFASIL 
DADRGGSFEL AADVPARTKQ FYFPDTNVLI TRFFAADGVG EIQDFMPVVA GSAEDGRHRL
IRRVMCVRGA LPFRARVAPR FDYGRQSHDL SMRDGRAIFE SPSLSLSLSA STSIETDGPD
VWSEFKLTEG ESAVFALDKL GDGVEPRSCP IAEAEEEFAA TVAYWRRWLS ASRYRGRWRE
MVHRSALTLK LLTYAPTGGI VAAATTSLPE QIGGERNWDY RYVWIRDSAF CVYALLRLGF
SEEAAAFMNF LSEHVSRDGT GPSGPLQIMY GIDGRTELPE VELSHFEGYR GSAPVRTGNG
AVHQLQLDIY GALIDSVYLY DKMCQPISSD HWEEVHTLVD WVCDNWDQPD EGVWETRGGR
KDFVYSRLMC WVAIERAMRM AGHRGLPADT PRWQRARDAI YRQIMQRGWS ARLQAFVQHF
DEDVLDASIL MMPLAKFVSP TDPKWLSTLD ALGRTLVSDS LVYRYDPLTS PDGLRGQDGT
FSICSFWYVE ALTRAGRLDE ARLAFEKMLT YANHLGLYAE EIGQTGEQLG NFPQAFTHLA
LISAAFNLDR ALG