Gene Sros_3336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3336 
Symbol 
ID8666624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3643944 
End bp3646937 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content69% 
IMG OID 
ProductGlycosidase-like protein 
Protein accessionYP_003339018 
Protein GI271964822 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.310456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.29304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTCAT CCCCCCGACG CGCTCTCCGG CTGCTCACCG TCGCGGTCCT CGGCCTGGCC 
GGCGTCGCCG TCCCGCTCTC GGCGGCGCTG GCGGCCAACA GCGTGACCGT CTACTACGCC
ACCGCCTGGA CCGGCGCCAA CATCCACTAC CAGCCCACCG GCGGAAGCTG GACCCCCGTC
CCCGGCGTCG CCATGGACGA GACCGCCTGC GCGGGCTGGA AGAAGAAGAC CGTCGACCTC
GGCACCGCCA CCGGGCTCAA GGCCGCCTTC AACAACGGCT CCGGCACCTG GGACAACAAC
GGCGGCCGCG ACTACACCAT CGGCGCCGGG GCCGTGAAGG TCTCCGGCGG CCAGGTCACC
GCCGGCGACC CCTGCCCCGG CGGTGGACCG ACCGAGCCGC CCGGCAAGCA GGCCGTCGTC
TACTTCCACA AGAAGACCAG GGGCTGGAGC GCCGCCAACA TCCACTACCA GCCCACCGGC
GGGACCTGGA CCCCCGTCCC CGGCGTCGCC ATGGACGAGA CCGCCTGCGC CGACTGGGCC
AGGAAGACCG TCGACCTCGG CACCGCCACC GGCCTCAAGG CCGCCTTCAA CAACGGCTCC
GGCACCTGGG ACAACAACAA CGGCGCCGAC TACGCCATCG GCACCGGACT GACCACCGTC
AAGGACGGGG CGGTGCGCGC GAACGCCACC GAGCCGTGCA CCCCCGAACC CCCGGACACC
ACCGCCCCCT CCGTCCCCAC CGGGCTGACC GCCACGGCGG AGGGCACCAC CGTCAAGCTG
AGCTGGAACG AGGCCGGCGA CGACATCGCG GTGACCGGCT ACGAGATCAC CCGCAGGAAG
GGCTCCGAGG CCCCGGTGAC GCTCACAACG GGCACCAACA CCGTCTACAC CGACGCCCGC
CTGGAGGAGC AGACCACCTA CGCCTACACG GTCAGGGCCC GCGACGCGGC GGGCAACCGG
TCCGGGGCGA GCCCCGAGGC GGCGGCGAAG ACCGGCGACG CCCCGCCAGG ACCGGCCAAG
GGCTCGCCAC TGGGCGGCGA CCCGCGCGAG GACTCCATCT ACTTCGTGAT GACCGCCCGC
TTCAACGACG GCGACACCTC CAACAACCGC GGCGGCGGCC AGCACACCGT CTCCGGCAAC
GCCAGGAACA ACGACCCGAT GTTCCGCGGC GACTTCAAGG GCCTGATCGA CAAGCTCGAC
TACATCAAGG GCCTCGGCTT CTCCGCTCTG TGGATCACCC CGGTCGTGCT CAACCGCTCC
GACTACGACT TCCACGGCTA CCACGGCTGG GACTTCCACC GTGTCGACAC GCGGCTGGAG
ACCCCCGGCG CGACCTACCA GGACCTGATC GACAAGGCGC ACGCCAAGGG CATCAAGATC
TTCCAGGACG TGGTCTACAA CCACAGCTCC CGCTGGGGAG CCAAGGGGTT GTACGTCCCC
CCGGTGTGGG GCGCGCGTGA CGAGCAGTGG AAGTGGCTCT ACAGCGAGAA GGTCCCGGGC
AAGGAGTACG ACCCGCTGGA GGAGCACCAG GGCGACGACC CGGAGCTGAC GGCCAGGCAG
AACGAGATGG CCAAGGGCAG GCCGTACAAC GGCGACCTGT GGTCCACCGC GGCCCCGGCG
GGCAACACCT GCAGGGACTA CGGCACGCCC ACCCAGTGGA AGAGCCCCGA GGGCTTCACC
ATCCACAACT GCCAGTGGCC CAGCCCTACC TCGGGCATGT TCCCCGCGGA GTCCTATCAC
CAGTGCTGGA TCGGCAACTG GGAGGGCTCG GACTCCAAGA GCTGCTGGCT CCACGACGAC
CTGGCCGACC TCAACACCGA GAACAAGGCG GTGCAGGACT ACCTGATCGA CGCCTACAAC
AAGTACATCG ACATGGGCGT CGACGGCTTC CGCGTCGACA CCGCCGTGCA CGTCTCGCGG
CTGGTGTGGA ACCGCCGCTT CCTGCCGGCC CTGCAGCAGC ACGCCGTGGC CACGCACGGG
GACAAGGGCA AGGACTTCTA CGTCTTCGGC GAGGTCGGGG CGTTCGTCAA CGACAAGTGG
AACCGCGGCT CGGTCAACCA CTCCGCGCAG TTCTACACCT GGAAGGAGCG CAGGGAGTTC
AACCCCGACG ACGCCAGGGC GGTCGTCGAG CAGTTCGACT ACGAGAACCT GATGGGCACC
GGCAACCAGC CCACCACCGA CAACGCCTTC CTGCGCGGCA ACGCCTACCA CGCGCCCGAC
CACTCGAAGT TCTCCGGCAT GAACGTCATC GACATGCGCA TGCACATGAA CTTCGGCGAC
GCGCCCAACG CCTTCAACAA CGGCAAGGAC TCCGACGACA GCTACAACGA CGCCACCTAC
AACACCGTCT ACGTCGACTC CCACGACTAC GGCCCCAACA AGTCCAACGA GCGCTACACC
GGCGGGACCG ACGCGTGGGC GGAGAACATG TCGCTGATGT GGACCTTCCG CGGCATCCCG
ACGCTCTACT ACGGCTCGGA GATCGAGTTC CAGAAGGGTC AGAGGATCGA CTGCGGGCCC
ACCTGCCCGC TGGCGACCAC CGGCCGCGCC TACTACGGCG ACCACCTCGC GGGCAGCGTC
ACCGCCTCCG GCTTCGGCGT CGTCTCCTCG GCCACCGGCG CGGTGGCGCA GACGCTGGAC
AAGCCGCTGG TCAAGCACGT GCAGCGGCTC AACCAGATCC GCCGGGCCAT CCCGGCACTG
CAGAAGGGGC AGTACTCGAC CGAGGGCGTC ACGGGCCAGA TGGCCTACAA GCGCCGCTTC
ACCCAGGGGG CGGTGGACAG CTTCGTGCTG GTCTCGGTCT CCGGGGGCGC CACCTTCACC
GGCATCCCGA ACGGCACCTA CGTCGACGCG GTCACCGGTG ACAGCAAGGC CGTGACCGGC
GGGACGCTCA CGATCGCCGG CGGCGGCAAA GGCAACCTGC GGGCCTACGT CCTGTCACTG
CCCGGCAACC CGGCCCCCGG GAAGATCGGT ACGGCGGGGC CGTACCTGAG GTAG
 
Protein sequence
MRSSPRRALR LLTVAVLGLA GVAVPLSAAL AANSVTVYYA TAWTGANIHY QPTGGSWTPV 
PGVAMDETAC AGWKKKTVDL GTATGLKAAF NNGSGTWDNN GGRDYTIGAG AVKVSGGQVT
AGDPCPGGGP TEPPGKQAVV YFHKKTRGWS AANIHYQPTG GTWTPVPGVA MDETACADWA
RKTVDLGTAT GLKAAFNNGS GTWDNNNGAD YAIGTGLTTV KDGAVRANAT EPCTPEPPDT
TAPSVPTGLT ATAEGTTVKL SWNEAGDDIA VTGYEITRRK GSEAPVTLTT GTNTVYTDAR
LEEQTTYAYT VRARDAAGNR SGASPEAAAK TGDAPPGPAK GSPLGGDPRE DSIYFVMTAR
FNDGDTSNNR GGGQHTVSGN ARNNDPMFRG DFKGLIDKLD YIKGLGFSAL WITPVVLNRS
DYDFHGYHGW DFHRVDTRLE TPGATYQDLI DKAHAKGIKI FQDVVYNHSS RWGAKGLYVP
PVWGARDEQW KWLYSEKVPG KEYDPLEEHQ GDDPELTARQ NEMAKGRPYN GDLWSTAAPA
GNTCRDYGTP TQWKSPEGFT IHNCQWPSPT SGMFPAESYH QCWIGNWEGS DSKSCWLHDD
LADLNTENKA VQDYLIDAYN KYIDMGVDGF RVDTAVHVSR LVWNRRFLPA LQQHAVATHG
DKGKDFYVFG EVGAFVNDKW NRGSVNHSAQ FYTWKERREF NPDDARAVVE QFDYENLMGT
GNQPTTDNAF LRGNAYHAPD HSKFSGMNVI DMRMHMNFGD APNAFNNGKD SDDSYNDATY
NTVYVDSHDY GPNKSNERYT GGTDAWAENM SLMWTFRGIP TLYYGSEIEF QKGQRIDCGP
TCPLATTGRA YYGDHLAGSV TASGFGVVSS ATGAVAQTLD KPLVKHVQRL NQIRRAIPAL
QKGQYSTEGV TGQMAYKRRF TQGAVDSFVL VSVSGGATFT GIPNGTYVDA VTGDSKAVTG
GTLTIAGGGK GNLRAYVLSL PGNPAPGKIG TAGPYLR