Gene Sros_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3337 
Symbol 
ID8666625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3647063 
End bp3649102 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content69% 
IMG OID 
ProductGlycosidase-like protein 
Protein accessionYP_003339019 
Protein GI271964823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.543706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.14646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATCCT ACAAAGTCAT TGTCACCACA GCGTTGAGCG CCCTGGCAAG CGCCGTCGTG 
GTGGTGCCCG CGGTGCTCAG CGCCCCCGCC CAGGCGGCAC CGCCCACGCC CGGCCCGATC
GTCAAGCTGT GGAACTGGAA CTGGAACTCG GTCGCGCGCG AGTGCACCGA CGTGCTCGGC
CCCGCCGGGT ACGGCGCGGT CCAGGTCTCC CCGCCCAGCG ACTCGATGAG CAAGGGCGGC
TCGGTCTGGT GGGACATCTA CCAGCCCGCC CGCTACACCC TGAACAGCAA GTTCGGCGAC
GAGAACGCGT TCAGGAACAT GGTCAGGGCC TGCCACGACG CGGGCGTCAA GGTCCACGCC
GACGCGGTCG TCAACCACAT GACCGGCCAG GGCAGCAGGA GCTACGGCGG CTACGACTTC
GGCAAGTACT CCTACCCCGG GCTCTACTCC TCCTGGGACT TCCACTCCCC CACCTGCTCG
ATCAACGACG GCGACTACGT CAACGACGGC TGGCGGGTGC AGAACTGCGA GCTGGTCGGC
CTGTCCGACC TCAACACCGG CTCCGAGTAC GTCCGCAACC AGATCGCCGG CTACCTCAAC
AAGCTGACCG ACCTCGGCGT GGACGGCTTC CGCATCGACG CCGCCAAGCA CATCGCCCCC
GGCGACCTGG CGGCGATCAA GTCCAAGCTG AGGGGCTCGC CGTACATCCA CCAGGAGGTC
ATCGAGGGCC CGAACGAGGC GGTCAAGCCG GGCCACTACA CCGGGATCGG CGACGTCCAC
GAGTTCGTGT ACGGCCGCAA GATGAAGGAG CAGTTCACCG GCCAGATCAA GTGGCTGCAG
AGCTTCGGCC AGAGCTGGGG CCTGTCGGTG CCCAGCGACA AGGCCGTGGT TTTCGTCGAC
AACCACGACA CCGAGCGCAA CGGCTCCACC CTCAACTACA AGTACGGCGA CGCCTACAAG
CTGGCCAACG TCTTCATGCT GGCGTGGCCG TACGGCACGC CGCGCGTCTA CTCCGGCTTC
ACCTGGAACA ACGGGGAGGG CGGCCCCCCG TCGGCCAACG GCGGCTTCGT CACCGACGCC
GACTGCGGCA ACGGGCAGTG GACCTGCTTC CACCGCCAGA TGAGCGGGAT GGTCGGCTTC
AACCGGGCCG TCGCGGGCAC CCCGGTCGGC AACTGGTACG ACAACGGCAA CAACGTGATC
GCCTTCAGCC GCGGCGGCAA GGGCTGGGTC GCGATCAACA ACGAGGGCGG CTCCGTCACC
CGGACCTTCA GCACCGGCCT GCCCGCCGGG ACCTACGCCG ACGACCTGGG CGGCGGCTCG
GTGACGGTCG GCTCGAACGG CACCGCCTCG GTGACCGTCC CCGCCAAGGG CGCGGTCGCC
ATCCACACCG GTGGCGTCAG GCCCACCGAC CCGCCGGCCG GTGACGCCAC CGTCTACTAC
GCCACCACCT GGACCGCCGC CAACATCCAC TACCAGCCCA CCGGCGGGAC CTGGACCCCC
GTCCCCGGCG TCGCCATGGA CGAGACCGCC TGCGCCGGCT GGAAGAAGAA GACCGTCGAC
CTCGGCACCG CCACCGGGCT CAAGGCCGCC TTCAACAACG GCTCCGGCAC CTGGGACAAC
AACGGCGACC GCGACTACAC CATCGGCAGG GGCGTCAGCA CGGTCAAGGG CGGCGTCGTG
ACCGCGGGCG CCACCAGCCC CTGCGGCGAC GGTCCCGGCC CCAGCCCGAC CCCGACGGTG
AACCCGGGCG AGGTCGCCGC CTCCTTCAAC GCGAACGTGA CGACCTCCTA CGGCCAGAAC
GTCTTCGTCG TCGGCGACGT CGAGGAGCTG GGCGGCTGGG ACCCCGCCAA GGCCGTGGCG
CTGTCGCCGG CCGGCTACCC CGTCTGGAAG GCCACGGTGA GCCTGCCGGC CGGCACGGCG
GTCTCCTACA AGTACGTCAA GAAGAACCCC GACGGCTCGG TGACGTGGGA GAGCGACCCC
AACCGCTCCT TCACCACCCC GTCCGGCGGC GCCGTGACGC GCGACGACAC CTGGCGCTGA
 
Protein sequence
MRSYKVIVTT ALSALASAVV VVPAVLSAPA QAAPPTPGPI VKLWNWNWNS VARECTDVLG 
PAGYGAVQVS PPSDSMSKGG SVWWDIYQPA RYTLNSKFGD ENAFRNMVRA CHDAGVKVHA
DAVVNHMTGQ GSRSYGGYDF GKYSYPGLYS SWDFHSPTCS INDGDYVNDG WRVQNCELVG
LSDLNTGSEY VRNQIAGYLN KLTDLGVDGF RIDAAKHIAP GDLAAIKSKL RGSPYIHQEV
IEGPNEAVKP GHYTGIGDVH EFVYGRKMKE QFTGQIKWLQ SFGQSWGLSV PSDKAVVFVD
NHDTERNGST LNYKYGDAYK LANVFMLAWP YGTPRVYSGF TWNNGEGGPP SANGGFVTDA
DCGNGQWTCF HRQMSGMVGF NRAVAGTPVG NWYDNGNNVI AFSRGGKGWV AINNEGGSVT
RTFSTGLPAG TYADDLGGGS VTVGSNGTAS VTVPAKGAVA IHTGGVRPTD PPAGDATVYY
ATTWTAANIH YQPTGGTWTP VPGVAMDETA CAGWKKKTVD LGTATGLKAA FNNGSGTWDN
NGDRDYTIGR GVSTVKGGVV TAGATSPCGD GPGPSPTPTV NPGEVAASFN ANVTTSYGQN
VFVVGDVEEL GGWDPAKAVA LSPAGYPVWK ATVSLPAGTA VSYKYVKKNP DGSVTWESDP
NRSFTTPSGG AVTRDDTWR