Gene Sros_1806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1806 
Symbol 
ID8665084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1928726 
End bp1929997 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content68% 
IMG OID 
Productmacrolide glycosyltransferase 
Protein accessionYP_003337539 
Protein GI271963343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTCA CGATCCTGTT CATGCCGGAG AGCGCCTACG GGCCGACGAA CAACTGCATC 
GGCATCGGTG ACATTCTCCG CAAGCGCGGC CACCGCGTCG TCTTCGCAGC TGAAGCCTCC
TGGAAGGGGA AATTGGAGGC TCTTGGATTC GAAGAGGATC TGGTGGATCT CGCGCCGCCG
TCCGAAGAGG AGCAGGACCC CGGACAGTTC TGGAAGGACT TCATCCGGGA CACCGCGCCG
GAATATCGCA AGTCGACCTC GGCTCAGCTG GAGACGGTGA CCAAGCCGAT CTGGGAGGCG
CTCGTCGACG GCGCGAAGTA CTGCGAGCCT CAGCTGAAGG CGATTATCGA GCGCGTTCAG
CCGGACGTGA TCGTCGAGGA CAATGTCATC ACCTTCCCGG CGCTGCTCAC GGCCGGTAAG
CCGTTCGTCC GCATCGTCTC CTGCAACCCG CTGGAGGTGC GCGGCGAGGG CGTCGCCCCG
GTCTTCTCCG GCCTGCCCGC CGACGACCGG TCCGAGTGGG ACGCCTTCCG CGCCGAGTAC
GACCGGACCC ACCGCGAGCT CTGGACCGCC TTCAACGAGT GGGTCGTCGC CCAGGGCGCC
CGGCCGCTGC CCGAGCTGGA CTTCATCCAC GAGGGCGACC TGAACCTCTA CGTCTTCCCG
GAGATCGCCG ACTACACCGA CGCCCGGCCG CTGGACGGCT CCTGGCACCG CCTGGACTCC
TCGGTCCGCG AGACCGACGG CGGCTTCGAG CTGCCCGCGT CGCTGGCCGA CCGGGACGGC
GCGCTGGTCT ACTTCTCGCT CGGCTCGCTC GGCTCGGCGG ACGTCTCGCT GATGCAGCGG
GTCATCGACG TGCTCGGCAC CACCCCGCAC CGGTTCATCG TCTCCAAGGG CCCGCTGCAC
GAGGAGATCA AGCTCGCCGA CAACATGTGG GGAGCCGAGT TCGTCCCGCA GACGAAGATC
ATCCCCATGG CGGACCTGGT GATCACGCAC GGTGGCAACA ACACCACCAC CGAGGCGCTG
CACTTCGGCA AGCCGATGAT CCTGCTGCCC CTGTTCTGGG ACCAGTACGA CAACGCGCAG
CGGATCCACG AGCTCGGCTA CGGCGTCCGC CTGGCCACCT ACACCTTCAC CGACGAAGAG
CTGACCGGCG CGCTGGACAG GCTGCTCGGC GACGCGGGGC TCCGTGAGCG CCTGGCCGCG
GCCGGCGAGG AGATCCGCCG GCGTGACGGC CTGCGCAAGG CCGCCGACCT GATCGAGCAG
GCCGGCGCCT GA
 
Protein sequence
MSLTILFMPE SAYGPTNNCI GIGDILRKRG HRVVFAAEAS WKGKLEALGF EEDLVDLAPP 
SEEEQDPGQF WKDFIRDTAP EYRKSTSAQL ETVTKPIWEA LVDGAKYCEP QLKAIIERVQ
PDVIVEDNVI TFPALLTAGK PFVRIVSCNP LEVRGEGVAP VFSGLPADDR SEWDAFRAEY
DRTHRELWTA FNEWVVAQGA RPLPELDFIH EGDLNLYVFP EIADYTDARP LDGSWHRLDS
SVRETDGGFE LPASLADRDG ALVYFSLGSL GSADVSLMQR VIDVLGTTPH RFIVSKGPLH
EEIKLADNMW GAEFVPQTKI IPMADLVITH GGNNTTTEAL HFGKPMILLP LFWDQYDNAQ
RIHELGYGVR LATYTFTDEE LTGALDRLLG DAGLRERLAA AGEEIRRRDG LRKAADLIEQ
AGA