Gene Sros_3064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3064 
Symbol 
ID8666351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3343355 
End bp3344839 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content71% 
IMG OID 
Productarylsulfatase A 
Protein accessionYP_003338757 
Protein GI271964561 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.886966 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.178037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCACAC GCAACGTCCT GTTCCTGATG ACCGACCAGC ATCGGGTCGA CACACTGGGG 
TGCTACGGCA ACCCGGTGGT GCGCACTCCC GCGCTGGACG GCCTGGCGGC CGAGGGCACC
CGGTTCGATC GCTTCTACAC GCCCACCGCG ATCTGCACCC CGGCGCGGGC CTCCTTGTTC
ACCGGCCTGC ATCCCTTCCG GCACGGCCTG CTGGTCAACC CCGAGCGCAA CGGCGGCGCC
CGCGACGAGG TCGACGACGC CCACCCGATC CTGTCGGCGC CGCTGCTTGA GGCGGGCTAC
AACATGGGCC ACGTCGGCAA GTGGCACATC GGGCGCGAGC GGGGCCCCGA GTTCTACACG
ATGGACGGCG AGCACCTGCC CGGCGCGCTC AACCCCTTCC ACCATCCCTC CTACGAGCGG
TGGCTCAAGG AGAACGGCCA CCCGCCGTTC GCGGTGCGCG AGGCGGTCTT CGGCAAGGCG
CCCAACGACT CCGGGCGCGG CCACCTGATC GCGGGGCGCC TCCAGCAGCC CGCCGAGGCC
ACGATGGAGG CGTTCCTGAC CGAGCGGACC CTGGAGCTCC TCGAAGGCTA CGCCCGAGAC
TTCCACGACA GCGGCAAACG GTTCATGCTC TCCTGCCACT GGTACGGCCC GCATCTGCCG
TACCTCATCC CGGACGAGTA CTACGACATG TACGACCCGG AGCAGGTGCC GCTGCCGGCC
TCGATGGCCG AGACCTTCGC CGGCAAGCCC GACGTCCAGC GCCGCTACGC CGAGTACTGG
TCGGCCGACC ACTTCGACGC CGACGCCTGG CGCAAGCTGA TCGCGGTCTA CTGGGGCTAC
GTCACGATGA TCGACGACCA GATCGGCCGC CTGCTCGCCG CGCTGCGCGA GCACGGCCTC
TGGGACGACA CGGCCGTGGT CTTCACCGCC GACCACGGCG AGTTCACCGG CGCCCACCGG
CTCAACGACA AGGGCCCGGC GATGTACGAG GACATCTACC GCATCCCCGG CATCGTCCGC
GTCCCCGGCG CCCCGGCCGG GGTCGTCGAC GAGTTCGCCA CGCTGATCGA CCTCAACCCC
ACGATCCTCG GCCTGGCCGG GCTGCCGCCC CGCGAGCCCT GCGACGGGGA GAGCCTGCTG
CCGCTGATCG AGGATGAGGA TCCCGCGTGG CGGCAGGAGG TGGTCACCGA GTTCCACGGC
CACCACTTCC CCTACTCCCA GCGGATGATC CGCGACCGGC GCCACAAGCT GGTCTTCAAC
CCCGAGAGCG TGAACGAGCT CTACGACCTG GAGACCGACC CGCACGAACT GCACAACGTC
CACTCCGCCC CCGCCTACGC CGGGGTGCGG CGCGACCTCA CCGGGCGGCT CTACCGCGAG
CTGCTGCGGC GCGGCGATCC CGCCTACACC TGGATGAGCT ACATGGCCGA CATCGACGGC
GACCGGGCCG CCGACGTCGA CGGCGTGGCC GGCGAGGTGG CCTGA
 
Protein sequence
MSTRNVLFLM TDQHRVDTLG CYGNPVVRTP ALDGLAAEGT RFDRFYTPTA ICTPARASLF 
TGLHPFRHGL LVNPERNGGA RDEVDDAHPI LSAPLLEAGY NMGHVGKWHI GRERGPEFYT
MDGEHLPGAL NPFHHPSYER WLKENGHPPF AVREAVFGKA PNDSGRGHLI AGRLQQPAEA
TMEAFLTERT LELLEGYARD FHDSGKRFML SCHWYGPHLP YLIPDEYYDM YDPEQVPLPA
SMAETFAGKP DVQRRYAEYW SADHFDADAW RKLIAVYWGY VTMIDDQIGR LLAALREHGL
WDDTAVVFTA DHGEFTGAHR LNDKGPAMYE DIYRIPGIVR VPGAPAGVVD EFATLIDLNP
TILGLAGLPP REPCDGESLL PLIEDEDPAW RQEVVTEFHG HHFPYSQRMI RDRRHKLVFN
PESVNELYDL ETDPHELHNV HSAPAYAGVR RDLTGRLYRE LLRRGDPAYT WMSYMADIDG
DRAADVDGVA GEVA