Gene Information |
Locus tag | Sros_3064 |
Symbol | |
ID | 8666351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3343355 |
End bp | 3344839 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | arylsulfatase A |
Protein accession | YP_003338757 |
Protein GI | 271964561 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
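The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names (gene_length, expected_protein_length, gc_percent) are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """GC content as a percentage; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record for Sros_3064.
length_bp = gene_length(3343355, 3344839)
print(length_bp)                           # 1485, matches "Gene Length"
print(expected_protein_length(length_bp))  # 494, matches "Protein Length"
```

Passing the full Gene sequence string from the Sequence section to gc_percent should give a value comparable to the listed GC content, since the 10-base block spacing is stripped before counting.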
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.886966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.178037 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCACAC GCAACGTCCT GTTCCTGATG ACCGACCAGC ATCGGGTCGA CACACTGGGG TGCTACGGCA ACCCGGTGGT GCGCACTCCC GCGCTGGACG GCCTGGCGGC CGAGGGCACC CGGTTCGATC GCTTCTACAC GCCCACCGCG ATCTGCACCC CGGCGCGGGC CTCCTTGTTC ACCGGCCTGC ATCCCTTCCG GCACGGCCTG CTGGTCAACC CCGAGCGCAA CGGCGGCGCC CGCGACGAGG TCGACGACGC CCACCCGATC CTGTCGGCGC CGCTGCTTGA GGCGGGCTAC AACATGGGCC ACGTCGGCAA GTGGCACATC GGGCGCGAGC GGGGCCCCGA GTTCTACACG ATGGACGGCG AGCACCTGCC CGGCGCGCTC AACCCCTTCC ACCATCCCTC CTACGAGCGG TGGCTCAAGG AGAACGGCCA CCCGCCGTTC GCGGTGCGCG AGGCGGTCTT CGGCAAGGCG CCCAACGACT CCGGGCGCGG CCACCTGATC GCGGGGCGCC TCCAGCAGCC CGCCGAGGCC ACGATGGAGG CGTTCCTGAC CGAGCGGACC CTGGAGCTCC TCGAAGGCTA CGCCCGAGAC TTCCACGACA GCGGCAAACG GTTCATGCTC TCCTGCCACT GGTACGGCCC GCATCTGCCG TACCTCATCC CGGACGAGTA CTACGACATG TACGACCCGG AGCAGGTGCC GCTGCCGGCC TCGATGGCCG AGACCTTCGC CGGCAAGCCC GACGTCCAGC GCCGCTACGC CGAGTACTGG TCGGCCGACC ACTTCGACGC CGACGCCTGG CGCAAGCTGA TCGCGGTCTA CTGGGGCTAC GTCACGATGA TCGACGACCA GATCGGCCGC CTGCTCGCCG CGCTGCGCGA GCACGGCCTC TGGGACGACA CGGCCGTGGT CTTCACCGCC GACCACGGCG AGTTCACCGG CGCCCACCGG CTCAACGACA AGGGCCCGGC GATGTACGAG GACATCTACC GCATCCCCGG CATCGTCCGC GTCCCCGGCG CCCCGGCCGG GGTCGTCGAC GAGTTCGCCA CGCTGATCGA CCTCAACCCC ACGATCCTCG GCCTGGCCGG GCTGCCGCCC CGCGAGCCCT GCGACGGGGA GAGCCTGCTG CCGCTGATCG AGGATGAGGA TCCCGCGTGG CGGCAGGAGG TGGTCACCGA GTTCCACGGC CACCACTTCC CCTACTCCCA GCGGATGATC CGCGACCGGC GCCACAAGCT GGTCTTCAAC CCCGAGAGCG TGAACGAGCT CTACGACCTG GAGACCGACC CGCACGAACT GCACAACGTC CACTCCGCCC CCGCCTACGC CGGGGTGCGG CGCGACCTCA CCGGGCGGCT CTACCGCGAG CTGCTGCGGC GCGGCGATCC CGCCTACACC TGGATGAGCT ACATGGCCGA CATCGACGGC GACCGGGCCG CCGACGTCGA CGGCGTGGCC GGCGAGGTGG CCTGA
|
Protein sequence | MSTRNVLFLM TDQHRVDTLG CYGNPVVRTP ALDGLAAEGT RFDRFYTPTA ICTPARASLF TGLHPFRHGL LVNPERNGGA RDEVDDAHPI LSAPLLEAGY NMGHVGKWHI GRERGPEFYT MDGEHLPGAL NPFHHPSYER WLKENGHPPF AVREAVFGKA PNDSGRGHLI AGRLQQPAEA TMEAFLTERT LELLEGYARD FHDSGKRFML SCHWYGPHLP YLIPDEYYDM YDPEQVPLPA SMAETFAGKP DVQRRYAEYW SADHFDADAW RKLIAVYWGY VTMIDDQIGR LLAALREHGL WDDTAVVFTA DHGEFTGAHR LNDKGPAMYE DIYRIPGIVR VPGAPAGVVD EFATLIDLNP TILGLAGLPP REPCDGESLL PLIEDEDPAW RQEVVTEFHG HHFPYSQRMI RDRRHKLVFN PESVNELYDL ETDPHELHNV HSAPAYAGVR RDLTGRLYRE LLRRGDPAYT WMSYMADIDG DRAADVDGVA GEVA