Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Svir_26220 |
Symbol | |
ID | 8387946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharomonospora viridis DSM 43017 |
Kingdom | Bacteria |
Replicon accession | NC_013159 |
Strand | + |
Start bp | 2826283 |
End bp | 2828646 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644976660 |
Product | arylsulfatase A family protein |
Protein accession | YP_003134437 |
Protein GI | 257056605 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAATC CAGCCCTGGA TCGGACCCGG CTGCCGCTGC CGGAGAGCTT CCCTCCTCCC GCCACGTCGC TGGACGTGCG CGACCAGGAT CCGGCGTACG AACCGATGCG TCCGATACAG GCACCAAAGG ACGCGCCGAA CGTGGTGATC GTGCTGATCG ACGACATGGG GTTCGGTGCC CCGAGCGCGT TCGGCGGTCC CTGCCGGATG CCCACCGCGG ACCGGCTCGC CGACGCGGGC CTGCGCTACA GCCGGTTCCA CGTCACCGCG ATCTGCTCAC CGACCCGCCA GTCGCTGTTG ACCGGCCGCA ACCATCACTC GGTCGGCATG GGCGTGACCA CCGAGATGGC CAGTGCCGCA CCCGGCTACA ACGGGATCCG TCCGCGCAGC GCGGCCACGC TGGCCCAGGT GCTGCGCGGG AACGGGTACA ACACGGCCGC GTTCGGCAAA TGGCACCAGA CTCCCGCCCG GGACGTCAGC GCGGCCGGTC CGTTCGACCG CTGGCCCACG GGCGAGGGGT TCGAGAAGTT CTACGGATTC CTGTGCGCGG AGATGAACCA CTGGTACCCG GTGCTGTTCG ACGGGACCAA GCCCATCGAA CCGTCCCGGA AACCCGAGGA CGGCTACCAC CTGTCCGAGG ACCTGGTGGA CCAGGCGATC GACTGGGTAC GGGACCAGCG CACCCTCAAA CCGGACCACC CGTTCTTCGT GTACCTGTCC TTCGGCGCGA CGCACGCGCC CTACCACGTC CCCCGTGAGT ACCGGGACAG GTACAAGGGC AAGTTCGACC ACGGCTGGGA CGTCCAGCGC GAGATCACGC TACGACGCCA GAAGGAACTG GGGGTCGTGC CAAAGGACGC GGAACTGGCA CCGTGGGCCG AGGGCGTTCC GCACTGGGAC GAGCTCTCCG AGCCCGAGCG CAAGGCCGCC GCCGCCCTCA TGGAACTGTA CGCGGGCTTC GCCGAACACA CCGACGACCA GGTCGGTCGC TTCGTCGACG CGCTCGAGGA ACTGGGCGAG TTGGACAACA CCCTCTTCAT CTACATCCTC GGCGACAACG GCGCCTCCGC CGAGGGCGGC CTCGGTGGCA CGTTGAACGA ACACCGCTAC GCCAGCGGCA TCCCGGACTC CGCCGAGTAC ATCAACGAAC ACCTCGACGC CTTGGGGGAC GCCACCACGC ACGCGCACTA CCCGGTGGGC TGGGCGTTGG CGATGAACAC CCCGTACCAG TGGACCAAGC AGGTGGCCTC GCACTTCGGT GGCACCCGGG ACGGGATGGT GGTGCACTGG CCACGCGGGA TACGACAGAA GGGCGGCATC CGACACCAGT TCCACCACGT GATCGACGTG ATGCCGACCG TGCTGGAGGC CGCGGGTATC CCGCATCCGA CCGAGGTGGA CGGTGTGGCG CAGAAACCCA TCGAGGGCAC CAGCATGCTC TACACCTTCA ACGGCCCCGA CGAACCGGAC CGCCACCGGG TGCAGTACTT CGAGATGGTC GGCAACCGGG GCATCTACCA CGACGGTTGG ATGGCGGTGA CCCGGCACGG CACGCCGTGG GAGATGGTGC AGGACGGTCA GCCCCGCCGT TTCGACGAGG ACGTGTGGGA GTTGTACGAC ACCAACACGG ACTGGACCCA AGCCCACGAC CTCGCGGACA CCTACCCCGA CAAACTGGCC GAATTGCAGC AGCTGTTCCT CATCGAGGCC GCGAAGTACA ACGTGTTCCC GATGGACGAC CGGATGACCG AACGGGAGAA CCCTCGGGAG GCCGGTCGGC TGGACCTCAT GGGGGAACGA AGGTCGATCA CCTTCCATGC GAACGCGAAG CGGCTGACCG AGGAGACCGC GCCCAACGTC AAGAACCGCT CACACACCAT CACCGCGGAA GTCGAGATCC CCGACGGCGG CGCGGAAGGG GTCCTCGTCG CCCAGGGCGG TCGATTCGGC GGCTGGTCGG TGTATTTCCA CGAGGGCAGG TTCTGCTACG CCTACAACTA CTTCGGACTT AAGGTGCACA CCGCGCGCAG CCACGAACCC CTGCCCGCCG GAAGACACGA GGTGCGGATG GAGTTCGCCT ACGCCGGTGG CGGGGTCGGC AAGGGCGGCG CCGTCACGCT GCGGGTGAAC GGCTCCGAGG TCGGAGGGAT CATGGTCCCG GCGACCATCC CGTACTACTT CGCCTTCGAC GAGACCTTCG ACATCGGCGT GGACCGGGCC TCCCCGGTCA CCGACGACTA CCCGGTCGTC GACAACCGGT TCACCGGCCG GTTGCACTGG CTTCGTGTCG ATCTGGGCGA CGACCTGTAC GACGACGCCA GTGCCGAACG CGAACGCGCA CGTTTCCGGG CCGTGCATGA CTGA
|
Protein sequence | MRNPALDRTR LPLPESFPPP ATSLDVRDQD PAYEPMRPIQ APKDAPNVVI VLIDDMGFGA PSAFGGPCRM PTADRLADAG LRYSRFHVTA ICSPTRQSLL TGRNHHSVGM GVTTEMASAA PGYNGIRPRS AATLAQVLRG NGYNTAAFGK WHQTPARDVS AAGPFDRWPT GEGFEKFYGF LCAEMNHWYP VLFDGTKPIE PSRKPEDGYH LSEDLVDQAI DWVRDQRTLK PDHPFFVYLS FGATHAPYHV PREYRDRYKG KFDHGWDVQR EITLRRQKEL GVVPKDAELA PWAEGVPHWD ELSEPERKAA AALMELYAGF AEHTDDQVGR FVDALEELGE LDNTLFIYIL GDNGASAEGG LGGTLNEHRY ASGIPDSAEY INEHLDALGD ATTHAHYPVG WALAMNTPYQ WTKQVASHFG GTRDGMVVHW PRGIRQKGGI RHQFHHVIDV MPTVLEAAGI PHPTEVDGVA QKPIEGTSML YTFNGPDEPD RHRVQYFEMV GNRGIYHDGW MAVTRHGTPW EMVQDGQPRR FDEDVWELYD TNTDWTQAHD LADTYPDKLA ELQQLFLIEA AKYNVFPMDD RMTERENPRE AGRLDLMGER RSITFHANAK RLTEETAPNV KNRSHTITAE VEIPDGGAEG VLVAQGGRFG GWSVYFHEGR FCYAYNYFGL KVHTARSHEP LPAGRHEVRM EFAYAGGGVG KGGAVTLRVN GSEVGGIMVP ATIPYYFAFD ETFDIGVDRA SPVTDDYPVV DNRFTGRLHW LRVDLGDDLY DDASAERERA RFRAVHD
|
| |