Gene Information |
Locus tag | Svir_32200 |
Symbol | |
ID | 8388544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharomonospora viridis DSM 43017 |
Kingdom | Bacteria |
Replicon accession | NC_013159 |
Strand | + |
Start bp | 3491249 |
End bp | 3492757 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644977247 |
Product | carboxypeptidase C (cathepsin A) |
Protein accession | YP_003135019 |
Protein GI | 257057187 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
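The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3491249, 3492757)
print(length, protein_length_aa(length))  # 1509 502
```

Both results agree with the Gene Length and Protein Length fields of the record.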
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.20738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.101175 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCAGAGA AGAACACGGA GGCCACCGAG CGGACCGACT CCGCCGAGAA CGCCGACGCC AAGGGTGCCC AGACATCCGC CGAGCCGCGG GACGACATCG TGACCACCCG GCACACGCTG TCCCTCGGCG ACCGCGAACT CGTCTACACA GCGCGCGCGG GCCGCATCGT GCTGCGCAAG GAGGTGCTCA AGGACGGCGC TTTCGACGGG CACAAGCCCA AGGCCGAGGT CTTCCTCACC TCGTACACCC TCGACGACGC CGATCCGAGT TCGCGGCCGG TGACGTTCGC GTTCAACGGG GGACCCGGCT CGTCCAGCAT CTGGCTGCAC ATGGGCCTGT TCGGGCCGCG TCGCGTGGTG TCGGGCGACG TCGACGACCC GGAGCCGCCG CCGTACCGTT TGGCGGACAA CACGGAGACG CTGTTGACCC ACAGCGACCT CGTCTTCATC GACCCGGTGT CCACCGGTTA CTCGCGTACC GTGGAGGGCG AGAAGCCGAA GGACTTCCAC GGCTTCACCC CCGACGTGGA GGCCGTCGGC GAGGTGATCC GACTGTGGAC GTCACGTAAC GAACGTTGGC TGTCGCCGAA GTTCGTGGCG GGCGAGTCGT ACGGCACCGT GCGCGCCGCC GCGCTCGCCG CGCATCTGCA GCAGCGGCAC GGGCTCTACC TCAACGGGCT GCTGTTGATC TCGTCGGTGC TCGACCTGGG CACGGTGATG TTCAACGAGG GCAACGACCT GCCGTACTCG TTGTACGTGC CGACGTACGC GGCCATCGCG CACTACCACG GCAAGCACGG CGACCGACCG CTCGAGGAGG TACTGGCCGA GGCCGAGGAG TTCGCCTCGC GTGACCTGCC GTGGGCACTG GCCCGTGGCG CGCGCCTTTC GGCCGAGGAA CGCGCCGACG TCGTCGCCCG GCTGGCACGT CTCACCGGAC TTTCTGAATC CTATGTAGAC CGAGTGAACC TGCGCATCGA ACACGTGCGG TTCTTCACCG AACTGCTGCG CGACCGAGGA CTGACCACGG GGCGCATGGA CGGCCGCTTC ACCACGTGGG AGCCCGACGG CGGGCGTGAG CACATGAGCG ACGACGCCTC GATCTCCCGC ATCATCGGCG CCTACTCCGC GACGTTCAAC CACTACGTGC GCAGCGAGCT CGGTTACGCC AACGATTTGC CCTACGAGAT CCTGTCGCTC GACGTCAACC GCGAATGGTC CTATTCGGAC TTCGAGGGCA GGCCCATCTC CGTGGTGCAC GACCTGTCGG CGGCGATGCG CGCCAACCCG CATCTGAAAG TGCACGTCGC GTGCGGCTAC TACGACGGCG CGACACCCCA CTTCGCGGCC GAACACGTGT TGGCCCAATT GCAGATCCCG GACGAACTGC GGTCGAACAT CGAAACGGCC TACTACCCGG CCGGCCACAT GATGTACGTC CACGAACCGT CGCGCGTGCA GCAGTCACGT GACCTGGCGG AGTTCGTCGC CCGCAGCTCC AACCGGTGA
Protein sequence | MPEKNTEATE RTDSAENADA KGAQTSAEPR DDIVTTRHTL SLGDRELVYT ARAGRIVLRK EVLKDGAFDG HKPKAEVFLT SYTLDDADPS SRPVTFAFNG GPGSSSIWLH MGLFGPRRVV SGDVDDPEPP PYRLADNTET LLTHSDLVFI DPVSTGYSRT VEGEKPKDFH GFTPDVEAVG EVIRLWTSRN ERWLSPKFVA GESYGTVRAA ALAAHLQQRH GLYLNGLLLI SSVLDLGTVM FNEGNDLPYS LYVPTYAAIA HYHGKHGDRP LEEVLAEAEE FASRDLPWAL ARGARLSAEE RADVVARLAR LTGLSESYVD RVNLRIEHVR FFTELLRDRG LTTGRMDGRF TTWEPDGGRE HMSDDASISR IIGAYSATFN HYVRSELGYA NDLPYEILSL DVNREWSYSD FEGRPISVVH DLSAAMRANP HLKVHVACGY YDGATPHFAA EHVLAQLQIP DELRSNIETA YYPAGHMMYV HEPSRVQQSR DLAEFVARSS NR
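The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the sequences are pasted as shown, in space-separated blocks of ten characters, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code for sense and stop codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table built in TCAG order. Translation table 11 uses the same
# amino acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGCCAGAGA AGAACACGGA GGCCACCGAG"
print(translate(prefix))  # MPEKNTEATE
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.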