Gene Svir_32200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSvir_32200 
Symbol 
ID8388544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharomonospora viridis DSM 43017 
KingdomBacteria 
Replicon accessionNC_013159 
Strand
Start bp3491249 
End bp3492757 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content67% 
IMG OID644977247 
Productcarboxypeptidase C (cathepsin A) 
Protein accessionYP_003135019 
Protein GI257057187 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.20738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.101175 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAGA AGAACACGGA GGCCACCGAG CGGACCGACT CCGCCGAGAA CGCCGACGCC 
AAGGGTGCCC AGACATCCGC CGAGCCGCGG GACGACATCG TGACCACCCG GCACACGCTG
TCCCTCGGCG ACCGCGAACT CGTCTACACA GCGCGCGCGG GCCGCATCGT GCTGCGCAAG
GAGGTGCTCA AGGACGGCGC TTTCGACGGG CACAAGCCCA AGGCCGAGGT CTTCCTCACC
TCGTACACCC TCGACGACGC CGATCCGAGT TCGCGGCCGG TGACGTTCGC GTTCAACGGG
GGACCCGGCT CGTCCAGCAT CTGGCTGCAC ATGGGCCTGT TCGGGCCGCG TCGCGTGGTG
TCGGGCGACG TCGACGACCC GGAGCCGCCG CCGTACCGTT TGGCGGACAA CACGGAGACG
CTGTTGACCC ACAGCGACCT CGTCTTCATC GACCCGGTGT CCACCGGTTA CTCGCGTACC
GTGGAGGGCG AGAAGCCGAA GGACTTCCAC GGCTTCACCC CCGACGTGGA GGCCGTCGGC
GAGGTGATCC GACTGTGGAC GTCACGTAAC GAACGTTGGC TGTCGCCGAA GTTCGTGGCG
GGCGAGTCGT ACGGCACCGT GCGCGCCGCC GCGCTCGCCG CGCATCTGCA GCAGCGGCAC
GGGCTCTACC TCAACGGGCT GCTGTTGATC TCGTCGGTGC TCGACCTGGG CACGGTGATG
TTCAACGAGG GCAACGACCT GCCGTACTCG TTGTACGTGC CGACGTACGC GGCCATCGCG
CACTACCACG GCAAGCACGG CGACCGACCG CTCGAGGAGG TACTGGCCGA GGCCGAGGAG
TTCGCCTCGC GTGACCTGCC GTGGGCACTG GCCCGTGGCG CGCGCCTTTC GGCCGAGGAA
CGCGCCGACG TCGTCGCCCG GCTGGCACGT CTCACCGGAC TTTCTGAATC CTATGTAGAC
CGAGTGAACC TGCGCATCGA ACACGTGCGG TTCTTCACCG AACTGCTGCG CGACCGAGGA
CTGACCACGG GGCGCATGGA CGGCCGCTTC ACCACGTGGG AGCCCGACGG CGGGCGTGAG
CACATGAGCG ACGACGCCTC GATCTCCCGC ATCATCGGCG CCTACTCCGC GACGTTCAAC
CACTACGTGC GCAGCGAGCT CGGTTACGCC AACGATTTGC CCTACGAGAT CCTGTCGCTC
GACGTCAACC GCGAATGGTC CTATTCGGAC TTCGAGGGCA GGCCCATCTC CGTGGTGCAC
GACCTGTCGG CGGCGATGCG CGCCAACCCG CATCTGAAAG TGCACGTCGC GTGCGGCTAC
TACGACGGCG CGACACCCCA CTTCGCGGCC GAACACGTGT TGGCCCAATT GCAGATCCCG
GACGAACTGC GGTCGAACAT CGAAACGGCC TACTACCCGG CCGGCCACAT GATGTACGTC
CACGAACCGT CGCGCGTGCA GCAGTCACGT GACCTGGCGG AGTTCGTCGC CCGCAGCTCC
AACCGGTGA
 
Protein sequence
MPEKNTEATE RTDSAENADA KGAQTSAEPR DDIVTTRHTL SLGDRELVYT ARAGRIVLRK 
EVLKDGAFDG HKPKAEVFLT SYTLDDADPS SRPVTFAFNG GPGSSSIWLH MGLFGPRRVV
SGDVDDPEPP PYRLADNTET LLTHSDLVFI DPVSTGYSRT VEGEKPKDFH GFTPDVEAVG
EVIRLWTSRN ERWLSPKFVA GESYGTVRAA ALAAHLQQRH GLYLNGLLLI SSVLDLGTVM
FNEGNDLPYS LYVPTYAAIA HYHGKHGDRP LEEVLAEAEE FASRDLPWAL ARGARLSAEE
RADVVARLAR LTGLSESYVD RVNLRIEHVR FFTELLRDRG LTTGRMDGRF TTWEPDGGRE
HMSDDASISR IIGAYSATFN HYVRSELGYA NDLPYEILSL DVNREWSYSD FEGRPISVVH
DLSAAMRANP HLKVHVACGY YDGATPHFAA EHVLAQLQIP DELRSNIETA YYPAGHMMYV
HEPSRVQQSR DLAEFVARSS NR