Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0973 |
Symbol | |
ID | 4026196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1089587 |
End bp | 1090618 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637966150 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_573029 |
Protein GI | 92113101 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGTAT TGGGCATCGA GACCTCCTGC GACGAAACCG GCGTCGCCAT TTATGACACC GAGCGCGGCC TGATCGCCGA TGCGCTGCAC AGCCAAATGG CCATGCACGC CGAATTCGGC GGTGTGGTCC CGGAACTCGC CTCGCGGGAT CACACTCGCA AGCTGCTGCC GCTGATTCGC CAGGTGCTCG ACGACGCCGA GCTGCGCGGC GACCAGCTAG ACGCCATCGC CTACACGGCG GGCCCCGGCC TGGTCGGCGC GCTGATGGTC GGCGCCTCCA CCGCGCACGG CCTGGCGCGC GCCTGGGACA TCCCGGCACT CGGCGTGCAT CACATGGAAG GCCATCTGCT GGCGCCGATG CTCGAGGCCG CGCCGCCCGA CTTTCCCTTC GTGGCCCTGC TGGTGTCGGG TGGGCACACG CAGCTCGTCG AGGTCCACGG CCTGGGCCGT TACCGGCTGC TGGGCGAATC GGTCGACGAT GCCGCCGGCG AGGCCTTCGA CAAGGCCGCC AAGATGCTCG AACTGCCCTA CCCTGGCGGC CCCCACGTCG CCCAGCTCGC CGAGCGCGGC GACCCGACCC GGTTTCGCTT TCCGCGCCCG ATGACCGACC GGCCGGGACT CGACTTCAGC TTTTCGGGTC TCAAGACCCA CACCCTGACC ACCGCCAACC AGCTCAAGGC GGCGGGCCCC CTCAGCGACC AGGACCGCGC CGACATCGCG CGCGCCTTCG AGGAAGCCGT CGTCGACACG CTGGTCATCA AGTGCCGGCG CGCCCTCGAC ACCACGGGCC TCAAGCGGCT GGTGGTGGCC GGCGGCGTCA GCGCCAATCA TCGCCTGCGC GAGCGCCTGG ACCGGGAAAC CGCCAAGCGC CAGGCCCAGG CGTTCTACCC GCGCGGACGC TTCTGCACCG ACAACGGCGC AATGATCGCT TATGTCGGCG CACAACGCCT GCTGGCCGGG GAGCGCGACG ACGCGACGAT GCAGGCCACG CCGCGCTGGC CGCTGGCGTC GCTCACTCCT CCGGCGGCTT GA
|
Protein sequence | MRVLGIETSC DETGVAIYDT ERGLIADALH SQMAMHAEFG GVVPELASRD HTRKLLPLIR QVLDDAELRG DQLDAIAYTA GPGLVGALMV GASTAHGLAR AWDIPALGVH HMEGHLLAPM LEAAPPDFPF VALLVSGGHT QLVEVHGLGR YRLLGESVDD AAGEAFDKAA KMLELPYPGG PHVAQLAERG DPTRFRFPRP MTDRPGLDFS FSGLKTHTLT TANQLKAAGP LSDQDRADIA RAFEEAVVDT LVIKCRRALD TTGLKRLVVA GGVSANHRLR ERLDRETAKR QAQAFYPRGR FCTDNGAMIA YVGAQRLLAG ERDDATMQAT PRWPLASLTP PAA
|
| |