Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2537 |
Symbol | |
ID | 4026118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2843238 |
End bp | 2844284 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637967744 |
Product | peptidase M4, thermolysin |
Protein accession | YP_574583 |
Protein GI | 92114655 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCTT CACGAAGGGC GACGTTTTCC GGTTTCATGC CGCCGCACGT GCTCGATCGA ATCGCGGTGC AGGGTACCGA GCGCCAGCGG CGGTGTGCTC AGCAGACGTT GCAGGCCGAC CAGTGGTTTC GTCTACGGGC CTCGCCGCCG CCAGCGCGCG ACGCGGCGCG GGCCGTCGCC GGGCGGCCCG ACCGCCGCAT CCATTCCGCC GACCACGAAC AGACGCTGCC CGGCCGTCTG GTGCGGGAAG AAGGCCAGGC CGCGCATGGG GATGCGGCCG TCGACGAGGC CTATGAGTGG CTGGGTGCGA CCTACCGGTT CTACTGGGAG GTCTTCGGGC GCGACTCCAT CGACGATCGA GGCATGCCGC TGATCGGCAC CGTGCATTAC GGCCGCGATT ACGACAACGC CTTCTGGAAC GGTGCGCAGA TGGTCTTCGG TGACGGCGAC GGCGACCTGT TTCGGCGGTT CACGGCGGCG CCGGAAGTCG TCGCGCACGA GCTGACCCAC GGTGTGATCG AGCGCGATGT GGGGCTGGTC TACGCCGGCC AGTCCGGGGC GCTCAACGAG TCCCTGGCCG ATGTCTTCGG GGTGGTGGTC AAGCAGTACC ATGCCGGCCA GACGGCGCAG GAAGCGGACT GGCTCATCGG CGCGGCGTTG TTGACCGACC GGGTGCAAGG CCGTGCACTA CGCTCCATGG AAGCGCCGGG GACGGCATAC GATGACCCCG TGCTGGGACG CGATCCGCAA CCGGGCCACA TGCGCGATTT CGTCGACACG CAGGCCGACA ACGGGGGCGT TCACATCAAT TCCGGCATAC CCAACCGGGC CTTTTACCTG GCGGCGGTGG CCCTGGAGGC GCCGGCGTGG GAGAGCGTCG CGCCCGTGTG GTATGCGGCG ATGCGCGATG ACGCCCTGAG CCGGGAATCG GATTTCGCGG CTTTCGCGGC ACTCACCGTG GCGCATGCCC GGCGCCAGCA TGGAGAGGGA AGTCGCGAGG CGCGTGCGGT GGATGACGCC TGGCGCGAGG TGGGCGTCGT CTCATGA
|
Protein sequence | MASSRRATFS GFMPPHVLDR IAVQGTERQR RCAQQTLQAD QWFRLRASPP PARDAARAVA GRPDRRIHSA DHEQTLPGRL VREEGQAAHG DAAVDEAYEW LGATYRFYWE VFGRDSIDDR GMPLIGTVHY GRDYDNAFWN GAQMVFGDGD GDLFRRFTAA PEVVAHELTH GVIERDVGLV YAGQSGALNE SLADVFGVVV KQYHAGQTAQ EADWLIGAAL LTDRVQGRAL RSMEAPGTAY DDPVLGRDPQ PGHMRDFVDT QADNGGVHIN SGIPNRAFYL AAVALEAPAW ESVAPVWYAA MRDDALSRES DFAAFAALTV AHARRQHGEG SREARAVDDA WREVGVVS
|
| |