Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1628 |
Symbol | |
ID | 4028232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1849516 |
End bp | 1850952 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637966817 |
Product | peptidase S1C, Do |
Protein accession | YP_573680 |
Protein GI | 92113752 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATA TCGCCCGTCA TACCACGCTC TGGTGTCTTG CCCTGCTGAC GCTTTTCACC GCTCAGCTCG CTCAGGCACG CGACCTGCCG GACTTCACCC AGCTGGTGAA GGATGCCGCC CCCGGGGTCG TCAACATCTC CACCACCAGT ACCGTCGAAT CCCGGGGCAT GGCCGGATCG CCGTTCGGCA GTCAGGACGT GCCCGACATC TTCCGGCACT TCTTCGGCGA TCAGATGCCG CCGATGATAC CCGGGGCGCC GGGCTACGGC GGCAGTGAAG AGCGCCACTC CCTCGGCTCC GGTTTCGTCA TCAGCCGCGA CGGCTACATC ATGACCAACG CCCACGTGGT CGACGGTGCC GACGAGATCG TGGTGCGCCT CAACGACCGT CGCGAGCTCG AGGCCACGCT CGTGGGGGCC GACAAGAAAA CCGACGTCGC CGTGCTCAAG GTCGATGCCG ACGACTTGCC GGTACTCGAG ATGGGAGACT CCGACGCTCT GGAAGTCGGT GAATGGGTCG CCGCCATCGG CTCGCCCTTC GGCTTCGATC ATTCGGTGAC CTCGGGCATC GTCAGCGCCA TCGACCGTAC GCTGCCCAGC GACGCCTATG TGCCGTTCAT CCAGACCGAC GTGGCGATCA ACCCCGGCAA TTCCGGCGGT CCGCTGTTCA ACCTCGATGG CGAGGTCGTG GGTATCAACT CGCAGATCTA TACCCGCAGT GGCGGCTTCA TGGGGGTGTC CTTCGCCATT CCCATCAACG TGGCGATGGA TATCGCCGAT CAGTTGAAGG ACTCCGGGCA TGTCAATCGC GGCTGGCTCG GCGTGGTGAT TCAGCCGGTA TCCCGTGACC TGGCGGAATC CTTCGGGCTC GACGGTCCGC GCGGCGCGCT GATTTCCGAT GTCACCGACG ACAGCCCGGC AAGCCGTGCG GGACTCGAGG CGGGCGACGT GGTGCTGTCC GTCAACGACG ATCGCGTCGA GGATTCCAGC TCGTTGCCGC GTCTGGTGGG GCGTGTCGCG CCGGGCGAGG ACATCACGCT GACGGTGATG CGCGATGGCG AACGCCGCGA TCTCGACGTG ACCGTGGGCA GCTGGCCCGA CGAGGGCAAG GCCGTGACGG GGACGTCGAA GAAGGACGAC AGCCAGGTGC GGCTCGGCAT CGCCATCAGC GATCTGGATG AACCGATGCG CCGGCAACTG GACGTCGACA GCGGCGTGCT GGTGCGCCAG GTCGACCCGC GTGGTGCGGC CGCCGCCGCC GGCCTGTCAC GCGGCGATGT CATCGTCAGC TTCAACGGCC AGGACATCGA GGATACCGAT GCCCTGATGG AGGCGGTGAA GGACGCGCCC ACCGATCGTG CCGTGCCCGT GCGCATCGTG CGCGACGGCC AATCGTTGTT CGTGGCGCTG CGCCTGGAGA CCGAAAAGCA GGAGTGA
|
Protein sequence | MKYIARHTTL WCLALLTLFT AQLAQARDLP DFTQLVKDAA PGVVNISTTS TVESRGMAGS PFGSQDVPDI FRHFFGDQMP PMIPGAPGYG GSEERHSLGS GFVISRDGYI MTNAHVVDGA DEIVVRLNDR RELEATLVGA DKKTDVAVLK VDADDLPVLE MGDSDALEVG EWVAAIGSPF GFDHSVTSGI VSAIDRTLPS DAYVPFIQTD VAINPGNSGG PLFNLDGEVV GINSQIYTRS GGFMGVSFAI PINVAMDIAD QLKDSGHVNR GWLGVVIQPV SRDLAESFGL DGPRGALISD VTDDSPASRA GLEAGDVVLS VNDDRVEDSS SLPRLVGRVA PGEDITLTVM RDGERRDLDV TVGSWPDEGK AVTGTSKKDD SQVRLGIAIS DLDEPMRRQL DVDSGVLVRQ VDPRGAAAAA GLSRGDVIVS FNGQDIEDTD ALMEAVKDAP TDRAVPVRIV RDGQSLFVAL RLETEKQE
|
| |