Gene Csal_1628 details


Gene Information

Locus tag: Csal_1628
Symbol:
ID: 4028232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1849516
End bp: 1850952
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 66%
IMG OID: 637966817
Product: peptidase S1C, Do
Protein accession: YP_573680
Protein GI: 92113752
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
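
The start and end coordinates, gene length, and protein length listed above agree with one another, which a little arithmetic confirms. The following is a minimal sketch in Python, not part of the original record. It assumes that the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 1849516
end_bp = 1850952

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1437
print(protein_length_aa)  # 478
```

Both results match the Gene Length and Protein Length fields in the record.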


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATATA TCGCCCGTCA TACCACGCTC TGGTGTCTTG CCCTGCTGAC GCTTTTCACC 
GCTCAGCTCG CTCAGGCACG CGACCTGCCG GACTTCACCC AGCTGGTGAA GGATGCCGCC
CCCGGGGTCG TCAACATCTC CACCACCAGT ACCGTCGAAT CCCGGGGCAT GGCCGGATCG
CCGTTCGGCA GTCAGGACGT GCCCGACATC TTCCGGCACT TCTTCGGCGA TCAGATGCCG
CCGATGATAC CCGGGGCGCC GGGCTACGGC GGCAGTGAAG AGCGCCACTC CCTCGGCTCC
GGTTTCGTCA TCAGCCGCGA CGGCTACATC ATGACCAACG CCCACGTGGT CGACGGTGCC
GACGAGATCG TGGTGCGCCT CAACGACCGT CGCGAGCTCG AGGCCACGCT CGTGGGGGCC
GACAAGAAAA CCGACGTCGC CGTGCTCAAG GTCGATGCCG ACGACTTGCC GGTACTCGAG
ATGGGAGACT CCGACGCTCT GGAAGTCGGT GAATGGGTCG CCGCCATCGG CTCGCCCTTC
GGCTTCGATC ATTCGGTGAC CTCGGGCATC GTCAGCGCCA TCGACCGTAC GCTGCCCAGC
GACGCCTATG TGCCGTTCAT CCAGACCGAC GTGGCGATCA ACCCCGGCAA TTCCGGCGGT
CCGCTGTTCA ACCTCGATGG CGAGGTCGTG GGTATCAACT CGCAGATCTA TACCCGCAGT
GGCGGCTTCA TGGGGGTGTC CTTCGCCATT CCCATCAACG TGGCGATGGA TATCGCCGAT
CAGTTGAAGG ACTCCGGGCA TGTCAATCGC GGCTGGCTCG GCGTGGTGAT TCAGCCGGTA
TCCCGTGACC TGGCGGAATC CTTCGGGCTC GACGGTCCGC GCGGCGCGCT GATTTCCGAT
GTCACCGACG ACAGCCCGGC AAGCCGTGCG GGACTCGAGG CGGGCGACGT GGTGCTGTCC
GTCAACGACG ATCGCGTCGA GGATTCCAGC TCGTTGCCGC GTCTGGTGGG GCGTGTCGCG
CCGGGCGAGG ACATCACGCT GACGGTGATG CGCGATGGCG AACGCCGCGA TCTCGACGTG
ACCGTGGGCA GCTGGCCCGA CGAGGGCAAG GCCGTGACGG GGACGTCGAA GAAGGACGAC
AGCCAGGTGC GGCTCGGCAT CGCCATCAGC GATCTGGATG AACCGATGCG CCGGCAACTG
GACGTCGACA GCGGCGTGCT GGTGCGCCAG GTCGACCCGC GTGGTGCGGC CGCCGCCGCC
GGCCTGTCAC GCGGCGATGT CATCGTCAGC TTCAACGGCC AGGACATCGA GGATACCGAT
GCCCTGATGG AGGCGGTGAA GGACGCGCCC ACCGATCGTG CCGTGCCCGT GCGCATCGTG
CGCGACGGCC AATCGTTGTT CGTGGCGCTG CGCCTGGAGA CCGAAAAGCA GGAGTGA
 
Protein sequence
MKYIARHTTL WCLALLTLFT AQLAQARDLP DFTQLVKDAA PGVVNISTTS TVESRGMAGS 
PFGSQDVPDI FRHFFGDQMP PMIPGAPGYG GSEERHSLGS GFVISRDGYI MTNAHVVDGA
DEIVVRLNDR RELEATLVGA DKKTDVAVLK VDADDLPVLE MGDSDALEVG EWVAAIGSPF
GFDHSVTSGI VSAIDRTLPS DAYVPFIQTD VAINPGNSGG PLFNLDGEVV GINSQIYTRS
GGFMGVSFAI PINVAMDIAD QLKDSGHVNR GWLGVVIQPV SRDLAESFGL DGPRGALISD
VTDDSPASRA GLEAGDVVLS VNDDRVEDSS SLPRLVGRVA PGEDITLTVM RDGERRDLDV
TVGSWPDEGK AVTGTSKKDD SQVRLGIAIS DLDEPMRRQL DVDSGVLVRQ VDPRGAAAAA
GLSRGDVIVS FNGQDIEDTD ALMEAVKDAP TDRAVPVRIV RDGQSLFVAL RLETEKQE
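
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As an illustration only, the sketch below translates a short stretch of DNA in Python. The `translate` helper and the `CODON_TABLE` name are hypothetical and do not come from the database. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons (the tables differ only in permitted start codons). The function ignores the whitespace that separates the 10-base blocks and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGAAATATA TCGCCCGTCA"))  # MKYIAR
```

The output matches the first six residues of the protein sequence above. Passing the full gene sequence to the same function should reproduce the listed protein, under the assumptions stated.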