Gene Csal_2211 details


Gene Information

Locus tag: Csal_2211
Symbol:
ID: 4026403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2485616
End bp: 2486854
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 66%
IMG OID: 637967416
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_574261
Protein GI: 92114333
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DegQ family
            [TIGR02038] periplasmic serine peptidase DegS
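
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, if has_stop is True,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Values taken from the record above.
length = gene_length(2485616, 2486854)
print(length)                  # 1239
print(protein_length(length))  # 412
```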


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCGCT CACTGATGTC GCTCGTCTGG CCCACCATCA GTGGCATCCT GCTTGCCATC 
GTGCTGCTCA ACGCCTTCCC CCAGCTGCTT GGCGGACGCA ACGCCGACGA CACCGTGACG
CCCGTCCCCG ATACCAGCGC CGTGCGCGCC ATTGCCGAAC GCACCGATAC CGCCGCCCGT
GCCCCGGAAG TCGTCGAGGC CGCCCCGCTC GACCGGGCAC AAGGACCGGC CAGCTATGCC
ACCGCGGTGG ACAAGGCCGC CCCGGCCGTC GTCAATATCT ACTCGTCGCG CATGGTCGAT
CCCAGCGAGC ACCCCCTGAT GTCCGACCCG TTCTTCGAGC AATTCTTCGG CAAGGACATG
CCGCAACGTC AGCGCATGCT GTCCAGTCTG GGTTCCGGCG TCATCGTGAG TCCCGAGGGC
TACGTCCTCA CGAACAACCA CGTCATTCGC AACGCCGACC AGATCCAGGT CGCCCTGCGC
GACGGCCGCG AGACCCTCGC CGAGGTCGTC GGCACCGATC CCGAAAGCGA CCTCGCGGTG
CTCAAGATTC CCGTCGACAA CCTGCCGGTC ATCGAGCTCA GCGACTCGGA GCAGGTCGCC
GTAGGGGACG TCAGCCTGGC CATCGGCAAC CCCTTCGGCG TGGGACAGAC CGTGACCATG
GGAATCATCA GCGCGACCGG ACGCAACCAT CTGGGGCTCA ACGCCTACGA GGACTTCATT
CAAACCGATG CCGCCATCAA CCCGGGCAAC TCCGGTGGCG CCCTGGTTAA TGCCGAAGGC
GCTCTGGTGG GCATCAACAC CGCGATCTTC TCCCGCTCGG GAGGCTCGCA AGGCATCGGT
TTCGCGATTC CCGCCAACCT CGCCCATCAG GTACTCGATC AGATCGTCGC GCATGGACGT
GTCATTCGCG GCTGGCTGGG CATCGACGTC CAGGCGATGA CTCCCGACCT TGCCACCTCT
TTCGGCCTGA AGACCCTCAA GGGCGTGGTC ATCGCCAATG TCGTGCCCGG CGGCCCCGGC
GAAAAGGCAG GCCTGCAGCC CGGAGACGTG CTGATGTCCG TCAACGGCAA GATCATCGTC
GACGCTCGCG AGGCGATGGC CGATATCGCG GAGATCTCGC CCGGCACCTC GCTCCCCGTC
ACCATCGTGC GCGATGGCGA AAAGCGTGAG GTGACGCTGA CGGTCGGCGA GCGCCCCCAG
GCAGCGCAGC GTCAACCAAC CGCTCCCTCT TCTGAATGA
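
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis such as recomputing the GC content reported in the gene information. The following is a minimal sketch with hypothetical helper names; it is demonstrated only on the first two blocks of the sequence, not on the full gene.

```python
def clean_sequence(raw):
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGCGCCGCT CACTGATGTC"
print(clean_sequence(first_blocks))  # ATGCGCCGCTCACTGATGTC
print(gc_percent(first_blocks))      # 60.0
```

Passing the full 1239 bp sequence to `gc_percent` would give the gene-wide value to compare against the listed GC content.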
 
Protein sequence
MRRSLMSLVW PTISGILLAI VLLNAFPQLL GGRNADDTVT PVPDTSAVRA IAERTDTAAR 
APEVVEAAPL DRAQGPASYA TAVDKAAPAV VNIYSSRMVD PSEHPLMSDP FFEQFFGKDM
PQRQRMLSSL GSGVIVSPEG YVLTNNHVIR NADQIQVALR DGRETLAEVV GTDPESDLAV
LKIPVDNLPV IELSDSEQVA VGDVSLAIGN PFGVGQTVTM GIISATGRNH LGLNAYEDFI
QTDAAINPGN SGGALVNAEG ALVGINTAIF SRSGGSQGIG FAIPANLAHQ VLDQIVAHGR
VIRGWLGIDV QAMTPDLATS FGLKTLKGVV IANVVPGGPG EKAGLQPGDV LMSVNGKIIV
DAREAMADIA EISPGTSLPV TIVRDGEKRE VTLTVGERPQ AAQRQPTAPS SE
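
The protein sequence above corresponds to translating the gene sequence codon by codon. The following is a minimal sketch, assuming the gene starts with ATG so that the amino acid assignments of translation table 11 coincide with the standard code (table 11 differs from the standard table in its permitted start codons, which this sketch does not handle). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA sequence, stopping at the first stop codon.

    Whitespace is ignored so block-formatted sequences can be passed directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last four codons of the gene sequence above.
print(translate("ATGCGCCGCT CACTGATGTC GCTCGTCTGG"))  # MRRSLMSLVW
print(translate("TCTTCTGAATGA"))                      # SSE
```

Both outputs match the start (MRRSLMSLVW) and the end (SSE) of the listed protein sequence.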