Gene Csal_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0044 
Symbol 
ID4027223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp52018 
End bp53340 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content68% 
IMG OID637965196 
Productcarboxyl-terminal protease 
Protein accessionYP_572108 
Protein GI92112180 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCAA GCCAATCCCG TTCCCCGCGC CTCGTCGTGC GCCATTGCCT GCACCTGGGC 
ATGGCCCTGG CGATCGGTGC GCTGACCCTG CCGCTGCCCG CGCATGCCCA ACAGCCGGCC
GGCGACGACG CCCTGCCCGT GGAAGACGTG CAAACCTTCG CCGAGGTCTT CGAGCGCATC
AAGCGGGCCT ATGTCGACGA GGTCGACGAC ACCACGCTGA TGCGCAACGC CATGCGCGGC
ATGCTCGGCG AGCTCGACCC GCATTCCGCC TATCTCGATG CGGAGTCCTT CGAGGCACTC
CGCGAAACCA CCGAGGGCGA GTTCAGCGGC GTGGGCATCG AGGTCGGCAT GCAGGAAGGC
CAGTTGACGA TCATCGCCCC CATCGACGAC AGCCCCGCCG CACGTGCCGG ACTCCAGGCA
CAGGACGTCA TCCTGCGCAT CGACGACACG CCCACCGAGA GCCTGTCGCT GCAGGAGGCC
GTGGAAATGA TGCGCGGCGA CGAAGGCGAG GAGATTCGCC TGACCATCCT GCGCGAGGGC
GAGGAAGCCC CGCGCGAGGT CACGCTGACC CGCGAGACGA TCCGCACCGA CAGCGTCAAG
CATGAGATGC TGTCGCCGGG CTACGGCTAC CTGCGCATCA GCCAGTTCCA GAGCCGCACC
GGCGAACAGG CCCGCGATGC CATCGCCGCA CTGCGCGAGG AAGGCGACGG CAATCTCAAG
GGCCTGGTGC TGGACCTGCG CAACAATCCC GGCGGCGTGC TCGACAGTGC CGTCGATGTC
GCCGACCTGT TCCTCGACAG CGGGCTGATC GTCTATACCG AAGGCCGCCT GGCAGACAGC
GACATGCGCT TCTCGGCCTC TCCCCAGACC AGCGCCCCGG ACGTACCCAT GGTCGTGCTG
ATCAACGGCG GCAGTGCCTC GGCGGCGGAG ATCGTCGCCG GTGCCCTGCA GGACCAGCAA
CGCGCCGTGC TGATGGGCAC CGAAAGCTTC GGCAAGGGCT CCGTGCAGCA GGTGCTGCCG
CTCAACAACG GCGACGGCCT GAAGCTGACC ACCGCGCTCT ACTACACGCC GGACGGCCGC
TCGATCCAGG CTCAGGGCAT CGCCCCGGAC GTCGAAGTCG TGCGCGGTCG CCTCGAGGTC
GCCGAAGCCA CCGGCCTGAG CATCCGCGAG TCGGATCTCG AGAATCACCT GCGCAACATC
AACGGCGAGC GGGAACGCAC CGAGCGCGAA AGCTCCCTCG CCGAAAGCGA CTACCAGCTC
GGCGAAGCCC TCAACCTGCT CAAGGCCCTC AACGTCCTGC CCCGTGCCCA GAGCGGCAAC
TGA
 
Protein sequence
MTASQSRSPR LVVRHCLHLG MALAIGALTL PLPAHAQQPA GDDALPVEDV QTFAEVFERI 
KRAYVDEVDD TTLMRNAMRG MLGELDPHSA YLDAESFEAL RETTEGEFSG VGIEVGMQEG
QLTIIAPIDD SPAARAGLQA QDVILRIDDT PTESLSLQEA VEMMRGDEGE EIRLTILREG
EEAPREVTLT RETIRTDSVK HEMLSPGYGY LRISQFQSRT GEQARDAIAA LREEGDGNLK
GLVLDLRNNP GGVLDSAVDV ADLFLDSGLI VYTEGRLADS DMRFSASPQT SAPDVPMVVL
INGGSASAAE IVAGALQDQQ RAVLMGTESF GKGSVQQVLP LNNGDGLKLT TALYYTPDGR
SIQAQGIAPD VEVVRGRLEV AEATGLSIRE SDLENHLRNI NGERERTERE SSLAESDYQL
GEALNLLKAL NVLPRAQSGN