Gene Csal_2537 details


Gene Information

Locus tag: Csal_2537
Symbol:
ID: 4026118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2843238
End bp: 2844284
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 69%
IMG OID: 637967744
Product: peptidase M4, thermolysin
Protein accession: YP_574583
Protein GI: 92114655
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
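
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the page does not state the convention) and that the CDS ends with a single stop codon. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2843238, 2844284)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Both values agree with the Gene Length and Protein Length fields of the record.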


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTCTT CACGAAGGGC GACGTTTTCC GGTTTCATGC CGCCGCACGT GCTCGATCGA 
ATCGCGGTGC AGGGTACCGA GCGCCAGCGG CGGTGTGCTC AGCAGACGTT GCAGGCCGAC
CAGTGGTTTC GTCTACGGGC CTCGCCGCCG CCAGCGCGCG ACGCGGCGCG GGCCGTCGCC
GGGCGGCCCG ACCGCCGCAT CCATTCCGCC GACCACGAAC AGACGCTGCC CGGCCGTCTG
GTGCGGGAAG AAGGCCAGGC CGCGCATGGG GATGCGGCCG TCGACGAGGC CTATGAGTGG
CTGGGTGCGA CCTACCGGTT CTACTGGGAG GTCTTCGGGC GCGACTCCAT CGACGATCGA
GGCATGCCGC TGATCGGCAC CGTGCATTAC GGCCGCGATT ACGACAACGC CTTCTGGAAC
GGTGCGCAGA TGGTCTTCGG TGACGGCGAC GGCGACCTGT TTCGGCGGTT CACGGCGGCG
CCGGAAGTCG TCGCGCACGA GCTGACCCAC GGTGTGATCG AGCGCGATGT GGGGCTGGTC
TACGCCGGCC AGTCCGGGGC GCTCAACGAG TCCCTGGCCG ATGTCTTCGG GGTGGTGGTC
AAGCAGTACC ATGCCGGCCA GACGGCGCAG GAAGCGGACT GGCTCATCGG CGCGGCGTTG
TTGACCGACC GGGTGCAAGG CCGTGCACTA CGCTCCATGG AAGCGCCGGG GACGGCATAC
GATGACCCCG TGCTGGGACG CGATCCGCAA CCGGGCCACA TGCGCGATTT CGTCGACACG
CAGGCCGACA ACGGGGGCGT TCACATCAAT TCCGGCATAC CCAACCGGGC CTTTTACCTG
GCGGCGGTGG CCCTGGAGGC GCCGGCGTGG GAGAGCGTCG CGCCCGTGTG GTATGCGGCG
ATGCGCGATG ACGCCCTGAG CCGGGAATCG GATTTCGCGG CTTTCGCGGC ACTCACCGTG
GCGCATGCCC GGCGCCAGCA TGGAGAGGGA AGTCGCGAGG CGCGTGCGGT GGATGACGCC
TGGCGCGAGG TGGGCGTCGT CTCATGA
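
The GC content field of the record is a percentage of G and C bases in the gene sequence. As a rough illustration of how such a figure can be computed, here is a small Python sketch. The helper name `gc_percent` is hypothetical, and how the source page rounds the value is not stated, so this is only one plausible way to arrive at it.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only; the reported 69% refers to the whole gene.
first_row = "ATGGCGTCTT CACGAAGGGC GACGTTTTCC GGTTTCATGC CGCCGCACGT GCTCGATCGA"
print(round(gc_percent(first_row), 1))
```

Passing the full 1047 bp sequence instead of `first_row` gives the figure for the whole gene.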
 
Protein sequence
MASSRRATFS GFMPPHVLDR IAVQGTERQR RCAQQTLQAD QWFRLRASPP PARDAARAVA 
GRPDRRIHSA DHEQTLPGRL VREEGQAAHG DAAVDEAYEW LGATYRFYWE VFGRDSIDDR
GMPLIGTVHY GRDYDNAFWN GAQMVFGDGD GDLFRRFTAA PEVVAHELTH GVIERDVGLV
YAGQSGALNE SLADVFGVVV KQYHAGQTAQ EADWLIGAAL LTDRVQGRAL RSMEAPGTAY
DDPVLGRDPQ PGHMRDFVDT QADNGGVHIN SGIPNRAFYL AAVALEAPAW ESVAPVWYAA
MRDDALSRES DFAAFAALTV AHARRQHGEG SREARAVDDA WREVGVVS
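
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal Python illustration, not the tool used to produce this record. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence shown above.
print(translate("ATGGCGTCTT CACGAAGGGC GACGTTTTCC"))  # MASSRRATFS
print(translate("TGGCGCGAGG TGGGCGTCGT CTCATGA"))     # WREVGVVS
```

The two fragments reproduce the first ten and last eight residues of the protein sequence. The second fragment starts at position 1021, which is in frame because 1020 is a multiple of 3.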