Gene Csal_2234 details

Gene Information

Locus tag: Csal_2234
Symbol:
ID: 4026044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2504504
End bp: 2505958
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 67%
IMG OID: 637967439
Product: microcin-processing peptidase 2
Protein accession: YP_574284
Protein GI: 92114356
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
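
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2504504
END_BP = 2505958
REPORTED_GENE_LENGTH = 1455      # bp
REPORTED_PROTEIN_LENGTH = 484    # aa


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count, assuming the final codon is a stop codon."""
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))
```

Under those assumptions the computed values agree with the reported gene length (1455 bp) and protein length (484 aa).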


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCAC TCAACGATAG CGCCTTGAAT CAGGCGACCT CATTGCTCCT CGCGCCGGGC 
GGTCTCGACC TGTCGGCGCT GGATACCGGC CTGGGCCACG CCATGGGCCC GGGCGTCGAT
TACGCCGACC TCTATTTCCA ACGTACCTGG CAGGAAAGCT GGATACTCGA GGATGGCGAA
GTCAAGGACG CCGGTTACAA CATCGACGGC GGTGTCGGGG TACGTACCCT GGCCGGAGAG
AAGACCGGCT TCGCCTACTC GAACCAGATC AGTGCCGATG CGCTGGCCGA TACCGGCCGT
ATCGCCGCCG GGATTGCGCG TAGCGGGCAG CGAGTGTCGC CGCAGGCGGT GCATGCTGCG
ATGCCGACGC GGCGCTATGC CGATGTCGAT CCGCTGGCCG GACTGTCGGC CGCCGACAAG
ATCGCCATGC TGAAGACGGC GGATCGGGTG GCACGCGCCG CCGATCCGTG CGTCTCCCAG
GTCAGCGCGT CGCTGTCGGG CGAGTACGAA GTGGTGCTGG TACGCGCCAG CGACGGCACT
CTGGCCGCCG ACATCCGGCC GCTGGTGCGC TTCAACGTCA GCGTCATCGC CGTTCGCGAC
GGCCGCCGCG AGCGAGGCAG CGCCGGCGGC GGGGGGCGCT ACTCGATGGC ACGCCTGCGC
GACGACAACG TCGCCGAACG CTTTGCCAAG GAAGCCGTGC GTCAGGCGCT GGTCAACCTG
GAGGCGGTCG ACGCCCCCGC CGGGCAGATG CCCGTCGTGC TGGGCAATGG CTGGCCGGGG
ATCCTGCTCC ACGAAGCGGT GGGCCACGGT CTCGAGGGCG ACTTCAATCG CAAGGGCAGT
TCGGCGTTCG CCGGGCGCAT CGGCGAGCGT GTGGCATCCC CCGGCGTCAC GGTGGTCGAC
GACGGCACCT TGGCCGACCG GCGCGGTTCG TTGAGCATCG ATGACGAGGG GACACCCACG
CAGTGCAACA CCCTGATCGA GGACGGCATC CTGAAGGGAT ACATGCAGGA CAAGCTCAAC
GCGCGCTTGA TGGGCATGGC GCCCACCGGC AATGCGCGTC GCGAATCCTA TGCCCATGCC
ACCATGCCGC GCATGACCAA TACCTGCATG CTGGCCGGCC AGGACGACCC CGATGACATC
GTCAAGAGCG TCAAGCGGGG CATCTACGCG GTGAGCTTCG GAGGTGGCCA GGTGGACATT
ACCTCGGGGC GTTTCGTGTT CTCCGCCTCC GAAGCCTATC TGATCGAGGA CGGCAAGATC
ACCACGCCGG TCAAGGGCGC CACGCTGATC GGCAACGGCC CCGAGGTGAT GCAGCGGGTT
TCGATGATCG GCCACGACCT CGAGCTGGAT ACCGGCATCG GGGTCTGCGG CAAGGAAGGG
CAGGGTGTGC CGGTTGGCGT GGGCCAACCG ACACTCAAGG TCGATGAACT GACCGTAGGC
GGCACGCAGT CCTGA
 
Protein sequence
MNALNDSALN QATSLLLAPG GLDLSALDTG LGHAMGPGVD YADLYFQRTW QESWILEDGE 
VKDAGYNIDG GVGVRTLAGE KTGFAYSNQI SADALADTGR IAAGIARSGQ RVSPQAVHAA
MPTRRYADVD PLAGLSAADK IAMLKTADRV ARAADPCVSQ VSASLSGEYE VVLVRASDGT
LAADIRPLVR FNVSVIAVRD GRRERGSAGG GGRYSMARLR DDNVAERFAK EAVRQALVNL
EAVDAPAGQM PVVLGNGWPG ILLHEAVGHG LEGDFNRKGS SAFAGRIGER VASPGVTVVD
DGTLADRRGS LSIDDEGTPT QCNTLIEDGI LKGYMQDKLN ARLMGMAPTG NARRESYAHA
TMPRMTNTCM LAGQDDPDDI VKSVKRGIYA VSFGGGQVDI TSGRFVFSAS EAYLIEDGKI
TTPVKGATLI GNGPEVMQRV SMIGHDLELD TGIGVCGKEG QGVPVGVGQP TLKVDELTVG
GTQS
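
The sequences above are laid out in space-separated blocks of ten characters. As a rough sketch of how such a listing might be processed, the code below strips the whitespace, computes the G+C fraction, and translates the nucleotides into protein. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares. Ambiguous bases such as N are not handled.

```python
# Standard codon table built in TCAG order; translation table 11 uses the
# same codon-to-amino-acid assignments (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listing above.
first_line = "ATGAACGCAC TCAACGATAG CGCCTTGAAT CAGGCGACCT CATTGCTCCT CGCGCCGGGC"
print(translate(first_line))  # MNALNDSALNQATSLLLAPG
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the reported GC content.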