Gene Csal_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2232 
Symbol 
ID4026042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2502508 
End bp2503848 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content66% 
IMG OID637967437 
Productmicrocin-processing peptidase 1 
Protein accessionYP_574282 
Protein GI92114354 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.352601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGG CCTTCGATGC CGTAGAGCAG CAAGCACGCT TGGAGGCGCG CGTGGCACAG 
GCCCTGGAAT GGGCCAAGCA GTTGGGTGCC GATGCGTGTG AAGTGGGGGC CAGTGTCGAC
CAGGGCATCG GTGTCAGCGT GCGCCTGGGC GATGTGGAGA GCGTCGAACT GTCACGCGAT
CAGGGCATTG CGGTCACTGT CTATGTCGGA CAGCGCAAGG GTAGCGTGTC GACGTCCGAT
GACAGTGACG AATCGCTGCG CGCGGCGGTC GAGAAGGCCG TGGCGATCGC CGGGTATACC
GGCGAGGATC CCGCGTCCGG CTTGGCCGAT GCGTCGCTGA TGGCCACCGA CCTGCCGGAT
CTCGGGGTGC ACCATCCCTG GCCGCTGAGC ACCGATGACG CCATCGAGCT GGCGCTGGCC
TGCGAAGCCG CCGGGCGCAA TGTCGAGGGC ATCACCAACT CCGATGGGGC CAGCCTTTCC
AGCGGCGAGG GCGTTCGCGT CTATGGCAAC AGTCACGGCT TTCTGGGCAG CCAGCGCGGC
AGCAGGCATT CCTTGTCGTG CATGCTGATC GCCGGCCACG GTGCGGAAAT GCAGCGCGAC
TATGATTACA CCTCGGTGCG CGACCCTGCG GCCATGCTGG CGCCGGAGAC GGTGGGACGC
AACGCCGCCG ACAAGACGCT GGCTCGCCTG GGGGCGAGCT CGCCGGCCAC CGGACGTATG
CCGGTGCTGT TCGCGCCGGA GCTGGCCAGC GGCCTGGTGG GCAACTTTTT GAACGCCATT
GCCGGAGGGG CGTTGTACCG CGAGGCTTCC TTCCTCTGCG ACCGGCTTGG CGAAAGCGTC
TTTCCCGAGT GGTTCTCCTT GCGTGAAAAG CCACGGGAAT ATGGTGCCAT GGCCAGCACG
GCCTTCGACA ACGATGGCGT GGCCACGCGC GACAACGTTT TCATCGACCG GGGGCGCCTG
GCGAGCTACA TGCTGTCGGC GTACAGCGCA CGGCGGCTGG GCATGAGCAC GACCGGCAAT
GCCGGCGGTG CACGCAACCT GCGTATCGAG GCGCCCCTGA TGTCGCGCGA GGCACTCTTG
GCGCGCATGG AGCGCGGTGT GCTGGTCACC GAGCTGATGG GGCAAGGCGT CAATGGCGTG
ACCGGCGACT ATTCACGCGG TGCGGCAGGT TTCTGGGTCG AGAACGGCAA GATTCAGCAT
CCCGTCGAAG AATTCACCAT CGCGGGGAAT CTGCGCGACA TGTTCGCCAA CCTGGAAGGC
GTGGGCAGCG ATACCGACAC GCGTGGCAGC GTGCATACCG GCAGCTGGCT GATCGGTGAC
ATGATGGTCG CCGGCGAGTA A
 
Protein sequence
MSQAFDAVEQ QARLEARVAQ ALEWAKQLGA DACEVGASVD QGIGVSVRLG DVESVELSRD 
QGIAVTVYVG QRKGSVSTSD DSDESLRAAV EKAVAIAGYT GEDPASGLAD ASLMATDLPD
LGVHHPWPLS TDDAIELALA CEAAGRNVEG ITNSDGASLS SGEGVRVYGN SHGFLGSQRG
SRHSLSCMLI AGHGAEMQRD YDYTSVRDPA AMLAPETVGR NAADKTLARL GASSPATGRM
PVLFAPELAS GLVGNFLNAI AGGALYREAS FLCDRLGESV FPEWFSLREK PREYGAMAST
AFDNDGVATR DNVFIDRGRL ASYMLSAYSA RRLGMSTTGN AGGARNLRIE APLMSREALL
ARMERGVLVT ELMGQGVNGV TGDYSRGAAG FWVENGKIQH PVEEFTIAGN LRDMFANLEG
VGSDTDTRGS VHTGSWLIGD MMVAGE