Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2232 |
Symbol | |
ID | 4026042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2502508 |
End bp | 2503848 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637967437 |
Product | microcin-processing peptidase 1 |
Protein accession | YP_574282 |
Protein GI | 92114354 |
COG category | [R] General function prediction only |
COG ID | [COG0312] Predicted Zn-dependent proteases and their inactivated homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.352601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGG CCTTCGATGC CGTAGAGCAG CAAGCACGCT TGGAGGCGCG CGTGGCACAG GCCCTGGAAT GGGCCAAGCA GTTGGGTGCC GATGCGTGTG AAGTGGGGGC CAGTGTCGAC CAGGGCATCG GTGTCAGCGT GCGCCTGGGC GATGTGGAGA GCGTCGAACT GTCACGCGAT CAGGGCATTG CGGTCACTGT CTATGTCGGA CAGCGCAAGG GTAGCGTGTC GACGTCCGAT GACAGTGACG AATCGCTGCG CGCGGCGGTC GAGAAGGCCG TGGCGATCGC CGGGTATACC GGCGAGGATC CCGCGTCCGG CTTGGCCGAT GCGTCGCTGA TGGCCACCGA CCTGCCGGAT CTCGGGGTGC ACCATCCCTG GCCGCTGAGC ACCGATGACG CCATCGAGCT GGCGCTGGCC TGCGAAGCCG CCGGGCGCAA TGTCGAGGGC ATCACCAACT CCGATGGGGC CAGCCTTTCC AGCGGCGAGG GCGTTCGCGT CTATGGCAAC AGTCACGGCT TTCTGGGCAG CCAGCGCGGC AGCAGGCATT CCTTGTCGTG CATGCTGATC GCCGGCCACG GTGCGGAAAT GCAGCGCGAC TATGATTACA CCTCGGTGCG CGACCCTGCG GCCATGCTGG CGCCGGAGAC GGTGGGACGC AACGCCGCCG ACAAGACGCT GGCTCGCCTG GGGGCGAGCT CGCCGGCCAC CGGACGTATG CCGGTGCTGT TCGCGCCGGA GCTGGCCAGC GGCCTGGTGG GCAACTTTTT GAACGCCATT GCCGGAGGGG CGTTGTACCG CGAGGCTTCC TTCCTCTGCG ACCGGCTTGG CGAAAGCGTC TTTCCCGAGT GGTTCTCCTT GCGTGAAAAG CCACGGGAAT ATGGTGCCAT GGCCAGCACG GCCTTCGACA ACGATGGCGT GGCCACGCGC GACAACGTTT TCATCGACCG GGGGCGCCTG GCGAGCTACA TGCTGTCGGC GTACAGCGCA CGGCGGCTGG GCATGAGCAC GACCGGCAAT GCCGGCGGTG CACGCAACCT GCGTATCGAG GCGCCCCTGA TGTCGCGCGA GGCACTCTTG GCGCGCATGG AGCGCGGTGT GCTGGTCACC GAGCTGATGG GGCAAGGCGT CAATGGCGTG ACCGGCGACT ATTCACGCGG TGCGGCAGGT TTCTGGGTCG AGAACGGCAA GATTCAGCAT CCCGTCGAAG AATTCACCAT CGCGGGGAAT CTGCGCGACA TGTTCGCCAA CCTGGAAGGC GTGGGCAGCG ATACCGACAC GCGTGGCAGC GTGCATACCG GCAGCTGGCT GATCGGTGAC ATGATGGTCG CCGGCGAGTA A
|
Protein sequence | MSQAFDAVEQ QARLEARVAQ ALEWAKQLGA DACEVGASVD QGIGVSVRLG DVESVELSRD QGIAVTVYVG QRKGSVSTSD DSDESLRAAV EKAVAIAGYT GEDPASGLAD ASLMATDLPD LGVHHPWPLS TDDAIELALA CEAAGRNVEG ITNSDGASLS SGEGVRVYGN SHGFLGSQRG SRHSLSCMLI AGHGAEMQRD YDYTSVRDPA AMLAPETVGR NAADKTLARL GASSPATGRM PVLFAPELAS GLVGNFLNAI AGGALYREAS FLCDRLGESV FPEWFSLREK PREYGAMAST AFDNDGVATR DNVFIDRGRL ASYMLSAYSA RRLGMSTTGN AGGARNLRIE APLMSREALL ARMERGVLVT ELMGQGVNGV TGDYSRGAAG FWVENGKIQH PVEEFTIAGN LRDMFANLEG VGSDTDTRGS VHTGSWLIGD MMVAGE
|
| |