Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2820 |
Symbol | |
ID | 4028505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3148161 |
End bp | 3149606 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637968027 |
Product | peptidase M48, Ste24p |
Protein accession | YP_574865 |
Protein GI | 92114937 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACGCC GCTGGAATCT GCTTCCCGTT CTTTGTCTCG CCCTGGCCGC GTGCGTGGTC ACCCCCGTTG CCATGGCGCA GGACGATTAC GGGCTGCCCG AGCTCGGCGG GGCCAGCGCG AGCGTTTCCA GCGGCGAGGA GTATCGCCTC GGTCGCGCCT GGCTGCGCCA GTTTCGTGCC CAGACCGACG AGTGGGAAGA CCCCATCGCC CGTGACTACA TTCATGGCCT GGTCGCCCGC CTGCTGCCCT ACACCGACGT TCACGACCCT GTCATCGTCA CCCTGGTCGA CAGTCGTCTC CTCAACGCCT TCGCCGTGCC GGGCGGCGTG ATGGGGGTCA ACAGCGGCCT GTTCACCTTC GCCGACAGCG AAGACACTCT CGCTTCGGTC ATCGCCCACG AACTCGGCCA TCTCTCGCAG CACCACTATG CACGGCGCAT GCAGCGCGTG GAAGAAACCC AGTTGCCGAC CATGGCGGCG ATGCTGGCCG GCATGGTGCT GGCCGCCGGC GGCGCGGGCG ACGCCGGCCT GGCCACCATG GTCGGGTCCC AGGCGGCCTT CATCCAGGAC CAGCTCGCCT ACTCGCGCCG CTTCGAGCAG GAGGCCGACC GCATCGGCCT GGACGCCATG GCCGATGCCG GCTTCGACCC GCAGGCCATG CCGGAAATGT TCCGCGCCAT GCAGCACCTG GCCAGCCTGC AGGGCGGCAA CCCGCCGGAG TTTCTGCTCA CCCACCCGGT GACCGAATCG CGCATCAGCG ACACCCAGAC GCGCGCCAAC CAGCTGCCAT CGCCCGCACC GCATACCAGC AAGGTGTTCG CGATGATTCG CGGCCGCGCG CTGCTCTCGC TCCATCGCAG CGACCCTGAA CAGGCCATGA CCCGGCTGCG CCAGGACGAT CCGCCCGAAG CGGCGGTCCG CTACCTGCAG GCCTTGATCG ACGCCCAGCG CGGCAATACC GCCAAGGCCC TCGCCACCCT CGACGCCCTG AGCGAGGCCC AGCCGGACCT TTCCATGCTG CCCGCCAGCG CCGCCGAGGT CGCCCTCGAC GCCGGACAGC GCGACGACGC CCTGCGTCGC GCCCGGCGCA TTCTCCGCCT GCAGCCGGAC TACTACCCGG CACAGCGCAT CGAGGCCGAG GTCCTGCTGC AGCAGGCGCC GGACCAGGCC TTCAATGTCC TGCGCGACAT GAGCGATCAG TACCCCGAGG ACCCGCACGT CTTCGCCCTG CTCGCCGAAG CCGCCGGACG CAGCGGCCGC GACCTGTGGG GCATGCTCGC GCGTGCCGAG CATTTGCAGC TCACCGGCCA TATCGACCGC GCCATCAAGC AGATCGACAT CGCCGAGCAG ACCGCCCGCG ACCGAGGCGA CTTCGCCATG GCCAGCCGGC TCGGCGAACG TCGACAGGCG TATCTGGGAT ATCGACAGAC GCTGCGCAAT TTCTAG
|
Protein sequence | MPRRWNLLPV LCLALAACVV TPVAMAQDDY GLPELGGASA SVSSGEEYRL GRAWLRQFRA QTDEWEDPIA RDYIHGLVAR LLPYTDVHDP VIVTLVDSRL LNAFAVPGGV MGVNSGLFTF ADSEDTLASV IAHELGHLSQ HHYARRMQRV EETQLPTMAA MLAGMVLAAG GAGDAGLATM VGSQAAFIQD QLAYSRRFEQ EADRIGLDAM ADAGFDPQAM PEMFRAMQHL ASLQGGNPPE FLLTHPVTES RISDTQTRAN QLPSPAPHTS KVFAMIRGRA LLSLHRSDPE QAMTRLRQDD PPEAAVRYLQ ALIDAQRGNT AKALATLDAL SEAQPDLSML PASAAEVALD AGQRDDALRR ARRILRLQPD YYPAQRIEAE VLLQQAPDQA FNVLRDMSDQ YPEDPHVFAL LAEAAGRSGR DLWGMLARAE HLQLTGHIDR AIKQIDIAEQ TARDRGDFAM ASRLGERRQA YLGYRQTLRN F
|
| |