Gene Information |
Locus tag | Sare_4241 |
Symbol | |
ID | 5708091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4814232 |
End bp | 4815296 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641273660 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001539013 |
Protein GI | 159039760 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
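The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(4814232, 4815296)
print(length, protein_length(length))  # 1065 354
```

Under those assumptions the result matches the Gene Length (1065 bp) and Protein Length (354 aa) rows.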
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0825162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0152473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACG AACCCCTGGT CCTGGGAATC GAGACGTCCT GCGACGAGAC CGGGGTCGGT GTCGTCCAGG GCCACACCCT GCTCGCCGAC GCGTTGGCCA GCAGCGTCGA GCAGCATGCC CGGTTCGGCG GTGTGGTGCC CGAGGTGGCC AGCCGGGCAC ACCTGGAGGC GATCGTGCCG ACCATGGACC GGGCGTTGGC GGAGGCGGGG GTGACGCTCG CCGATGTCGA CGCCATCGCG GTAACCTCCG GCCCCGGGCT GGCCGGCGCG CTGTTGGTCG GCGTCGCCGC GGCCAAGGGG TACGCGGTCG CCGCCGAGAA GCCGGTCTAC GGCGTGAACC ATCTCGCGGC GCATGTCGCC GTGGACACCC TGGAACACGG GCCGCTGCCG GAACCGGCGA TTGCCCTGCT GGTCTCGGGC GGGCACTCGT CGCTGCTACT GGTCGACGAC CTGGCCCACG GTGTCACCCC GCTCGGCGCC ACGATCGACG ACGCGGCCGG CGAGGCGTTC GACAAGGTCG CCCGGCTGCT CGGGCTGCCC TTCCCGGGCG GCCCGTACAT TGACCGGGAG GCTCGGGCCG GTGACCGGGC GGCCATCGCG TTTCCACGCG GGCTGACCGC GGCAAAGGAC CAGGCGGCGC ACCGCTACGA CTTCTCCTTC TCCGGGTTGA AGACCGCGGT GGCGCGTTGG GTGGAGAGTC GGCAGCGGGC CGGTGAGGTG GTGCCGGTTG CCGATGTCGC CGCTTCCTTC CAGGAGGCGG TTTGTGACGT ACTGGTCGGG AAGGCACTGG ACGCCTGCCG GTCGAGTGGG ATACAGACCC TCGTGATCGG CGGCGGAGTG GCGGCCAACT CGCGGCTGCG GGCGATGGCC GAGCAGCGCG CGGCGACGTA CGACGTCCAG GTACGAACAC CCCGACCGAC GTTGTGTACG GACAACGGCG CGATGGTCGC CGCACTCGGC TCGCACCTGG TCGCCGCCGG TGTCGCGCCG AGCAGCCTGG ACCTACCCGC CGATTCGGCG ATGCCACTGA CGACGGTCAG TGTTACAGGG GAGGAGCGGA CATGA
|
Protein sequence | MADEPLVLGI ETSCDETGVG VVQGHTLLAD ALASSVEQHA RFGGVVPEVA SRAHLEAIVP TMDRALAEAG VTLADVDAIA VTSGPGLAGA LLVGVAAAKG YAVAAEKPVY GVNHLAAHVA VDTLEHGPLP EPAIALLVSG GHSSLLLVDD LAHGVTPLGA TIDDAAGEAF DKVARLLGLP FPGGPYIDRE ARAGDRAAIA FPRGLTAAKD QAAHRYDFSF SGLKTAVARW VESRQRAGEV VPVADVAASF QEAVCDVLVG KALDACRSSG IQTLVIGGGV AANSRLRAMA EQRAATYDVQ VRTPRPTLCT DNGAMVAALG SHLVAAGVAP SSLDLPADSA MPLTTVSVTG EERT
|
| |
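The sequences above are printed in space-separated groups of ten, so they usually need to be joined before further processing. The following is a minimal sketch of that step plus a GC-fraction calculation; the helper names are hypothetical, and the example input is only the first three groups of the gene sequence, so its GC fraction is not expected to equal the 71% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join a whitespace-grouped sequence into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three groups of the gene sequence in this record (illustration only).
first_groups = "ATGGCTGACG AACCCCTGGT CCTGGGAATC"
seq = clean_sequence(first_groups)
print(len(seq), seq.startswith("ATG"), round(gc_fraction(seq), 2))
```

Although the gene lies on the minus strand, the listed sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation and the sketch does not reverse-complement it.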