Gene Information |
Locus tag | Acid345_0494 |
Symbol | |
ID | 4068619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 608625 |
End bp | 609770 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982498 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_589573 |
Protein GI | 94967525 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family; [TIGR03723] putative glycoprotease GCP |
|
|
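The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Coordinates as listed in the record.
# Assumption: 1-based, inclusive positions on replicon NC_008009.
START_BP = 608625
END_BP = 609770


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1146 381
```

Under these assumptions the computed values agree with the listed Gene Length (1146 bp) and Protein Length (381 aa).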
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACG CCGTCATCCT GGGAATTGAA AGCTCGTGCG ACGAGACCGC CGCGGCTGTG ATCCGAAACG GCGCAGAAAT CCTCTCCAGC GTAGTGTTCT CGCAGATCTA CACGCATATG CGGTACGGCG GCGTGGTGCC GGAACTGGCC TCGCGCGAGC ACTTGAAGGC TATCGTTCCC GTGGTGCGCC AGGCGGTGGA AGACGCTGGA CAGAGCTATG ACAAGATTGA TGCCATCGCT GTGACACGCG GACCCGGACT GGCCGGAGCG CTGCTGGTGG GCGTGAGTTA TGCGAAGGCG CTGTCATTCG CGCTGGATAA GCCGCTGATC GGCGTGAACC ACCTGGAAGG ACACATTCAC GTGGTGCTGC TGGAACAGAA GCAGCAAGGC GTCGGCGAAA TTCAGTTTCC GGTGCTGGCG CTGGTGGTGA GCGGCGGACA CACGCATCTT TACCTTGCAG AGAAGAAGGA TGCGGGATGG ACGTATCGCG ATGTGGGACA CACGCGCGAC GATGCGGCCG GCGAGGCCTA CGACAAAGTC GCGAAGCTGC TGGGGCTTGG ATATCCCGGG GGGCCGATTC TCGATGGCCT GGCAAAGCAT GGCGATCCCA GGGCGGTGAG GTTTCCGTTC GCGCAGATCA AGCATCGCGA CCGCAATCCG CAGAACCGAC ATGAGGATGA CGATGCGCGA GTGGATTTCT CGTATAGCGG TATCAAGACC GCGGTGCTGC GCTATGTTGA AACGCACGAG ATGAAGGCGG CGATTGAAGC GCGGCGAACG GCGTTGAAGG AAATCGAGAA GCCATCGCAG GACGATTATT TGCGGGTGTG CGATCGGCAG ACGCTCGATC TGATTGCATC GTTTCAGCGC GCGGTGGTGA ATGATCTTGT CTCGAAGGCG CTGCACGCGG CTGCGGAAAA CAATGCAGCA ACGCTCTTGG TGACGGGCGG AGTTGCGGCG AATTCCGAGC TGCGTGAGAC GTTTGAACGA CGTGCCGGCG AACTTGGGTT GCCTGTGTAT TTCCCTTCGC GACCGCTGTC TACGGACAAC GCGGCGATGA TTGCGGCGGC GGCGTATCCG CGGTTTCTGA GCGGAGAATT TGCGGCGCCT GATCTGTCCG CGGAAGCCAA TCTTCGCCTG CGCTAA
|
Protein sequence | MADAVILGIE SSCDETAAAV IRNGAEILSS VVFSQIYTHM RYGGVVPELA SREHLKAIVP VVRQAVEDAG QSYDKIDAIA VTRGPGLAGA LLVGVSYAKA LSFALDKPLI GVNHLEGHIH VVLLEQKQQG VGEIQFPVLA LVVSGGHTHL YLAEKKDAGW TYRDVGHTRD DAAGEAYDKV AKLLGLGYPG GPILDGLAKH GDPRAVRFPF AQIKHRDRNP QNRHEDDDAR VDFSYSGIKT AVLRYVETHE MKAAIEARRT ALKEIEKPSQ DDYLRVCDRQ TLDLIASFQR AVVNDLVSKA LHAAAENNAA TLLVTGGVAA NSELRETFER RAGELGLPVY FPSRPLSTDN AAMIAAAAYP RFLSGEFAAP DLSAEANLRL R
|
| |
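The gene and protein sequences above are shown in space-separated blocks of ten. As a minimal sketch, the snippet below strips that spacing and translates the opening of the gene sequence to compare it with the start of the listed protein. It uses the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in the set of permitted start codons; this gene begins with ATG, so that difference does not matter here). The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
gene_start = "ATGGCCGACG CCGTCATCCT GGGAATTGAA"
print(translate(gene_start))  # MADAVILGIE
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over the whole record.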