Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1099 |
Symbol | |
ID | 4069559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1376150 |
End bp | 1377247 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983108 |
Product | peptidase M48, Ste24p |
Protein accession | YP_590176 |
Protein GI | 94968128 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000343868 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000201004 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCAAGCTC CCAGAGTTCG TCTGCTGATC ACGCTCGTCT TCTTTGTCAG TTTTTCAGTT GCGCAAACTC TTAACAGTTC CAATGCAAGT GCGGCGCCGG TGGTAGAGGT GTCGAGCGCG AAGTATGCGA AGGCGATGCA GAAGGTAGTG ACAAAGTACG ACGTCACAAA GATCGGCGAA CGCAAAGTTG CGGGCGGGAT GAACTTTGTC TCCATTGAAG CGGAGGCCAG GCTCGGGCGG CAGTTGTCGG GCGAAGCGGA CCGCATGCTG CGGTTGGTGC AGGATCCGGT CATTACTGAG TATGTGAATC GGCTGGGGCA GAACCTGGTG CGCAATTCGG ATGCGAAGGT GCCGTTCACA ATCAAAGTTG TGGATTCGGA AGAGATCAAC GCATTCGCTC TGCCAGGCGG ATATTTCTAC GTGAATACCG GGCTGATTCT TGCCGCGGAT AGCGAAGCGG AGTTGGCGGC GGTGATGTCG CACGAGATTG CGCACGTGGC GGCGCGTCAT GCGACGAAAA ATTTAAGCAA GAGGGAACTG CTGCAGCTGT GCACACTGCC CACTTTCTTT ATCGCTGGGC CGGCGGTGAT CGCGATACGA GAGGCGGCCC AGATCGCGTT ACCGATGACG TACATGAAAT TCTCGCGCGA TGCCGAACGC GAGGCCGATT TGCTTGGCAT GGAGTATGCA TACGCTTCTG GGTACGATCC GCAGGCCATG GTGACGTTCT TCCAGAAGGC GCTGGTCAGG GACCAGAAGA GGCAGCGGTT GATTGCGCGG GCGTATGCGA CGCATCCGAT GACGGCGGAG CGCATGCAGC GCGCGCAGGC GGAAATCCAG ACGTTGTTGC CGCCGAAAGA CAACTACATG CTGACGACCA ACGAATTCGA TGAGATCAAA GCACGTGTGT CGCGACTGGA GAGGAACCAA CTGGTAGCCT GGGCGCCGAG CAGCAAACCA ACGCTGCGGA ACCGGACCGA TGTGGAGAGT GCTCCGGTCA GCAATACACC GACTTTGAGA AAAACTGTTG GTGACGGAAC CAATTTCACC GATGTTAAGA AGGACGTCCG CACGTATGCG GAGCGGCAGT GGAACTAG
|
Protein sequence | MQAPRVRLLI TLVFFVSFSV AQTLNSSNAS AAPVVEVSSA KYAKAMQKVV TKYDVTKIGE RKVAGGMNFV SIEAEARLGR QLSGEADRML RLVQDPVITE YVNRLGQNLV RNSDAKVPFT IKVVDSEEIN AFALPGGYFY VNTGLILAAD SEAELAAVMS HEIAHVAARH ATKNLSKREL LQLCTLPTFF IAGPAVIAIR EAAQIALPMT YMKFSRDAER EADLLGMEYA YASGYDPQAM VTFFQKALVR DQKRQRLIAR AYATHPMTAE RMQRAQAEIQ TLLPPKDNYM LTTNEFDEIK ARVSRLERNQ LVAWAPSSKP TLRNRTDVES APVSNTPTLR KTVGDGTNFT DVKKDVRTYA ERQWN
|
| |