Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1420 |
Symbol | |
ID | 4068799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1719789 |
End bp | 1721129 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983429 |
Product | peptidase M50, putative membrane-associated zinc metallopeptidase |
Protein accession | YP_590496 |
Protein GI | 94968448 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0750] Predicted membrane-associated Zn-dependent proteases 1 |
TIGRFAM ID | [TIGR00054] RIP metalloprotease RseP |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.130402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGGTT TTCTCATAGC GATTGTGTCG ATCGTGTTCG TACTCGGGGT GTTGGTTCTC GTTCACGAAT TCGGTCATTT TGCCGCTGCC AAATTGTTTG GCGTTCGCGT GGAGACATTC TCCATCGGGT TCGGAAAGCG GCTCGTCGGT TTCCGGCGCG GTGAGACCGA TTACCGCATC AGCGCTCTCC CGCTCGGCGG GTACGTGAAG ATGACTGGCG AAACTCCCCT GGACAGCCGT ACCGGCGCGC CCGAGGAGTT CATGTCTCAT CCGCGATGGC AGCGGATCAT CATCGCCCTC GCCGGCCCGT TCATGAATAT CGCGCTCGCC ATCGGACTGC TGACCGTGGT CTACATGGTG CACGACGAAG AACCGGCATT CTGGGGAGAA AAAGCGACGG TGGGGTTCGT TGCTCCGGGT AGCACCGCGG ACAAGGTCGG CGTGAAAGCC GGCGACACGA TCGTAAAAAT CGCCAACGTC GATAATCCGA CTTGGGAAGA CGTTTATCTC CAGACCTCCA CGAGTCCGGG AGCGGCGGTA CGGCTCGATC TCCTGCGCGA CCGGCAGGTG ATTGCGACTT CGGTTGTCCC CGAGAAGGGG TCTGAAGAGC AGGGCAGCAA TCCAGGTTGG ACTCCGTACA ATCCGAACAC CGTTGCCAGT CTTGAAGCGG AGATGCCTGC TGCGAAGGCT GGCATCAAAG TCGGCGATTC GATTACGGCG ATCGATGGCG CACCGATCTA CTCGACAGAG TCGATGATCG CCATGCTGCA ACAGACAAAG GAAAAGCCGG TCGAGCTGAC TGTGCAGCGT GACGGCAAGG AATTTAAGGT TACTGTGACG CCGCAGCTGA CCAACGACAA GGGCGAGAGC CGATATCGCA TCGGCATGGT ATCGGAGCCG AAGTACATCT CGCTTCACTT GCCGTTCAAG GCTGCGCTGT CGAAATCTCT CGACGAAAAC CGGAAGTTCT CATTCCTCGT CGTTGACCTG GTCAAAAAGC TGGCGCGGGG CGCAGTTTCG ATTAAGACGA TGTCGAGCCC GATCGGCATG GCGAAGGCAT CCGGCGAAGC AGCTCGGCAG CCCGGTTGGT CGCCATTGAT GCGGATGATG GCGCTCTTCA GCTTGCAGCT TGGGATCTTC AACTTGTTCC CGATCCCCAT CCTCGATGGC GGCATGATCC TGATGCTATT GATCGAAGGC CTTATGCGCC GCGACATCAG CATGCGAATC AAGGAACGCG CCTATCAAGT CGCGTTTGTC TTCCTGATGC TGTTCGCAGC CGTGGTCATC TTCAACGACA TTGTGAAGAC GGTTCCCGGG CTACACCAGC ACTTGCCGTA A
|
Protein sequence | MEGFLIAIVS IVFVLGVLVL VHEFGHFAAA KLFGVRVETF SIGFGKRLVG FRRGETDYRI SALPLGGYVK MTGETPLDSR TGAPEEFMSH PRWQRIIIAL AGPFMNIALA IGLLTVVYMV HDEEPAFWGE KATVGFVAPG STADKVGVKA GDTIVKIANV DNPTWEDVYL QTSTSPGAAV RLDLLRDRQV IATSVVPEKG SEEQGSNPGW TPYNPNTVAS LEAEMPAAKA GIKVGDSITA IDGAPIYSTE SMIAMLQQTK EKPVELTVQR DGKEFKVTVT PQLTNDKGES RYRIGMVSEP KYISLHLPFK AALSKSLDEN RKFSFLVVDL VKKLARGAVS IKTMSSPIGM AKASGEAARQ PGWSPLMRMM ALFSLQLGIF NLFPIPILDG GMILMLLIEG LMRRDISMRI KERAYQVAFV FLMLFAAVVI FNDIVKTVPG LHQHLP
|
| |