Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2595 |
Symbol | |
ID | 4070558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3063546 |
End bp | 3064559 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984612 |
Product | peptidase M50 |
Protein accession | YP_591670 |
Protein GI | 94969622 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCAGCA CTTCACTCGG ACCAGCTTCG AACTGCCGGC GATGCGGCTC ACCGTTATCG CCGGGCGCGC TGGAGTGTCT CCAGTGTCAT GCGTTCGTGC ATGCCGACGA GCTCACCCGG ATCTCAAACG AAGCGAAGCA GCTGGAGAGT GCCGGCGATT TGCGCGCGGC GCGCGAGAGA TGGCTAACGG CAATTCCGTT ACTTCCAGTG GATGCACAAC AGACGGCGTG GATCAACAAC CATGCCGTGG ATTTGTACGC ACAGGCGCTG GAGGCCGACG TACAGCGACA GCAGGAGGGC GAGTCCAAGA GCAAATGGGC TAAGCGGCTC GGACCGTTTG CACCCCTGTT GGTGCTGCTT GGGAAAGCGA AAGTGCTGTT CGGACTGTTG AAGCTCAAGT TCGTCCTGAG CTTCGCCGGC TTTATATGGT TCTACTGGAC GATTTTCGGG ATGTGGTTCG GCGTCGGATT CGCGGTGCTA ATCCTCTGTC ACGAGATGGG GCACTACATC GAGGTGAAGC GGCGCGGACT TCCTGTGGAA GTGCCGGTGT TTCTACCGGG CCTTGGCGCA TATGTGAAGT GGAAGAACCT CGGCGTGTCC GGGGAAGGGC GGGCGATGAT CAGCCTGGCC GGTCCGCTGG CGGGATTTCT ATCGTCGGCG GTGTGCATTT TGGTTTATTG GCAGACGCAC TCGAAGCTGT GGCTGGCGCT GGCGCACTCG GGAGCCTGGT TGAACCTGAT GAACCTGATT CCGGTGTGGG CACTGGACGG GGCTCAGGCG ATCCAGGCGG TGAGCCGTTA TGGCAGGGTG GTGCTGTTGT TGAGCTGTGC GCTTCTATTC TGGGGAACAC GCGACTATGT GTTGTTGTTT ATTGGAGCCG GAGTATTGTG GCAAGCGTTT GTGAAGCCGA CCGACGAGAC GAGCGCACGG ATTGCGGCAT ACTTCACGCT GCTGCTCTGC GCGTTGGCGC TGATTCTGGT GCTAGCGCCT GGACGAGGGT TCGGGGCGGA GTAG
|
Protein sequence | MSSTSLGPAS NCRRCGSPLS PGALECLQCH AFVHADELTR ISNEAKQLES AGDLRAARER WLTAIPLLPV DAQQTAWINN HAVDLYAQAL EADVQRQQEG ESKSKWAKRL GPFAPLLVLL GKAKVLFGLL KLKFVLSFAG FIWFYWTIFG MWFGVGFAVL ILCHEMGHYI EVKRRGLPVE VPVFLPGLGA YVKWKNLGVS GEGRAMISLA GPLAGFLSSA VCILVYWQTH SKLWLALAHS GAWLNLMNLI PVWALDGAQA IQAVSRYGRV VLLLSCALLF WGTRDYVLLF IGAGVLWQAF VKPTDETSAR IAAYFTLLLC ALALILVLAP GRGFGAE
|
| |