Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4415 |
Symbol | |
ID | 4073321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5243208 |
End bp | 5244371 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637986448 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_593489 |
Protein GI | 94971441 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAAGA TGCTCCGTCC CTTTCTTCTC GGCGTTTTGC TTGCCACTGG TTTTTTCTAC CTGACCACGC ACCATCGCGC CGCCAGCGCC GGTTCCGACG TCGACAACGT CTGGATCTCG CGTCCCGACC GCCTCGAGCT GACCCAGGCC GCCGGCCCCG TCACCTACGA CCCCGAAGAG CAGGTCAACA TCGAGGTCTA TAAAAGAGGT CTGCCCAGCG TCGTGAACGT CACCTCCACG ACGGTCGCCT TCGACTTCTT TTATGGCGCG GTGCCGCAAG AGGGCCAGGG CTCCGGCTTC ATCATTGATA AGCAGGGCCA TATCCTGACG AACTTTCACG TCGTCCAGGG CAATCCGCAG AAGCTGGAAA TCACCCTCAG TAATCGCAAG AAATATCCAG CCAAGGTTAT TGGCCTCGAC CGCTCGCACG ATCTCGCGGT CGTCCAGATC AACGCCCCTG ACCTCGTACC CGCCGTCATG GGCGACAGTC ACGGTCTCGT TGTCGGCCAG AAGGTCTTCG CCATCGGCAA TCCCTTTGGC CTCTCCGGTA CCATGACCCG CGGCATCATC AGCTCCATCC GCGCCATCGT CGAGCCCGAC GGCACCAAGA TCGACGAAGC CATCCAGACC GACGCCGCCA TCAACCCCGG CAACTCCGGC GGTCCGCTGC TCAACTCGCG CGGTGAAGTC ATCGGCATTA ACACCATGAT TGCCAGCAAC GGCGCCGCAC AGAGCGCCGG CATCGGCTTC GCTGTCCCCA TCAACGCGGC CAAAGCCGTG CTCAATGACC TCGTGCAGTA CGGCGAAGTC CGCCGTCCGT CGCTCGGAAT CCGCGGCGGG CTGCCGATCA CACCCGAGCT CGCCGAACAA ATGGGCCTCG CCGCCGACTA CGGCGTCCTC ATCCAGGCCG TAATCCCCGG TCGTGGCGCC GACAAAGCAG GACTGAAAGG CGGCAACGAG CGCGCCTATC TCGGCAACAC GCCGATCATG ATCGGTGGTG ACCTCATCGT AGCCATCGGC GACGAGCAGA TCGCGGACCT GCAGGACCTG TCGCACGCCA TGAACGCCCA CAAAGCGGGC GAGACGGTGC GGGTCACCAT CTACCGCGCC AAACGCAAAA TGGTAGTCCC CGTCCAGCTA GACGAGGCCC GCCAAGGCGC ATAA
|
Protein sequence | MRKMLRPFLL GVLLATGFFY LTTHHRAASA GSDVDNVWIS RPDRLELTQA AGPVTYDPEE QVNIEVYKRG LPSVVNVTST TVAFDFFYGA VPQEGQGSGF IIDKQGHILT NFHVVQGNPQ KLEITLSNRK KYPAKVIGLD RSHDLAVVQI NAPDLVPAVM GDSHGLVVGQ KVFAIGNPFG LSGTMTRGII SSIRAIVEPD GTKIDEAIQT DAAINPGNSG GPLLNSRGEV IGINTMIASN GAAQSAGIGF AVPINAAKAV LNDLVQYGEV RRPSLGIRGG LPITPELAEQ MGLAADYGVL IQAVIPGRGA DKAGLKGGNE RAYLGNTPIM IGGDLIVAIG DEQIADLQDL SHAMNAHKAG ETVRVTIYRA KRKMVVPVQL DEARQGA
|
| |