Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4524 |
Symbol | |
ID | 4070202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5367303 |
End bp | 5368349 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637986563 |
Product | PDZ/DHR/GLGF |
Protein accession | YP_593598 |
Protein GI | 94971550 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTATC TGAAATCTGT GCAGTCTGTA ATGCTGTCTG TCCTGCTTGC CGGAGCGGTG ACGGTGACTG GTGCCCTCCC GGCGATGGCG TCGTCCGGCG ACGAGGTGCA GAGCGGCGAT TCGTACCTTG GCATTGGGCC GCGCGACATT AGTCCGGCCC GCGTGCAAGC GCTCAAACTG AAGGATGACT CCGGCGTCGA AGTCACGGAA CTGGACAACG ACGCACCCGC GGCGAAAGCC GGCATGAAAC TCGGTGACGT GATCCTGAAC TACAACGGCC AGAAGGTCGA GAGTGCGGAA CAATTGCGGC GCTTGATTCA CGAGACGCCC GTCGGTCGCT CAGTTCAGAT TGTGATCAGC CGCAATGGCC AGCAGCAAAC GCTATCGGTT ACGCCCGGCA GCAAGCGCCA GATGAATGCC GCCATTCCCA AGTCGCCGCG CTCGCGTTCG GGCAATGGCT TTTTCGACAA TCCGCCCGAC ATGAGCATGA ACCTGCTTCA GGCGGCGTCG AAAGGCGGAT TGCTCGTCGA GAATATTACT CCGCAGCTTG GCGAGTTTCT CGGGGTGAAG AATGGCAATG GCGTCATGGT GCGCTCGGTC GAAAAGGGTT CGCCTGCGGA ATTCGCAGGC TTGCGCGCCG GCGATGTGAT TGTGCGAATC GAGAAGGATT CGATTGCCGA TATGAGCGAC TGGCATCGCC TCACCCACAA GCGCAGTGGA AAGACCATGC TCGGCGTAGT GCGCGACAAG CACGAGCAGA ACTTCTTCAT GGAATTTCCG TCCAAGCGCG ATAGTTCGTG GATGATGGAC CTTCCGGATA TCAATCCTGA CGAGATCAAG TTCGAAGTGT CCGAACTCCG TCCGGAAATG CTGAAGGCTT TTGCGGATGC GCAAGGGGCT TTCGACTCGG AAGGTGGCCA CTGGCAACTC GATCAGGACG AGATTCAGAA AGCCATCCAA AATGGCCAGC AGTCCATGGA CAAAGCCATG AAGCAATTCC AGGATCTGCA CAAGCAGTGG AACTGCACGG AAGATTCGAA AGAGTAA
|
Protein sequence | MQYLKSVQSV MLSVLLAGAV TVTGALPAMA SSGDEVQSGD SYLGIGPRDI SPARVQALKL KDDSGVEVTE LDNDAPAAKA GMKLGDVILN YNGQKVESAE QLRRLIHETP VGRSVQIVIS RNGQQQTLSV TPGSKRQMNA AIPKSPRSRS GNGFFDNPPD MSMNLLQAAS KGGLLVENIT PQLGEFLGVK NGNGVMVRSV EKGSPAEFAG LRAGDVIVRI EKDSIADMSD WHRLTHKRSG KTMLGVVRDK HEQNFFMEFP SKRDSSWMMD LPDINPDEIK FEVSELRPEM LKAFADAQGA FDSEGGHWQL DQDEIQKAIQ NGQQSMDKAM KQFQDLHKQW NCTEDSKE
|
| |