Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0206 |
Symbol | |
ID | 4069675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 218675 |
End bp | 220552 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982206 |
Product | peptidoglycan binding domain-containing protein |
Protein accession | YP_589285 |
Protein GI | 94967237 |
COG category | [S] Function unknown |
COG ID | [COG2989] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.350389 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCGCG ACTCGCGGTC TATAATCTCT CACGTTTTGG GAACGGCGCA GCACGCAAGA GCGCGCACGA GCAAGGCCAA CATCGTCTAC GTTGACTGCC TACTGGCCAC GGATTCACAA CGCAAGCTTG CCGCAAAAAG AACGGACGAT ACCGCGATGA TGAACCTTGG CTTTGCCCGA TGTGCTTTTG TGCGCGGGTG TACGTTCGGC TTGTGTGTGG TTGGGATCCT CGCGACGAGT GCATGCGCGA CGGGCAAGGC CCTCCTTCCA GGCGGCGAGA CGTTTCAGGC CGCAACCAGC GGACCGACTG TTGCCGATAG CTCGCTGCGT GAGATTGTTG CCGCGGGGCA GCTTTCCGAC CTTCGGTGGC CGGATTTTTC GGATTACCGC GCTTATGTGC AGACCTTTTA CGAGTCCTCT GGGTACAACC TAGCTTGGAC TCGTGGCGGC CAGACCACAC CCCAGGCGCT GGCGATCATC GAGATTTTGA AACAGGCGGA CGGCAAAGGC TTGAACGCGG AAGACTACGA CGCTTCCCGA TGGGCTGATC GGACAAAACA GCTGAGCCAG CCGGCTGCGG CAGCACGGTT CGACACGGCA CTCACCGTCT GCGTGATGCG CTACATCTCC GACCTGCATA TCGGGAGGGT CAATCCGACG CACGTCAAAT TCGCACTCAC CGGAAGAAGC GCGAAATACG ATCTCCCGCA ATTCCTGACC CAACGCCTGG TGAACGGCCA GAACGTTGAA GCGGAGCTTG CGGCGGTGCA ACCTCAATTC GCCGGCTACA AAGCGACGCA GGCCTGGCTG CAACGCTACA TAGAGCTGGC GCGCCAGGAC AACGGTGAGC AGTTGCCCGT TCCGACGAAG GCCCTTGATC CAGGAAAGCC CTATGCGGGG ATACCGCGCC TTACGAGTTT GCTGCACCTG CTCGGTGACC TGCCGGCCGA TGCGGTTGTC CCGGCCGGTG ACGTTTACCA GGCGCCATTG GTGGATGCGG TGAAGCGTTA CCAGTCCCGC CATGGTCTCA CAGCTGATGG TCGCCTGGGA GCCCAAACCG TGAAGGAACT CAATACGCCG CTAAGCACCC GCGTGGAACA GTTGCGCCTG ACCCTGGAGC GCTGGCGGTG GCTGCCGCAG GAGTTCCCGC AACCTCCGGT GGTGGTGAAT ATTCCGGAGT TCCGCCTGCG AGCCTATGAC GCGAACCACA AGGTTGTGTT GAGTATGAAT GTGGTGGTGG GCAAAGCGCT CCGCCACGAG ACGCCGGTTT TCGACGACGA AATGAAGTAC GTTGTTTTCC GTCCGTACTG GAATGTACCG CCGAGCATTC AACGTTCCGA GATTGTGCCC GCCATTCAGC GCGATCGCGA CTATATATCG AAGAAGAACT ACGAAGTGAC CACGCAGGCT GGGCAGGTCG TGACCTCAGG CACCATCAGC GATGAGGTGC TGCAGCAGTT GCGTGCGGGG AAGCTCGCGG TGCGGCAGAA GCCGGGGCCC ACCAATGCAC TGGGTCTGGT GAAGCTGATC TTCCCCAACC AATACAACGT CTACCTGCAC AGCACGCCCT CGCAGCAGCT GTTTTCGCAA GCGCGGCGGG ACTTCAGTCA CGGTTGCATT CGCGTAGAAA AGCCGGCCGA GTTGAGCGCC TGGGCGTTAC AGGACAAACC GGAATGGACG GTGGAAAGAG TCCGCGCCGC GATGCAAAAG GGACCGGACA ACGTCCAGGT CAACCTGTCG AAGCCAGTGC CAGTGCTCAT TCTCTATGGC ACTGCGGTCG CCGAGGAAGA TGGATCCGTC CACTTTTTCG ACGATCTCTA TGGGTACGAT GCGGACCTTG AGAAGGCCTT GGCAAGGGGA TATCCGTACC CCTTGTAA
|
Protein sequence | MSRDSRSIIS HVLGTAQHAR ARTSKANIVY VDCLLATDSQ RKLAAKRTDD TAMMNLGFAR CAFVRGCTFG LCVVGILATS ACATGKALLP GGETFQAATS GPTVADSSLR EIVAAGQLSD LRWPDFSDYR AYVQTFYESS GYNLAWTRGG QTTPQALAII EILKQADGKG LNAEDYDASR WADRTKQLSQ PAAAARFDTA LTVCVMRYIS DLHIGRVNPT HVKFALTGRS AKYDLPQFLT QRLVNGQNVE AELAAVQPQF AGYKATQAWL QRYIELARQD NGEQLPVPTK ALDPGKPYAG IPRLTSLLHL LGDLPADAVV PAGDVYQAPL VDAVKRYQSR HGLTADGRLG AQTVKELNTP LSTRVEQLRL TLERWRWLPQ EFPQPPVVVN IPEFRLRAYD ANHKVVLSMN VVVGKALRHE TPVFDDEMKY VVFRPYWNVP PSIQRSEIVP AIQRDRDYIS KKNYEVTTQA GQVVTSGTIS DEVLQQLRAG KLAVRQKPGP TNALGLVKLI FPNQYNVYLH STPSQQLFSQ ARRDFSHGCI RVEKPAELSA WALQDKPEWT VERVRAAMQK GPDNVQVNLS KPVPVLILYG TAVAEEDGSV HFFDDLYGYD ADLEKALARG YPYPL
|
| |