Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2187 |
Symbol | |
ID | 4071439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2609862 |
End bp | 2610971 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984203 |
Product | O-succinylbenzoate synthase |
Protein accession | YP_591262 |
Protein GI | 94969214 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01928] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCG AAGCAATTAC ACTGCGCGAA ATTGAAATGC CGCTGGTCAA CTTTTTCGAG ACCAGCTTTG GGCGTATCTA CAGCCGCCGC ATGCTACTGG TGACGATGCA TTGCGATGGC GTGGACGGCT GGGGCGAGTG CGTAGCTGAC GAGGCGCCGT TTTATAGCCC GGAGTCGGTG GATACGGCGT GGCTGATTAT TCGCATGTAC CTGGCGCCGA TGCTGCTCGG CAAAGAAGTG GAGCGAGGCG CCGACGTTCA GCCGCTGCTG GCGCGCGTGC GCGGCCATCG CATGGCCAAG GGCGTCCTTG AGAACGCAAT GTGGGACGCC GAAGCGAAGG CGAAGAATCT GCCGATATGG AAGCTGCTCG GCGGTTCCCG TGAAAAGATT CCGTGTGGCG TGTCGATCGG CATCCAGGAC TCGCACGAGC AGTTGCTCGA CAAGATTGAG ACCGAACTCG CCGCTGGCTA TCAGCGCATC AAGGTGAAGG TGAAGCCGGG ATGGGACGTC GAAGTGCTCG AGAAAATTCG CAAGCGCTGG CCCGACATCC TGCTGAGCTG CGATGCGAAT TCGGCATACA CGCTGAGCGA CTTCGAGCAC CTCAAAGAGT TCGAGCAATT CAAGCTGCTG ATGATCGAGC AGCCCCTCTG GAACGACGAC TTCTACTTCC ATGCCGCGCT ACAGAAGCAA CTGAAGACAG CCCTCTGTCT CGATGAGGCA ATCGAAAGCT GGCGCGATGC GCAGGCCGCG CTGGAACTGG GCGCCTGCAG GATCGTCAAC ATCAAGGTCG GGCGCGTTGG CGGATTCAGC GAAGCCATCG CAGTCCACGA CATCGCGCAA CGCTTCGGCG TGCCGGTGTG GTGCGGCGGC ATGCTCGAGT GCGGCCTCGG ACGGTCGCAT AACATCGCGC TCTCGACGCT GCCAAATTTC TCGCTACCTG GCGATGTGTC GGCGTCCAAA CGATATTGGA AGGAAGACGT AATCGAGCCC GAAGTCACGG TTTCGCCGGA TGGCTTTATT CCGATTCGCG ACGTGCCGGG TACGGGATAT ACGTTGCGCG AAGATCAGAT TGAGAGAATC ACCACGAAGA AAGAGACGGT GAGGGCATAG
|
Protein sequence | MKIEAITLRE IEMPLVNFFE TSFGRIYSRR MLLVTMHCDG VDGWGECVAD EAPFYSPESV DTAWLIIRMY LAPMLLGKEV ERGADVQPLL ARVRGHRMAK GVLENAMWDA EAKAKNLPIW KLLGGSREKI PCGVSIGIQD SHEQLLDKIE TELAAGYQRI KVKVKPGWDV EVLEKIRKRW PDILLSCDAN SAYTLSDFEH LKEFEQFKLL MIEQPLWNDD FYFHAALQKQ LKTALCLDEA IESWRDAQAA LELGACRIVN IKVGRVGGFS EAIAVHDIAQ RFGVPVWCGG MLECGLGRSH NIALSTLPNF SLPGDVSASK RYWKEDVIEP EVTVSPDGFI PIRDVPGTGY TLREDQIERI TTKKETVRA
|
| |