Gene Information |
Locus tag | Acid345_4056 |
Symbol | |
ID | 4072478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4796853 |
End bp | 4797968 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637986087 |
Product | L-alanine-DL-glutamate epimerase family protein |
Protein accession | YP_593130 |
Protein GI | 94971082 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
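The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1116 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4796853
END_BP = 4797968


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1116 371
```

The strand field does not affect these lengths; it only determines which strand of the replicon carries the coding sequence.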
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.24697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.156881 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGTA GGGAACTCTT GCAACTCTCA ACCGCCGCGG GAGCAGCGCT TCTGCTGAGC AATTCTGCGC TTGCCCAGGC ATCGCAATCT ACGGACGGTC ACTGGCACAC CAGTGTTGAG CGGCTGAAGC TGCGCCATAC CTGGACGACC ACGATGTCCA GCAGCGAATA TCGTGACACG CTTCACGCCC GCTTTACGAG CGACGGTGTT GTCGGCTACG GCGAGGGTGC GCCGATCGTT CGATACCGCG AGGACGCGGC CACTGGCCAA AAAGCCCTCG AGTCGCAGAT GGCCTTTCTG AACGCCGTCG ATCCGTGGCA TTTCGAGAAA GTCATGGCTG AACTGGCGCA GAAGATGGAG GGTAATTTCG CTGCGAAGGC TGCCATCGAT ATTGCCCTGA TGGATTGGGC GGGCAAGCGC CTCAATGCGC CGATCTATCG CATGCTCGGC CTTGATGCCG CTGACGCGCC GGTCACGACG TTTTCCATTG GAATCGACAC TCCTGAAATT ACGCGGCAGA AGGTGCGCGA GGCCGAGGAA TTCCCGGTCC TCAAAATCAA AGTTGGCCTC AAAACCGACG AGGCCACCGT CGAAGCTGTT CGGAGTGTCA CGAAGAAGCC GCTGCGCGTA GATGCCAACG AAGGTTGGAC AGATAAGGAA GAAGCTGTTC GCAAAATCAA CTGGCTCGAA TCGCAAGGTG TCGAGTTCGT GGAGCAGCCT ATGCCGGCGC ACATGATCGA AGAGACGCGC TGGGTACGCA GCAAGGTACA TCTTCCCATT CTTGCCGATG AAGCCGCCGT GAACGCGCAT GCAATTCCCG GGCTGATGAA CGCTTATGAC GGCATCAACG TGAAACTCGA TAAATGTGGC GGCATCCAGC AGTCGCTAAA GATGATCAAC GTTGCGAAAG CGCTTGGCAT GAAGACGATG CTCGGCTGCA TGGTTTCCAC TTCCGTCAGC GTGACCGCGG CTGCTCACCT CTCGCCACTC GTGGACTACG CCGATCTAGA TGGCAATTTG CTCATTGCCA ACGATCCGTT CACAGGCGTC AAAGTTGAAA AAGGAAAGCT GGTGCTGCCG AACGGCCCGG GCTTGGGGCT TACAAAGAAC TCTTAG
|
Protein sequence | MNRRELLQLS TAAGAALLLS NSALAQASQS TDGHWHTSVE RLKLRHTWTT TMSSSEYRDT LHARFTSDGV VGYGEGAPIV RYREDAATGQ KALESQMAFL NAVDPWHFEK VMAELAQKME GNFAAKAAID IALMDWAGKR LNAPIYRMLG LDAADAPVTT FSIGIDTPEI TRQKVREAEE FPVLKIKVGL KTDEATVEAV RSVTKKPLRV DANEGWTDKE EAVRKINWLE SQGVEFVEQP MPAHMIEETR WVRSKVHLPI LADEAAVNAH AIPGLMNAYD GINVKLDKCG GIQQSLKMIN VAKALGMKTM LGCMVSTSVS VTAAAHLSPL VDYADLDGNL LIANDPFTGV KVEKGKLVLP NGPGLGLTKN S
|
| |
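The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, compute a GC fraction, and translate a coding sequence. The gene sequence as listed starts with ATG and ends with TAG, so it appears to be given in coding orientation already, and no reverse complement is applied here. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used. All function names are hypothetical.

```python
from itertools import product

# Standard codon table, with bases in T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAACCGTA GGGAACTCTT GCAACTCTCA"
print(translate(first_blocks))  # MNRRELLQLS
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would be the way to compare against the reported GC content and protein sequence.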