Gene Acid345_4056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4056 
Symbol 
ID4072478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4796853 
End bp4797968 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content57% 
IMG OID637986087 
ProductL-alanine-DL-glutamate epimerase fmaily protein 
Protein accessionYP_593130 
Protein GI94971082 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.24697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.156881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGTA GGGAACTCTT GCAACTCTCA ACCGCCGCGG GAGCAGCGCT TCTGCTGAGC 
AATTCTGCGC TTGCCCAGGC ATCGCAATCT ACGGACGGTC ACTGGCACAC CAGTGTTGAG
CGGCTGAAGC TGCGCCATAC CTGGACGACC ACGATGTCCA GCAGCGAATA TCGTGACACG
CTTCACGCCC GCTTTACGAG CGACGGTGTT GTCGGCTACG GCGAGGGTGC GCCGATCGTT
CGATACCGCG AGGACGCGGC CACTGGCCAA AAAGCCCTCG AGTCGCAGAT GGCCTTTCTG
AACGCCGTCG ATCCGTGGCA TTTCGAGAAA GTCATGGCTG AACTGGCGCA GAAGATGGAG
GGTAATTTCG CTGCGAAGGC TGCCATCGAT ATTGCCCTGA TGGATTGGGC GGGCAAGCGC
CTCAATGCGC CGATCTATCG CATGCTCGGC CTTGATGCCG CTGACGCGCC GGTCACGACG
TTTTCCATTG GAATCGACAC TCCTGAAATT ACGCGGCAGA AGGTGCGCGA GGCCGAGGAA
TTCCCGGTCC TCAAAATCAA AGTTGGCCTC AAAACCGACG AGGCCACCGT CGAAGCTGTT
CGGAGTGTCA CGAAGAAGCC GCTGCGCGTA GATGCCAACG AAGGTTGGAC AGATAAGGAA
GAAGCTGTTC GCAAAATCAA CTGGCTCGAA TCGCAAGGTG TCGAGTTCGT GGAGCAGCCT
ATGCCGGCGC ACATGATCGA AGAGACGCGC TGGGTACGCA GCAAGGTACA TCTTCCCATT
CTTGCCGATG AAGCCGCCGT GAACGCGCAT GCAATTCCCG GGCTGATGAA CGCTTATGAC
GGCATCAACG TGAAACTCGA TAAATGTGGC GGCATCCAGC AGTCGCTAAA GATGATCAAC
GTTGCGAAAG CGCTTGGCAT GAAGACGATG CTCGGCTGCA TGGTTTCCAC TTCCGTCAGC
GTGACCGCGG CTGCTCACCT CTCGCCACTC GTGGACTACG CCGATCTAGA TGGCAATTTG
CTCATTGCCA ACGATCCGTT CACAGGCGTC AAAGTTGAAA AAGGAAAGCT GGTGCTGCCG
AACGGCCCGG GCTTGGGGCT TACAAAGAAC TCTTAG
 
Protein sequence
MNRRELLQLS TAAGAALLLS NSALAQASQS TDGHWHTSVE RLKLRHTWTT TMSSSEYRDT 
LHARFTSDGV VGYGEGAPIV RYREDAATGQ KALESQMAFL NAVDPWHFEK VMAELAQKME
GNFAAKAAID IALMDWAGKR LNAPIYRMLG LDAADAPVTT FSIGIDTPEI TRQKVREAEE
FPVLKIKVGL KTDEATVEAV RSVTKKPLRV DANEGWTDKE EAVRKINWLE SQGVEFVEQP
MPAHMIEETR WVRSKVHLPI LADEAAVNAH AIPGLMNAYD GINVKLDKCG GIQQSLKMIN
VAKALGMKTM LGCMVSTSVS VTAAAHLSPL VDYADLDGNL LIANDPFTGV KVEKGKLVLP
NGPGLGLTKN S