Gene Acid345_2007 details

Gene Information

Locus tag: Acid345_2007
Symbol:
ID: 4070913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2406299
End bp: 2407588
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 60%
IMG OID: 637984021
Product: fumarylacetoacetate hydrolase
Protein accession: YP_591082
Protein GI: 94969034
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.79989
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAGA CACATGATCC AAAGATCAAG TCGTGGGTTG CTTCGGCCAA TGCGCCGGAC 
TGCGACTTCC CGCTGCAGAA CTTGCCGTTC GGAGTCTTCC GTCGCAAGAA TGACGAAGGT
GGCGGCATCG GCGTCGCGAT TGGCGATCAG GTGTTCGATG TCGGCGCCTG GGTGCGCGAC
CAAGGGAAGA GCTCTGCTGA ATTTAAACTT CTGACTGAAA AGCGCCTGAA CCGATTCCTA
GCTGCCGGCC CGCAGATATG GAGCGCGGCA CGCATGGCAC TGTTTAACCT GCTGCGCGAA
GATTCTCCGC AACGCGAGCA GGTAGCGCGC TATCTCGATG CAACTGCGAA CGTCGAGATG
GAGATGCCGA TCGATATCGG CGATTACACC GATTTCTACG CCTCCGTCTT CCACGCGACG
AATGTGGGGA GCATGTTCCG GCCGGACAAT CCTCTGCTGC CAAATTACAA GTGGGTGCCG
ATCGGCTATC ACGGGCGAGC TTCTTCTGTC GTCGCCAGCG GCGCGGCAGT GAAGCGGCCG
AGCGGCCAGC GCAAGCCACC GACGGCTGAC ATGCCGACAT TTGGTCCGTG CGCACAGCTC
GACTACGAAC TCGAAGTGGG CGCAGTGATC GGGCCGGGGA ATGCCCTTGG AGAGACGGTA
CCGTTGCGCG ACGCAGAGAA GCACATCTTC GGCTTGTGCT TGCTGAACGA TTGGTCGGCG
CGCGATATAC AAGCGTGGGA GTATCAACCC CTGGGGCCGT TTCTGGCCAA GAACTTCGTG
ACCACGATTT CTCCGTGGCT CGTGACGCTG GAAGCGCTCG AGCCCTATCG TAGGAGCGCG
TACAAGCGTC CGGAGGGCGA TCCACAGCCC TTGCCGTATC TCAGTGACGA AAACGACCAG
CAGCGCGGCG CCTTCGACGT CTCGCTCGAT GCTTATCTTT CGACGCGCAA GATGCGTGAT
GAGAAAATCG CGCCAATCAG CCTGAGTCAC GGATCTTTAC GCGATATGTA TTGGACCTTC
GGGCAGATGC TCGCGCACCA TGCCTCGAAC GGATGCAATC TCCAACCTGG CGACTTGATC
GGCAGCGGCA CGGTTTCTGG ACAGTCGAAA CACTCGCGGG GATGCCTGCT CGAGCTGTCT
TGGCGCGGCA CCGAGCCCAT CTCACTACCG AGCGGCGAAA CTCGCAAGTT TCTCGAAGAT
GGCGACGAGG TAATCTTCCG CGGCTACGCG GAGCGCGAAG GTCAAGCACG GATTGGCTTC
GGCGAGTGCC GGGGCATCGT CGTCGGATGA
 
Protein sequence
MNETHDPKIK SWVASANAPD CDFPLQNLPF GVFRRKNDEG GGIGVAIGDQ VFDVGAWVRD 
QGKSSAEFKL LTEKRLNRFL AAGPQIWSAA RMALFNLLRE DSPQREQVAR YLDATANVEM
EMPIDIGDYT DFYASVFHAT NVGSMFRPDN PLLPNYKWVP IGYHGRASSV VASGAAVKRP
SGQRKPPTAD MPTFGPCAQL DYELEVGAVI GPGNALGETV PLRDAEKHIF GLCLLNDWSA
RDIQAWEYQP LGPFLAKNFV TTISPWLVTL EALEPYRRSA YKRPEGDPQP LPYLSDENDQ
QRGAFDVSLD AYLSTRKMRD EKIAPISLSH GSLRDMYWTF GQMLAHHASN GCNLQPGDLI
GSGTVSGQSK HSRGCLLELS WRGTEPISLP SGETRKFLED GDEVIFRGYA EREGQARIGF
GECRGIVVG
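
Cross-checking the record

The fields above can be checked against each other with a few lines of Python. The following is a minimal sketch, not part of the source database: the helper names (`clean`, `translate`, `gc_content`) are hypothetical and chosen only for illustration. It assumes the coordinates are 1-based and inclusive, and that the sense-codon assignments of translation table 11 match the standard genetic code (table 11 differs mainly in which codons may act as starts; this gene starts with ATG, so that difference does not matter here). Only the first and last lines of the gene sequence are used, to keep the example short.

```python
# Illustrative helpers; these names are hypothetical and not part of the source record.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; assumed to match
# the sense-codon assignments of translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons of a DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(dna):
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# Coordinates from the record (assumed 1-based, inclusive).
start_bp, end_bp = 2406299, 2407588
gene_length = end_bp - start_bp + 1
protein_length = gene_length // 3 - 1  # the final codon is the stop codon

# First and last lines of the gene sequence, copied from the record.
first_block = "ATGAACGAGA CACATGATCC AAAGATCAAG TCGTGGGTTG CTTCGGCCAA TGCGCCGGAC"
last_block = "GGCGAGTGCC GGGGCATCGT CGTCGGATGA"

print(gene_length, protein_length)  # 1290 429
print(translate(first_block))       # MNETHDPKIKSWVASANAPD
print(translate(last_block))        # GECRGIVVG*
```

The translated first block matches the first 20 residues of the protein sequence, and the last block ends in a TGA stop codon after the final GECRGIVVG residues. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.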