Gene Acid345_2470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2470 
Symbol 
ID4072094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2921106 
End bp2922281 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content62% 
IMG OID637984487 
Product3-ketoacyl-CoA thiolase 
Protein accessionYP_591545 
Protein GI94969497 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.559901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.822135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAAG TAGTGATTGT ATCGGCCGTC CGGACGCCGG TCGGCAAGGC TTACAAAGGG 
ACGCTGCGGG CAACGCGGCC GGATGATCTG GGCGCGGTGG CGATCAAGGG CGCTCTGGAG
CGGGTACCGC AACTCGACGT GCGCGAGATT GAAGACGTGA TCCTGGGCTG CGCCATGCCG
GAGGCGGAAC AGGGCATGAA CGTGGCGCGC ATCGCTTCCC TGCGCGCGGG ATTGCCGGTC
GAGTGCTCGG CGATGACGAT TAATCGTTTC TGCGCGTCGG GACTGCAGGC GATTGCGCTG
GCGGCGGAGC GGATTCGCGG TGGCGGAGCG GAAGTGATCG TCGCAGGCGG TACCGAGAGC
ATGACGATGG TGCCGATGGG CGGACATAAG TTCACGGCCA ATCCGTACCT GGTGGAGACG
TATCCGGATT CGTATCTGTC GATGGGCTTG ACGGCGGAGC GGCTGGCGGT GCGCCACGGG
ATCACGCGCG AGATGGCGGA CGAGTTCTCG TACAACAGCC ACAAGAAAGC CATTGCCGCG
ATCGAGGCTG GGCGCTTCGA GGATGAGATC GTTCCGGTGC CGGTGACGTT CGTGACGCCG
AACGGATCGA AGCCGAAGAA GCAGGAGATA TTGTTCAAGG TGGACGAAGG GCCGAGGGCC
GACACGACGT TGGAAGCTCT GGCGGGATTG AAGCCGGCGT TCCACGTGAA GGGCACGGTG
ACGGCGGGCA ATTCGTCGCA GATGTCGGAC GGGGCGGCGG CTGCGGTGGT GATGTCGGCG
GAGCACGCGA AGAAATTGGG GATCAAGCCG CTCGCGAGGT TTGTGGCGTT CGCAACCGCG
GGTTACAAGC CGGAAGAGAT GGGACTGGGG CCGGTGTTCG CGATTCCGAA GGCGCTGAAG
ATCGCCGGGC TGAAGCTCGA GGACATTGAT GTGTTCGAGT TGAACGAGGC GTTCGCGGCG
CAGGCGTTGT CGGTGATCAA GGAAGCGGGG ATTGATATCA ATAAGGTCAA TCCAAATGGC
GGCGCGGTGG CGTTGGGGCA TCCGCTGGGA TGCACGGGAG CTAAGCTGAC GGCGACGATC
ATTCGTGAAT TGAAGCGGCG CAATGGGAAG TACGGGATTG TGACGATGTG CGTCGGCGGA
GGTATGGGTG CTGCGGGGAT TTTTGAAAAT CTTTAA
 
Protein sequence
MREVVIVSAV RTPVGKAYKG TLRATRPDDL GAVAIKGALE RVPQLDVREI EDVILGCAMP 
EAEQGMNVAR IASLRAGLPV ECSAMTINRF CASGLQAIAL AAERIRGGGA EVIVAGGTES
MTMVPMGGHK FTANPYLVET YPDSYLSMGL TAERLAVRHG ITREMADEFS YNSHKKAIAA
IEAGRFEDEI VPVPVTFVTP NGSKPKKQEI LFKVDEGPRA DTTLEALAGL KPAFHVKGTV
TAGNSSQMSD GAAAAVVMSA EHAKKLGIKP LARFVAFATA GYKPEEMGLG PVFAIPKALK
IAGLKLEDID VFELNEAFAA QALSVIKEAG IDINKVNPNG GAVALGHPLG CTGAKLTATI
IRELKRRNGK YGIVTMCVGG GMGAAGIFEN L