Gene Acid345_3492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3492 
Symbol 
ID4069068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4118638 
End bp4119831 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content62% 
IMG OID637985514 
ProductBeta-ketoadipyl CoA thiolase 
Protein accessionYP_592567 
Protein GI94970519 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.141414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTCG CGTTCCTCTG TGACGCAGTG CGCACGTCCA TCGGCCGCTT CGGTGGCGCC 
CTCGCCAACG TGCGTCCCGA CGACCTCGCT GCCATCACCA TCCGCGCGTT GATGGCGCGC
AACACAGGTG CCGACTGGTC TCGTCTCGAC GAAGTTTACT TGGGCTGCGC CAATCAAGCC
GGTGAAGACA GTCGCAATGT CGCGCGCATG GCTCTGCTTC TTGCGGGATT ACCTGTCGCC
GTTCCCGGCA CCACCGTCAA TCGCCTCTGC GCCTCCGGTA TGGACGCCAT CGGCTCCGCT
GCTCGCGCCA TCGCCTCCGG CGAAATCGAA TTCGCCATTG CCGGTGGCGT GGAATCCATG
TCGCGCGCTC CGTTTGTTAT GCCCAAGGCC GACGCCGCCT TCTCGCGCAA GGCCGAAATC
TACGACACGA CGATCGGCTG GCGTTTCGTC CATCCGAGGA TGAAAGAGCT TTACGGCGCT
GACTCCATGC CGGAGACCGG CGAGAATGTT GCTGCCGATT TCGGTATCTC TCGCGCCGAT
CAGGACGCTT TCGCGCTTCG TAGCCAGCTA CGCGCCGCCC GCGCCAGAGC CGCCGGATTC
TTCGCCGAAG AGATCGTCGC CGTTGCAACC GGCAAAAATG CGGAAGTCAC CGAAGACGAG
CACCCTCGTC CCGACACCAC TCTCGAAGCC CTCGCGAGGC TCAAGCCTAT CGTCCGCCCT
GATGGAACGA TCACCGCCGG CAATGCGTCC GGCGTGAACG ACGGCGCCGC CACCATGGTC
GTCGCCTCCG AAGAAGCAGT GAAGCAGCAC GATCTCACTC CCAAAGCTCG CGTACTCGGC
ATGGCCACCG CCGGAGTTCC TCCGCGCGTC ATGGGCATCG GCCCCGTCCC CGCCGTGCAA
AAGCTCTTGC GTCGACTCAA CCTCAAAGTC AAAGACTTCG ACGTCATCGA ACTCAATGAA
GCCTTCGCCA GCCAGTCGCT CGCATGCCTG CGCCAACTCG GCATCCCTGA CGACGCCGAC
TTCGTGAATC CTAATGGTGG CGCGATCGCG TTAGGTCATC CCCTTGGAAT GAGCGGTGCT
CGCCTCGTGC TCACCGCCTC ACATCAACTC GAGAAAACAG GTGCCAATCG CGCTCTTGCC
ACCATGTGTG TCGGCGTGGG GCAGGGAGTT GCTTTGGCGA TCGAACGCGT TTAA
 
Protein sequence
MKVAFLCDAV RTSIGRFGGA LANVRPDDLA AITIRALMAR NTGADWSRLD EVYLGCANQA 
GEDSRNVARM ALLLAGLPVA VPGTTVNRLC ASGMDAIGSA ARAIASGEIE FAIAGGVESM
SRAPFVMPKA DAAFSRKAEI YDTTIGWRFV HPRMKELYGA DSMPETGENV AADFGISRAD
QDAFALRSQL RAARARAAGF FAEEIVAVAT GKNAEVTEDE HPRPDTTLEA LARLKPIVRP
DGTITAGNAS GVNDGAATMV VASEEAVKQH DLTPKARVLG MATAGVPPRV MGIGPVPAVQ
KLLRRLNLKV KDFDVIELNE AFASQSLACL RQLGIPDDAD FVNPNGGAIA LGHPLGMSGA
RLVLTASHQL EKTGANRALA TMCVGVGQGV ALAIERV