Gene Lcho_3854 details

Gene Information

Locus tag: Lcho_3854
Symbol:
ID: 6161207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 4321916
End bp: 4323103
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 69%
IMG OID: 641666627
Product: beta-ketothiolase
Protein accession: YP_001792873
Protein GI: 171060524
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACAC GTGACGTTGT CGTTCTGGGC GCAGCGCGTT CGGCCATTGG CACTTTCGGC 
GGCAGCCTCG CCGACACCGA GCCCGCCGAG CTGGCCGGCA TGGTCATGAA AGAGGCGGTG
CTGCGCTCGG GCGTCGACCC GCAGGCCATC AACTACGTGA CGGTGGGCAA CTGCATCCCC
ACCGACAGCC GCTTCGCCTA CGTCGCCCGC GTGGCCTCGA TCCAGGCCGG CCTGTCGAAG
GATTCGGTGG CGATGGCCGT CAACCGCCTG TGCTCGTCGG GCCTGCAGGG CATCGTGACG
ACCTCGCAGA ACATCCTGCT GGGCGACTGC GACTACGGCG TGGGCGGCGG TGTCGAGGTG
ATGAGCCGCG GCGCCTACCT GTCGACCGCG ATGCGCAGCG GCGCGCGCAT GGGCGACACC
AAGATGATCG ACTCGATGGT CGCCACCCTG ACCGACCCGT TCGGCGTCGG CCACATGGGC
GTGACGGCCG AGAACCTGGT CACCAAGTGG GGCATCACCC GCGAGGAGCA GGACGCGCTG
GCGGTCGAGT CGCATCGCCG TGCCGCGCTG GCGATCGCCG AAGGCCGCTT CAAGAGCCAG
ATCGTGCCGA TCGTCAAGCA GACCCGCAAG GGCGAGGTCA CGTTCGACAC CGACGAGCAC
GTGAAGGCCA ACACCACGAT GGAAACGCTG GCCAAGATGA AGCCGGCGTT CAAGAAGGAA
GGCGGCACCG TGACCGCCGG CAACGCCTCG GGCATCAACG ACGGCGCCGC CTTCTTCGTG
CTGGCCGACG CCGCCCGTGC CGCTGCCGAC GGCCACAAGG CGATCGCCCG CCTGGTGTCC
TACGCCGTGG CCGGCGTGCC CAACGAGGTG ATGGGCGAAG GCCCGATCCC GGCCACCAAG
CTGGCGCTCA AGAAGGCCGG CCTGCGGCTC GACCAGATCG ACGTCATCGA GTCGAACGAA
GCCTTCGCGG CCCAGGCCAT CGCGGTCGCG CGTGGCCTGG AATTCGACAT GAGCAAGGTG
AACCCGAACG GCGGCGCGAT CGCGCTGGGC CACCCGGTCG GCTGCTCGGG CGCCTTCCTG
GCGACCAAGG CGATCTACGA ACTGCAGCGC ACCGGTGGCC GCTACGCGCT GGTGACGATG
TGCATCGGCG GCGGCCAGGG CATCGCGACG ATCTTCGAGC GCATCTGA
 
Protein sequence
MSTRDVVVLG AARSAIGTFG GSLADTEPAE LAGMVMKEAV LRSGVDPQAI NYVTVGNCIP 
TDSRFAYVAR VASIQAGLSK DSVAMAVNRL CSSGLQGIVT TSQNILLGDC DYGVGGGVEV
MSRGAYLSTA MRSGARMGDT KMIDSMVATL TDPFGVGHMG VTAENLVTKW GITREEQDAL
AVESHRRAAL AIAEGRFKSQ IVPIVKQTRK GEVTFDTDEH VKANTTMETL AKMKPAFKKE
GGTVTAGNAS GINDGAAFFV LADAARAAAD GHKAIARLVS YAVAGVPNEV MGEGPIPATK
LALKKAGLRL DQIDVIESNE AFAAQAIAVA RGLEFDMSKV NPNGGAIALG HPVGCSGAFL
ATKAIYELQR TGGRYALVTM CIGGGQGIAT IFERI
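
The length fields in this record are related by simple arithmetic: the gene length follows from the start and end coordinates, and a complete CDS that ends in a stop codon (here TGA) encodes one residue fewer than its codon count. The following is a minimal Python sketch of those checks, not part of the original record. The helper names clean_sequence, gc_percent and expected_protein_length are hypothetical and chosen only for illustration. It assumes the coordinates are 1-based and inclusive.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the Gene Information fields (assumed 1-based, inclusive).
start_bp, end_bp = 4321916, 4323103
gene_length = end_bp - start_bp + 1
print(gene_length)                           # 1188
print(expected_protein_length(gene_length))  # 395
```

To check the reported GC content, the full gene sequence listed above can be pasted into a string and passed to gc_percent, which ignores the grouping spaces and line breaks.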