Gene Mlg_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2110 
SymbolfadI 
ID4270088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2394655 
End bp2395971 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content73% 
IMG OID638126866 
Product3-ketoacyl-CoA thiolase 
Protein accessionYP_742942 
Protein GI114321259 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02446] fatty oxidation complex, beta subunit FadI 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.219559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC GAATCAACCC CGCGCCCAGT GAGCGTGGAC GGCGGGTGGC CATCGTCGCC 
GGCCTGCGCA CGCCCTTTGC CCGGCAGCTC ACCGCCTACC GCGAGCTCTC GGCCATCGAC
CTGGGCATCC TGGTGACCGC CGAGCTGATG GCGCGCCTGG ACCTGGACCC GGCCCTGGTG
GAGCGGGTGG TTTACGGGCA GGTGGGCATC CTGCCCCAGG CCCCCAACAT CGCCCGCGAG
GTGGTGCTGG GCGCCGGCCT GCCGGCGGGC ACCGATGCCT ACAGCGTCTC CCGCGCCTGC
GCCACCAGCT TCCAGTCCAC TGCCGAGGTG GCCACCGCCA TCGCCAACGG CGAGATCGAG
ATCGGCCTGG CCGGTGGCGC CGACTCCACC TCGGTGCTAC CGCTGCAGCT CAGCCGGCCG
TTGGCCCGGG CCCTGATTCA TGCCAGCAAG GCCCGCTCCC TGGGTGAACG CCTGCGCTGC
CTCAGGGGCG TCCGGCCGCG GCACTTCATC CCCAGGCCAC CGGCGGTGAA GGATTACACC
ACCGGCCTGG GGATGGGGGA TATCGCCGAG CAGATGGCCA GGAACCACGG CATCGGCCGC
GAGGCCCAGG ACCGCTTCGC CCTCGACTCC CACCGCAAGG CCCACCAAGC CTGGGAGGAG
GGGCGGCTGG CCGACGAGGT GATGCCGGCC ATCGTCCCGC CCTATAAAGA GGCGCTGGCG
CGGGACAACA ACATCCGCGG GGACAGCAGC CCGGAGCAGC TCGCCCGGCT GCGCCCGGCC
TTCGACCGGC GCCACGGCAC CGTGACCGCG GCCAACTCGA CGCCGCTCAC CGACGGCGCC
AGCACCCTGC TGCTGATGCG CGAGGACCGG GCCCGGGCCC TGGGCCACAC CCCACTGGGC
ACCCTGCGCA GCTGGGCCTT CACCGCCATC GACCCCTTCA ACGACGGCCT GATGGGGCCC
TCCTACGCCA CCCCGCTGGC GCTGGACCGG GCCGGTGCGA CCCTGGACGA CATGGCACTG
GTGGACATGC ACGAGGCCTT CGCCGCGCAG TCGCTGGCCA ACCTCAAGCA ATGGCCCAGC
CGTCGCTTTG CCCGCGAGGC CCTCGGGCGA AACGCCCCCA TCGGCGAGGT GGACCCGGAG
CGGTTCAACG TCCTGGGGGG CTCCATCGCC TACGGCCACC CCTTCGCCGC CACCGGCGGC
CGGATGATCG TACAAACCCT GAATGAACTG CGCCGGCGCG GCGGCGGTCT CGCCCTGACC
ACCGCCTGCG CCGCCGGCGG CCTGGGCGCC GCCATGGTGT TGGAGGTGGA CGGATGA
 
Protein sequence
MTTRINPAPS ERGRRVAIVA GLRTPFARQL TAYRELSAID LGILVTAELM ARLDLDPALV 
ERVVYGQVGI LPQAPNIARE VVLGAGLPAG TDAYSVSRAC ATSFQSTAEV ATAIANGEIE
IGLAGGADST SVLPLQLSRP LARALIHASK ARSLGERLRC LRGVRPRHFI PRPPAVKDYT
TGLGMGDIAE QMARNHGIGR EAQDRFALDS HRKAHQAWEE GRLADEVMPA IVPPYKEALA
RDNNIRGDSS PEQLARLRPA FDRRHGTVTA ANSTPLTDGA STLLLMREDR ARALGHTPLG
TLRSWAFTAI DPFNDGLMGP SYATPLALDR AGATLDDMAL VDMHEAFAAQ SLANLKQWPS
RRFAREALGR NAPIGEVDPE RFNVLGGSIA YGHPFAATGG RMIVQTLNEL RRRGGGLALT
TACAAGGLGA AMVLEVDG