Gene Mlg_2108 details


Gene Information

Locus tag: Mlg_2108
Symbol:
ID: 4270086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2391351
End bp: 2393306
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 69%
IMG OID: 638126864
Product: acetoacetyl-CoA synthetase
Protein accession: YP_742940
Protein GI: 114321257
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID: [TIGR01217] acetoacetyl-CoA synthase
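
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2391351
end_bp = 2393306
gene_length_bp = 1956
protein_length_aa = 651


def span_length(start, end):
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks print True if the record is internally consistent
# under the assumptions stated above.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```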


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.188473
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGA ACAGCGAAGT ACTCTGGCAG CCGGGCGAGG CGGAGCTGCG GCAGAGCAAT 
ATAGCCCACT ACATGGGCTG GCTGGCTGCC GAGGGCTACG GCCGGTTCGA GGACTATCAT
GCCCTGCACG CCTGGTCCAT CGAGGACCTG GAGCGTTTCT GGTCCAGCAT CTGGGACTAC
ACCGGGGTGA TCGCCAGCCG GCCCTACGAC CGGGTGCTCG GGCGCCGGGA GATGCCGGGC
GCCGAGTGGT TCCCTGGCGC AAGGCTGAAC TTTGCCGAGA ACCTGCTGCG CCATGCCCTG
CATGGGGATG CCTCGGCCGA GGCGCTGGTG GCGGTGTCCG AGTCCGGCGC GCCGGTGCGT
CTGAGCCGCG GTGAGCTGCT GGAGCAGGTG GCCGCGTTGC AGGGCTTTCT ACTCGCCCAA
GGCGTGGGCC CGGGCGACCG GGTGGCCGGC GTGGTGGGCA ACACCGAGCA TGCGCTCATC
GGCATGCTGG CGGCCACTGG GCTGGGGGCC ATCTGGAGCT CCGCCTCGCC CGATTTCGGC
GTCTCCGGGG TGCTCGACCG CTTCAGCCAG ATCGAGCCCA AGGTGCTGCT GGCGGTGAAC
GGCTACAGCT ACAACGGCAA GCCCTTCCCG CGGCTGGAGC AGAATGTCGA GTTGGCCGAG
CGCCTGCCTG GACTGGCGGC GGTGCTCAGC ATCCCGCTGC TGCCCGACGT CGGTCACCCT
GAGGGCGGGC TTTTCACCCC CTGGGACGAG GCCCTGGCCG TCCACGCCGG CGCCCGGCCG
GTATTCGAGC AGCTGCCGGC GGACCACCCG GTCTACATCC TCTACTCCTC GGGCACCACC
GGGGTGCCCA AGTGCATCGT GCACGGGGCG GGCGGGATGC TGCTCAACCA CAGCAAGGAG
CTGATGCTGC ACGCCGACCT CAAGCCCGGG GATACCTTCT TCTACTTCAC CACCTGCGGC
TGGATGATGT GGAACTGGCT GGCCTCGGGG CTGGTGACCG GGGCGCGGCT GGTGCTCTTC
GAGGGCTCGC CGGGCTACCC CGACCTGGAT GTCTGCTGGG ACCTGGCCGA ACGCGAGGGC
ATCACCCACT TCGGCACCAG CGCCAAGTTC CTGGGCGGTT GTCGCAAGAA GGAACTGGCG
CCGGGCGAGG CGCACGATCT GTCCGCCCTG CGGGTGCTCT TCTCCACCGG CTCGCCGCTG
CTGCCCGAGG ACTACGACTG GGTCTACGGC CAGGTGAAGC GTGACGTGCT GCTGGCCTCC
ATCTCCGGCG GCACCGACAT CTGCGGTTGC TTTGTCGGCG GCACGCCCAA CCTGCCGGTG
CGCCGAGGCG AGATCCAGTG CCGGCTGCTG GGCGTGGACG CCGCCGCCTA CGAGGATGAC
GGCCACGACG CCGGTCACGG CCGGGGTGAA CTGGTGGTGC GCCAGCCCTT TCCGGCCATG
CCGGTGCGGT TCTGGAACGA CCCGGACGGG AGCCGCTACA AGGGCGCCTA CTTCAAGACC
TTCCCGGGTG TCTGGGCGCA TGGCGACTAC GTGCGTTTCA CCGAGCACGG CGGGGCGGTG
ATCTACGGCC GCTCCGACGC CACGCTCAAC CCCGGTGGCG TGCGGATCGG CACCGCGGAG
ATCTACCGCC AGGTGGAGCA GGTGCCCGAG GTGGCGGACA GCCTGGTGGT GGGTGAGCCG
GTGGACGGTG ACGTGCGGGT ACTGCTGCTG GTGGTGATGG CCGAGGGCCA GACGCTGACC
GAGGCGCTGG AGCAACGGAT CAAGCAGGCG ATCAAGGAGA ACGCCAGCCC GCGGCACGTG
CCGGGGCGGA TCGTGGCGGT GCCGGATATC CCCTACACCC GCAGCGGCAA GAAGGTCGAA
CTCGCCGTTG CCCGCATGCT GCAGGGGCGC GAGCCAGGCA ACCGGGGCGC CCTCAGCAAC
CCCGAGGCGC TGGATGCCAT TGCCGAGCGA TTGTAA
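
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before its length or GC content can be compared with the values in the record. A minimal sketch, with hypothetical helper names, applied here only to the first line of the sequence:

```python
def clean_sequence(text):
    """Upper-case the text and drop spaces, line breaks and other non-letters."""
    return "".join(ch for ch in text.upper() if ch.isalpha())


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGAGTGAGA ACAGCGAAGT ACTCTGGCAG CCGGGCGAGG CGGAGCTGCG GCAGAGCAAT "
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole sequence block to `clean_sequence` in the same way gives the input for a check against the reported gene length and GC content.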
 
Protein sequence
MSENSEVLWQ PGEAELRQSN IAHYMGWLAA EGYGRFEDYH ALHAWSIEDL ERFWSSIWDY 
TGVIASRPYD RVLGRREMPG AEWFPGARLN FAENLLRHAL HGDASAEALV AVSESGAPVR
LSRGELLEQV AALQGFLLAQ GVGPGDRVAG VVGNTEHALI GMLAATGLGA IWSSASPDFG
VSGVLDRFSQ IEPKVLLAVN GYSYNGKPFP RLEQNVELAE RLPGLAAVLS IPLLPDVGHP
EGGLFTPWDE ALAVHAGARP VFEQLPADHP VYILYSSGTT GVPKCIVHGA GGMLLNHSKE
LMLHADLKPG DTFFYFTTCG WMMWNWLASG LVTGARLVLF EGSPGYPDLD VCWDLAEREG
ITHFGTSAKF LGGCRKKELA PGEAHDLSAL RVLFSTGSPL LPEDYDWVYG QVKRDVLLAS
ISGGTDICGC FVGGTPNLPV RRGEIQCRLL GVDAAAYEDD GHDAGHGRGE LVVRQPFPAM
PVRFWNDPDG SRYKGAYFKT FPGVWAHGDY VRFTEHGGAV IYGRSDATLN PGGVRIGTAE
IYRQVEQVPE VADSLVVGEP VDGDVRVLLL VVMAEGQTLT EALEQRIKQA IKENASPRHV
PGRIVAVPDI PYTRSGKKVE LAVARMLQGR EPGNRGALSN PEALDAIAER L
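
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand rather than relying on a library. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The `translate` helper is hypothetical and assumes the sequence starts in frame with ATG.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino-acid assignments,
# which translation table 11 also uses ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(ch for ch in dna.upper() if ch.isalpha())
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGAGTGAGA ACAGCGAAGT A"))  # MSENSEV
```

The last codons of the gene, `ATTGCCGAGCGATTGTAA`, translate to `IAERL` followed by the stop, which matches the end of the protein sequence.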