Gene Emin_1183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1183 
Symbol 
ID6263277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1279634 
End bp1281001 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content44% 
IMG OID642611661 
Productacetyl-CoA carboxylase, biotin carboxylase 
Protein accessionYP_001876070 
Protein GI187251588 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAG AAAAATTTAA TAAAGTTCTT ATCGCTAACC GCGGTGAAAT CGCTGTACGC 
ATCTGCCGCA CGTTAAAAGA AATGGGCATT AAATCCGTTG CTGTTTACTC GGAAGCGGAC
AGGGACTCAA TGCATGTAAG AGCGGCGGAC GAAGCCGTCT GCATAGGCCC CGCCTCCTCA
AAAGAAAGCT ATTTAAATAT TGACGCCATT GTAAGCGCGG CTAAAATAAC AAATACGGAC
GCTATTCACC CGGGTTACGG ATTTCTGTCG GAAAATCCTA AATTTTCAAA AGCGGTTACA
AAAGCGGGCA TAGTCTTTAT AGGCCCCACG CCGGAAGCTA TTGAAGCTTT AGGCGTAAAA
AGCGCTGCGC GCGAAATTGC GATTAAAGCG GGCGTACCCG TTATTCCCGG TTCAGACGGT
ATTGTCGGCA AAAATTATAA GGAAATAGCA AAAAAAATAG GCTTTCCAAT AATGATTAAA
GCCACAATGG GCGGCGGCGG CAAAGGTATG CGCGCCGTTA TGAAAGAAGA AGATTTAGAT
AAAATGATGC AAATGGCGCA AAACGAAGCC CGCGCCGCTT TTGGCGACGA CAGAGTTTAT
TTTGAAAAAC TTGTTTTAGC GCCAAGACAT ATTGAGGTGC AGGTGGCGGC CGACGCGCAC
GGAAATGTTT TGTCTTTTAC TGAACGCGAC TGCAGCATGC AAAGAAGACA TCAAAAATTG
GTTGAGGAAT CTCCCTCGCC TTTCGTTACC CCCGAGGTAA GAAAAAAGCT TACGCAGGCC
GCGTCCAAAA TGATTAAAGC CTGTCAATAT ACGGGCGTAG GCACTGTTGA ATTTTTGATG
GACCAAAATA AAGATTTTTA TTTTATGGAA GTTAACACAA GACTTCAGGT AGAACACCCC
GTAACCGAAA TGGTTTGCGG TTACGATCTT GTTAAAATGC AAATTGATAT CGCACAGGGT
AAAAAACTTC AAATAACACC CGAACAGGCT TTGCAAATTA CCTGCCACGC CATAGAACAC
CGCATAAACG CCGAAGATTG TGAAAACAAT TTTGCCCCTA ACCCGGGTTT AATTACGGAA
TGGATTCCCG CCGGAGGCTT AGGTGTAAGA GTGGACACGC ATATGTACAC AAATTACACC
ATACCCAGCT ATTATGACAG CTTAATCGCC AAGCTCATTG TTTGGGCTCC TTCACGCGAA
AGAGCTATCT CAAGAGCCAA GCGCGCTTTA TCGGAATTTC ACATCGGCGG TATTAAAACA
ACCATCCCGG TGCATAATAA AATTTTAAAT AATAAGGATT TCGCATCAGG TAATATGGAC
ACGGGCCTTT TGGAAAGGAT ATTTAAAGAA GATGAAAGCA AAAAATAA
 
Protein sequence
MTTEKFNKVL IANRGEIAVR ICRTLKEMGI KSVAVYSEAD RDSMHVRAAD EAVCIGPASS 
KESYLNIDAI VSAAKITNTD AIHPGYGFLS ENPKFSKAVT KAGIVFIGPT PEAIEALGVK
SAAREIAIKA GVPVIPGSDG IVGKNYKEIA KKIGFPIMIK ATMGGGGKGM RAVMKEEDLD
KMMQMAQNEA RAAFGDDRVY FEKLVLAPRH IEVQVAADAH GNVLSFTERD CSMQRRHQKL
VEESPSPFVT PEVRKKLTQA ASKMIKACQY TGVGTVEFLM DQNKDFYFME VNTRLQVEHP
VTEMVCGYDL VKMQIDIAQG KKLQITPEQA LQITCHAIEH RINAEDCENN FAPNPGLITE
WIPAGGLGVR VDTHMYTNYT IPSYYDSLIA KLIVWAPSRE RAISRAKRAL SEFHIGGIKT
TIPVHNKILN NKDFASGNMD TGLLERIFKE DESKK