Gene Emin_0049 details

Gene Information

Locus tag: Emin_0049
Symbol:
ID: 6263976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 52019
End bp: 53989
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 43%
IMG OID: 642610511
Product: D-lactate dehydrogenase (cytochrome)
Protein accession: YP_001874953
Protein GI: 187250471
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID: [TIGR00387] glycolate oxidase, subunit GlcD
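
The coordinates and lengths listed above agree with each other if the coordinates are 1-based and inclusive and the CDS ends in a single stop codon that is not translated. Both conventions are assumptions here, as are the function names, which are illustrative and not part of any database API. A minimal Python sketch of the check:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(52019, 53989)
print(length)                  # 1971
print(protein_length(length))  # 656
```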


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTACAA GGGATAAATT TTTTAAAGCG CGCAAAGAAA TTATTGAAAT ATTAGGGGAA 
ACCCGAGTTC TTAGTGATGA AATTTCCCTT TCTCTTCATT CTTTTGACTG TAGTTTGGGT
AGAACCAGGC CCGATGCCGT GCTTTTAATT AACAATACTA ACGAACTCCC TTTCGTAATA
AAAACTTTAA ATAAATACTC AGTTCCTTTT GTTCCGCGCG CGGCGGCTAC AAACCATGTG
GGCGGCTGCG TGGCTTTAAA AGGCGGCGTT GTGCTTCACT TAATGGGGCT TAATAAAATT
ATAAGAGTTG ATACCCAAGA AGAATTTGCC GAAGTTGAAT GTTGTGTTGT TAATGAGGAT
TTAAACAATT TACTTAATTC TTTGGGTTAC GAATACCTGC CCGACCCCGC CAGCCAAAAA
ATTTGTACTT TAGGGGGTAA CGCCGCTTTA AACGCCGGCG GAGCCAAAGG TTTAAAATAC
GGCGCCACAA GGGAACATAT TATAAAGGCG GAAATTATTA CCCCGTTTGG AGAAGTTGTA
ACCCTTTCCA ATAAGGACAA AGGGCCTGAT TTTTTGGGTT TTATAATAGG TTCGGAAGGA
ACTTTGGGGG TTATAACAAA ACTTTGGCTC AAAATTTCTA AAAAAGACTA TTTTATAAAA
ACCATGCTTG CCGCTTTTGA CAGTGTGGAC GAATCTATGG ACGCGGTCGC GGCCATTACC
GCGCGCGGAA TAGTGCCAAG GTGTATAGAG GCCATGGATA AAACCACTAC AGCTGCGGTT
GAGGCTTTCA GTAAAAGCGG TTATCCTACG GATACGGAAG CCCTTCTTCT TATAGAAACG
GACGGAACAG CCAGAAAAGC TGCAAAAGAA ATTAAGGAAA TAGAAGCCGT TTGCAGGGAA
CATAACTGCA AAAAAATTAT TATCGCTGAA AATGAGGAAA AACGGCAGGC TCTTTGGAAG
GGGCGAAGCG GCGGTTACGC CGCGATGGCA AGGCTTGCCC CCAACGTTTT TGTTGAAGAC
GGCACCGTGC CCCGCGCCGT TTTGCCTGAG GCTATAAAAA AAACAAGGGA AATTTGCGAA
ACAAATTCCA TTACGGCGGG TTTATTTTTT CACGCCGGGG ACGGCAACCT GCATCCGAAC
GTGGTTTTTG ACCAGCGCAG CAAGCAGGAA ACAAACATTG TTGTAAAAGC GGGTAAAGAA
ATGCTTAAAG TCTGCACAGA TTTAGGCGGC ACTATTTCAG GCGAGCACGG CGTGGGCATT
GAAAAACGTT CGGCCATGTC TTTTATGTAC GGAGCTGCGG AAATTGCTTT TTTTAAAAAA
CTTAAAAACG CTTTGGACCC GGAAGGGGTA TGCAATCCCG ATAAAATTTT TCCCGTAACG
GATAAAGAAA GCGCTGTTAA AGAAGCGGCA AAAGAAATTA AGGATTTGTC CGCCGTCATT
GCCGCCGCGC GCAGTGTTAA AATACAAGGT TTAAACTCAA ATAAAATTAA AACGGACAGA
GCGAAAGTTT CTTTAAAAAA CATAAACAAA ATTTTAGATG TTGATATCAA AAACTATACA
ATTACCGTGC AGGCAGGCGC TAAAGTCACT GATGTTATTA AAGAACTTTC GGCTAAAAAA
CTTTATGCCG CGCTGCCTAA ATACAAAGGC AGCGTGGGCG GCCTGTTCGC GCAGGGGTTA
TGTAATGAAT TTAACTCCTA TGTTACAGGT ATGCAGGCCG TGCTTGCTGA GGGTGAAATT
ATAAATTACG GCGGCAAGTT TGTTAAAAAC AGCGCGGGAT ATAATTTGGC CAGGCTTTTT
GCGGGATCAC AAGGCGCTTT CGGCGCGGTT ACCGAACTTA CTTTTAGAAT ATTTTCTGAA
AAAAAGGCCG CTGTAACACA GCAAAAAACG CTTGCTGAAA ACCCTTTTTA CCAAAGAATA
AAACAGACTG TTGATAAAAA CAATAAATTT GAAGGAACTT TTGATGAATA G
 
Protein sequence
MFTRDKFFKA RKEIIEILGE TRVLSDEISL SLHSFDCSLG RTRPDAVLLI NNTNELPFVI 
KTLNKYSVPF VPRAAATNHV GGCVALKGGV VLHLMGLNKI IRVDTQEEFA EVECCVVNED
LNNLLNSLGY EYLPDPASQK ICTLGGNAAL NAGGAKGLKY GATREHIIKA EIITPFGEVV
TLSNKDKGPD FLGFIIGSEG TLGVITKLWL KISKKDYFIK TMLAAFDSVD ESMDAVAAIT
ARGIVPRCIE AMDKTTTAAV EAFSKSGYPT DTEALLLIET DGTARKAAKE IKEIEAVCRE
HNCKKIIIAE NEEKRQALWK GRSGGYAAMA RLAPNVFVED GTVPRAVLPE AIKKTREICE
TNSITAGLFF HAGDGNLHPN VVFDQRSKQE TNIVVKAGKE MLKVCTDLGG TISGEHGVGI
EKRSAMSFMY GAAEIAFFKK LKNALDPEGV CNPDKIFPVT DKESAVKEAA KEIKDLSAVI
AAARSVKIQG LNSNKIKTDR AKVSLKNINK ILDVDIKNYT ITVQAGAKVT DVIKELSAKK
LYAALPKYKG SVGGLFAQGL CNEFNSYVTG MQAVLAEGEI INYGGKFVKN SAGYNLARLF
AGSQGAFGAV TELTFRIFSE KKAAVTQQKT LAENPFYQRI KQTVDKNNKF EGTFDE
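
Both sequences above are displayed in space-separated blocks of ten characters, so the spaces and line breaks need to be removed before computing a length or a GC percentage. The following Python sketch shows one way to do that. The helper names `normalize` and `gc_percent` are hypothetical, and the page does not say how its GC content value was calculated or rounded.

```python
def normalize(seq_block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, after normalization."""
    dna = normalize(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First displayed line of the gene sequence above.
first_line = "ATGTTTACAA GGGATAAATT TTTTAAAGCG CGCAAAGAAA TTATTGAAAT ATTAGGGGAA"
print(len(normalize(first_line)))  # 60
```

Passing the full gene sequence block to `normalize` and taking its length gives a value that can be compared with the Gene Length field, and `gc_percent` gives a value that can be compared with the GC content field.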