Gene Mlg_0032 details


Gene Information

Locus tag: Mlg_0032
Symbol:
ID: 4268889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 35512
End bp: 36552
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 67%
IMG OID: 638124759
Product: aldo/keto reductase
Protein accession: YP_740881
Protein GI: 114319198
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
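
The start, end, gene length and protein length listed above agree with each other, which a short calculation confirms. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon (neither is stated on this page); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 35512
end_bp = 36552

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1041 346
```

Under these assumptions the result matches the reported 1041 bp and 346 aa.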


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTACC GCAAACTGGG TCATACCGAT ATCGAGGTCA GCGCCCTCTG CCTGGGCACC 
ATGACCTTCG GCGAGCAGAA CACCGAGGCC GAGGCCCATG AGCAACTGGA CCAGGCCCTC
GCCCGGGGGA TCAACTTCAT CGACACCGCC GAGATGTACC CGGTGCCGGC CAAAAGCGAG
ACCGGCGGCC GCACCGAGCG CTATATCGGC AGCTGGCTGA AGCGGCGCCG GCGCCGCGAG
GACGTGGTGC TGGCCACCAA GATCGCGGGG CCGGGCCTGG AGACGGTACG TGAGGGGAGG
ACCCGCTACA CCCACGCCCA CCTGGTGGAG GCGGTGGAGG GCTCCCTGCA ACGGTTGCAG
ACCGACTATA TCGACCTCTA CCAACTGCAC TGGCCGGAGC GGAAGACCAA CTATTTCGGC
AAGCTGGGCT ACCAGCCCGA TCCGCGGGAG CCGGACCCCA TCCCGCAGCT TCGCGCCACC
CTGGAGGCGC TTTATGACCT GGTGGAGGCG GGCAAGATCC GCCACATCGG GCTGTCCAAC
GAGACCGCCT GGGGGGTGAT GCGCTGCCTG TGGTTGGCCG AGCAGCAGGA TCTGCCGCGC
GTGGTCAGTG TCCAGAACCC CTACAACCTG CTCAACCGCA GCTACGAGGT GGGTCTCGCC
GAGGTCTCCC ACCGCGAGGG TGTGGGGCTG ATGGCGTACT CGCCACTGGC CTTCGGGGTG
CTCAGCGGCA AGTACCTGGA CGGCCGCTGG CCCGAGGGGG CCCGTCTGTC GCTGTTCGAA
CAGTTCCAGC GCTACACCGG GCAGCGCGGG GTGCAGGCGA CCGCCGATTA TGTGGCCCTG
GCCCACCGCT TTGGCCTGGA TCCGGCACAG ATGGCCCTGG CCTGGGCCAC CTCACGCCCC
TTCGTGACCA GCACCGTCAT CGGTGCCACC GATCTGAACC AGCTGGAGAC CAACATCGAC
AGCATGGACC TGACCCTGGA CGATGAGCTG CTGGAGGCCA TCGACGCCGT CCACGCCGGC
AACCCCAACC CCTGCCCCTG A
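
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The following is a rough sketch, assuming the sequence is pasted as shown, with spaces between 10-base blocks; `gc_content` is a hypothetical helper, not part of any tool behind this page. It is demonstrated only on the first 30 bases of the gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First 30 bases of the gene sequence above (15 of them are G or C).
first_block = "ATGGAGTACC GCAAACTGGG TCATACCGAT"
print(gc_content(first_block))  # 50.0
```

Passing the full gene sequence to the same function gives the gene-wide value to compare against the reported GC content.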
 
Protein sequence
MEYRKLGHTD IEVSALCLGT MTFGEQNTEA EAHEQLDQAL ARGINFIDTA EMYPVPAKSE 
TGGRTERYIG SWLKRRRRRE DVVLATKIAG PGLETVREGR TRYTHAHLVE AVEGSLQRLQ
TDYIDLYQLH WPERKTNYFG KLGYQPDPRE PDPIPQLRAT LEALYDLVEA GKIRHIGLSN
ETAWGVMRCL WLAEQQDLPR VVSVQNPYNL LNRSYEVGLA EVSHREGVGL MAYSPLAFGV
LSGKYLDGRW PEGARLSLFE QFQRYTGQRG VQATADYVAL AHRFGLDPAQ MALAWATSRP
FVTSTVIGAT DLNQLETNID SMDLTLDDEL LEAIDAVHAG NPNPCP
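
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The following is a minimal sketch, not the method used to produce this page. Translation table 11 assigns internal codons the same amino acids as the standard genetic code and differs mainly in which start codons it allows, so a standard codon table is enough here. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last blocks of the gene sequence above.
print(translate("ATGGAGTACC GCAAACTGGG TCATACCGAT"))  # MEYRKLGHTD
print(translate("AACCCCAACC CCTGCCCCTG A"))           # NPNPCP
```

The two outputs match the first ten and last six residues of the protein sequence. The last gene line can be translated on its own because every preceding line holds 60 bases, a multiple of three.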