Gene Mlg_2462 details


Gene Information

Locus tag: Mlg_2462
Symbol:
ID: 4270203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2796724
End bp: 2797752
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 68%
IMG OID: 638127220
Product: 4-hydroxy-2-ketovalerate aldolase
Protein accession: YP_743292
Protein GI: 114321609
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR03217] 4-hydroxy-2-oxovalerate aldolase
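
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 2796724
END_BP = 2797752


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1029
print(protein_length(length_bp))  # 342
```

Both results agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.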


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.320363
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACA AGAAGGAAAT CCTGGTCCAC GACATGTCCC TGCGCGACGG CATGCACTCG 
GTCCGCCACC AGTTCTCGCT GGCGCAGATG ATCGAGCTGT CCACCGCCCT GGACGAGGCC
GGTGTACCCC TGATCGAGGT CACCCATGGC GATGGCCTGG GCGGCCACTC GGTCAACTAC
GGTTTCGCCG CCCACTCCGA CCGGGAGTAC CTGGAGGCCG TTATTCCGCG GATGCAGCAG
GCGCGCGTCT CGGCGTTGCT GCTGCCCGGT ATCGGCACCG TGGACGACCT GCGCATGGCC
GCCGATTGCG GCGTCCATTG CCTGCGGGTG GCGACCCAGT GCACCGAGGC GGATGTGGCG
GAGCAGCACA TCGGGCTGTC TCGCAAGCTC GGCCTGGACA CCGTCGGCTT TCTAATGATG
GCGCATATGC TGCCCGCCGA GGGCCTGTTG GAACAGGCAC GGCTGATGGA GTCCTACGGC
GCCAACTGCG TCTACATGAC CGATTCGGCC GGCTACATGC TGCCAGAGGA GGTGCGCGAA
AAAGTCTCCG CCCTGCGCGA GGGGCTCGCC GATGAGACGG AGGTCGGCTT CCACGGCCAC
CACAACCTGG CAATGGGAGT GGCCAACTCG GTAGCGGCGG TGGAGGCCGG TGCCAAGCGC
ATCGACGGCT CCGTGGCCGG CTTCGGCGCC GGCGCGGGCA ACACCCCGCT GGAGGTCTTC
ATCGCGGTCT GTGAGCGCAT GGGTATCTGC ACCGGTGTGG ACCTGGGCCG CATCCAGGAC
GTGGCCGAGG ACGTCGCGCT GCCGATGATG GACGCCCCCA CCCGGATCGA CCGCGACTCG
CTGACACTGG GTTACGCCGG GGTCTATTCC TCATTCCTGC TCCACGCCAA ACGCGCCGAG
GCCAACCACG GGGTACCGGC CCGCGACATC CTGGTGGAGC TGGGCGCCCG CCGTACCGTC
GGCGGCCAGG AGGACATGAT CGAGGACGTC GCCCTGGAGA TGAGCAGGGC GCGCAGCGCA
AGCGCCTGA
 
Protein sequence
MTDKKEILVH DMSLRDGMHS VRHQFSLAQM IELSTALDEA GVPLIEVTHG DGLGGHSVNY 
GFAAHSDREY LEAVIPRMQQ ARVSALLLPG IGTVDDLRMA ADCGVHCLRV ATQCTEADVA
EQHIGLSRKL GLDTVGFLMM AHMLPAEGLL EQARLMESYG ANCVYMTDSA GYMLPEEVRE
KVSALREGLA DETEVGFHGH HNLAMGVANS VAAVEAGAKR IDGSVAGFGA GAGNTPLEVF
IAVCERMGIC TGVDLGRIQD VAEDVALPMM DAPTRIDRDS LTLGYAGVYS SFLLHAKRAE
ANHGVPARDI LVELGARRTV GGQEDMIEDV ALEMSRARSA SA
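
The gene and protein sequences above are printed in blocks of 10 characters, so they need to be stripped of whitespace before use. The sketch below shows one way to do that, to compute a GC fraction for comparison with the reported GC content, and to translate the nucleotide sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
first_block = "ATGACTGACA AGAAGGAAAT CCTGGTCCAC"
print(translate(first_block))  # MTDKKEILVH
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields of this record.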