Gene Mlg_1594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1594 
Symbol 
ID4268565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1823720 
End bp1824847 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content71% 
IMG OID638126351 
Productenoyl-CoA hydratase/isomerase 
Protein accessionYP_742431 
Protein GI114320748 
COG category[I] Lipid transport and metabolism 
COG ID[COG1024] Enoyl-CoA hydratase/carnithine racemase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGC AACAGGCCGC CTTCCAAGCG CAGGTGTTAC CGGCCCGCGA TGACGGGCCT 
GGCATCGGGG TGGCCACCCT CACCGCCCCG CGTAAGCTCA ACGCCCTGGA CCTGGGCATG
ATTCAAGCCC TGAGCCGGCA ATTGGAGGCC TGGGCCCGCG ATCCGGCGGT GGCCTGTGTG
GTCCTGGAGG GCGAGGGGGA ACGGGCCTTC TGCGCGGGCG GTGACGTACG CGCCGTGGCC
GAGGCCCTGC GCGGCAACCG CCCGGCGGGA CTGGCCTTCG CCGAGCAGTA TTTCAGCGCA
GAGTACCGCC TGGATCACCA ACTGCACGTC TACCCCAAGC CATTGCTGGT GCGGGGACAG
GGTGTGGTCA TGGGCGGCGG ACTGGGCCTG TTCCAGGGCG GCGACGTGCG CGTGCTCACC
CCCACGTCCA CCCTGGCCAT GCCCGAGATC ACCATCGGTC TCTTCCCCGA CGTCGGGGCC
GCGTGGTTCC TGCAGCGCAC CCCACCCGGG ACCGGCGAAT ACGCCGCCTT GACCGGCGCC
CGGCTCAACG CCGCAGACGC CCTCTTCATG GGCCTGGGCG ACCTGGTCCT GCCCGAGGAT
CACCGGGGGG CGCTGCTGGA GGCACTCCAG GCCGTACGCT GGCACAACGA GCCCCGCCGT
GACCGGAGCC TGCTCCACCA AGCCGCGCGC GGGCTGGCGA TCCCCCGACC GGAGCTATCC
GACTCGCCCG TTCAGGCCCG GGCCGAGCGC ATCCGTCAGG TCATGGCCTG GCCCGGCCTG
GGCCAACGGG TGGCCGCCAT CCGCGACTCG GCCCGCCACG ACCCCTGGCT GGAGGAGAAC
GCCGAACGCT TCGAGAGCGG CTCGCCGACA TCCATTGCCC TGATCCACGA GCAATTCCAG
CGCACGCGGC ACCTAGCCCT GCGCGAGTGC TTCCAGCTCG ACCTGGTCCT GGCCATCCAG
TGCTCCCGGC GGGATGACTT CCCCGAAGGG GTGCGCGCCC TGCTGCTGGA CAAGGACCAG
AACCCCCAGT GGCAGTCCGC CACCCTGAGG GAGATCACCC CGGACTGGAT CGACGCCCAC
TTCGTCTCCC CCTGGCCCGA CCAGCCCAAC CCGCTGCTGG ACCTCTGA
 
Protein sequence
MAEQQAAFQA QVLPARDDGP GIGVATLTAP RKLNALDLGM IQALSRQLEA WARDPAVACV 
VLEGEGERAF CAGGDVRAVA EALRGNRPAG LAFAEQYFSA EYRLDHQLHV YPKPLLVRGQ
GVVMGGGLGL FQGGDVRVLT PTSTLAMPEI TIGLFPDVGA AWFLQRTPPG TGEYAALTGA
RLNAADALFM GLGDLVLPED HRGALLEALQ AVRWHNEPRR DRSLLHQAAR GLAIPRPELS
DSPVQARAER IRQVMAWPGL GQRVAAIRDS ARHDPWLEEN AERFESGSPT SIALIHEQFQ
RTRHLALREC FQLDLVLAIQ CSRRDDFPEG VRALLLDKDQ NPQWQSATLR EITPDWIDAH
FVSPWPDQPN PLLDL