Gene Mlg_0926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0926 
Symbol 
ID4268213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1051178 
End bp1052281 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content69% 
IMG OID638125678 
Productchorismate mutase / prephenate dehydratase 
Protein accessionYP_741770 
Protein GI114320087 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACG AACAGACCTC CCCCGACGAT CAGGCCGCGC TGCAGGCCGT GCGTGCGCGC 
ATCGACGCCC TGGACGACGA GATCCTGCGG CTAATCAGCG AGCGCGCCCG GATGGCCGAA
GAGGTGGCCC GGGTCAAGCG TGAGGCCGGG CACAGCAACG ATTTCTACCG CCCCGAGCGC
GAGGCGCAGG TGCTGCGCCG GGTCCGCCAG TCCAACCCCG GCCCGCTGGG CGAGGAGGCG
GTGACCCGGC TGTTCCGCGA GATCATGTCC GCCTGCCTGG CCATCCAGCT GCCCTTGCAG
GTGGCCTTCC TGGGGCCCGA AGGCACCTAT ACCCAGGAGG CGGCACTCAA GCACTTCGGC
CACGCCATGG GCACGGCACC GCTGAGCACT ATCGCCGCGG TCTTTCGTGA GGTGGAGTCC
GGTGCCGCCC ACTACGGGGT GGTGCCGGTG GAGAACTCCA CCGAGGGGGT GGTCACCCAC
ACGGTGGACC GCTTTCTCAA CTCGCCGCTG CAGATCGTCG GCGAGGTGCA GTTACCCATC
CACCACGCCC TGGCCAGCCG CGAGCAGGAC TGGAACGCCA TCCGGCGTAT CTACTCCCAC
CAGCAGGGAC TGGCCCAGTG CCGGGCCTGG GTCGATACCC ATCTGCCGGG CGTGGAACGG
GTGCCGGTCA CCAGCACCGC CGAGGCGGCG CGGCTGGCGG CGGCCGAACG GGGTGCGGCG
GCCATCGCCA GTGAGGCGGC CTGCGAGCTC TACGACTTGC CGGTGCTTGC CACCCACATC
GAGGACGAGC CGGGCAACAC CACCCGCTTT CTGGTGGTGG GGCCGGAGTC TCCACCACCC
AGCGGTGACG ACAAGACCTC CTTGGTGATC AGCCGGGCCA ACCAGCCGGG TGGCCTTTAC
CGGCTGCTGG AACCATTAGC CAGGAATGGA GTGAACATGA CCCGGATCGA ATCCCGGCCC
GCGCCGCAGG GCGTCTGGGA GTATGTGTTC TTCGTGGACC TGTTGGGTCA CGTGGAGGAC
GAACCCGTCC GCCAGGCGTT GGCCGAGATC CGCGAACAGG CCAGCCTGTG CCGCGTCCTG
GGCTCGTACC CGCGAGCGCT GTAA
 
Protein sequence
MSDEQTSPDD QAALQAVRAR IDALDDEILR LISERARMAE EVARVKREAG HSNDFYRPER 
EAQVLRRVRQ SNPGPLGEEA VTRLFREIMS ACLAIQLPLQ VAFLGPEGTY TQEAALKHFG
HAMGTAPLST IAAVFREVES GAAHYGVVPV ENSTEGVVTH TVDRFLNSPL QIVGEVQLPI
HHALASREQD WNAIRRIYSH QQGLAQCRAW VDTHLPGVER VPVTSTAEAA RLAAAERGAA
AIASEAACEL YDLPVLATHI EDEPGNTTRF LVVGPESPPP SGDDKTSLVI SRANQPGGLY
RLLEPLARNG VNMTRIESRP APQGVWEYVF FVDLLGHVED EPVRQALAEI REQASLCRVL
GSYPRAL