Gene Mfla_1686 details

Gene Information

Locus tag: Mfla_1686
Symbol:
ID: 4000941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 1802990
End bp: 1804057
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 56%
IMG OID: 637938600
Product: chorismate mutase / prephenate dehydratase
Protein accession: YP_545795
Protein GI: 91776039
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase; [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2
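
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1802990
end_bp = 1804057
gene_length_bp = 1068
protein_length_aa = 355

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.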


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATA TTCTGAAAGG TTGCCGCGAC CAGATCGATG CGATCGACGA GCAATTGCTC 
GAGCTAATCA ATGCCCGGGC TGCCTTGGCA AGGGAGATCG GTGAGCTCAA GGGCGAGGGG
CCGATTTATC GTCCCGAGCG CGAAGCCCAG GTATTACGCC GACTATTGGA AAAGAATACC
GGTCCGTTGT CTGCAGAGGC AGTGACCGCG ATTTTTCGTA GCGTGATGTC CAATTGCCGT
GCGTTGGAGC GCGAGCTTTC AGTAGCTTTT CTGGGACCGC AAGGCACATA TAGTGAAGAG
GCTGCCATCA AGCAGTTCGG TGGCCTGAAT AATCCCAAGC CCTGTATGTC GATTGATGAG
GTGTTCCGCA TGGTCGAATC CGGCAATGCG GATTATGCCG TGGTGCCTGT GGAAAACTCC
ACCGAGGGTG CGGTTGGCCG CACACTGGAT TTGCTCACGA CCACCAGTCT GCATATCTGT
GGTGAGGTTG CTCTACCTAT CCATCATTGC TTGCTTCGTC GCAGGCATGC CGACGGGGAA
ATCCGGCGTA TCTATTCCCA CGCCCAATCT TTGGGACAGT GCCATGAGTG GCTCAACCTC
AATCTGGGCG GTGTTGAGCG CGTGAGTACT GGCAGCAATG CCCAGGCAGC GGAGCTGGCC
GCACAGGATG CATTTGCCGT GGCCATTGCT GGCAGGCGCG CCGCTGATAT CTTCGGCCTG
GATATCCTGG CCGAGAACAT TGAAGACGAT CCGAAGAACG TGACACGTTT CCTGGTGCTT
GGCAAGCATG AGGCAGCCCC CTCAGGCCAG GACAAGACCT CGCTGCTGCT GGCCACGAAA
AATGTGCCGG GCGCCATTGT AGGGCTGCTG ACGCCCCTTG CCGAGCATGG CGTGGATATG
ACGGAGCTGG GCTCGCGGCC TTCCAAGCTT GGGATATGGG ATTATGTGTT CTTTGTTGAT
ATCAAAGGAC ATTATCAGGA TCCCGCTGTC GCAAGGGCCC TGCATGAGCT TGAGCAACGT
GCCTCCATGT TCAAAATCTT GGGTTCTTAT CCTGTTGCTG TTATATGA
 
Protein sequence
MSDILKGCRD QIDAIDEQLL ELINARAALA REIGELKGEG PIYRPEREAQ VLRRLLEKNT 
GPLSAEAVTA IFRSVMSNCR ALERELSVAF LGPQGTYSEE AAIKQFGGLN NPKPCMSIDE
VFRMVESGNA DYAVVPVENS TEGAVGRTLD LLTTTSLHIC GEVALPIHHC LLRRRHADGE
IRRIYSHAQS LGQCHEWLNL NLGGVERVST GSNAQAAELA AQDAFAVAIA GRRAADIFGL
DILAENIEDD PKNVTRFLVL GKHEAAPSGQ DKTSLLLATK NVPGAIVGLL TPLAEHGVDM
TELGSRPSKL GIWDYVFFVD IKGHYQDPAV ARALHELEQR ASMFKILGSY PVAVI
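
The gene and protein sequences above can be compared by translating the nucleotide sequence. This is a rough sketch, not the database's own procedure: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs in its set of start codons). Only the first and last rows of the gene sequence are used here, since each full row is 60 nt and therefore stays in frame.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from above.
first_row = "ATGAGTGATA TTCTGAAAGG TTGCCGCGAC CAGATCGATG CGATCGACGA GCAATTGCTC"
last_row = "GCCTCCATGT TCAAAATCTT GGGTTCTTAT CCTGTTGCTG TTATATGA"

print(translate(first_row))  # MSDILKGCRDQIDAIDEQLL
print(translate(last_row))   # ASMFKILGSYPVAVI
```

Passing the complete gene sequence to `translate` and `gc_fraction` would allow a comparison with the full protein sequence and the listed GC content.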