Gene Mfla_0340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_0340 
Symbol 
ID3999307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp342511 
End bp344361 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content59% 
IMG OID637937236 
Productdihydroxy-acid dehydratase 
Protein accessionYP_544452 
Protein GI91774696 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCAAT ACCGCTCCAG GACCTCCACC CATGGCCGCA ACATGGCAGG CGCACGCGCA 
CTCTGGCGTG CTACCGGGAT GAAGGAGGAC GACTTTCAGA AACCAATCAT TGCGATTGCC
AATTCTTTCA CCCAATTTGT TCCCGGGCAC GTACACCTCA AGGATTTGGG ACAGCTGGTG
GCTAGGGAGA TCGAGCGTGC GGGTGGCGTC GCCAAGGAAT TCAACACGAT TGCCGTGGAT
GATGGCATTG CCATGGGCCA TAGCGGCATG CTGTACAGCT TGCCCAGCCG TGACCTGATT
GCCGATGCCG TGGAGTATAT GGTGAGCGCC CACTGCGCCG ATGCGCTGGT GTGCATCTCC
AACTGTGACA AGATCACTCC TGGCATGCTC ATGGCGGCGC TACGCCTGAA TATTCCCGTG
GTATTCGTCT CGGGCGGGCC GATGGAGGCC GGCAAGGTCA ATTGGCAGGG CAATATGCAC
AAGCTCGACC TGGTGGATGC CATGGTCGCT GCTGCAGACA GCAATGTGTC CGATGCGGAG
TCCGAGGCGA TCGAACGTTC TGCCTGCCCG ACCTGCGGTT CCTGTTCCGG CATGTTCACC
GCCAACTCGA TGAACTGCCT GACCGAGGCG CTCGGCCTGA GCCTACCTGG TAACGGCTCC
ACACTGGCCA CGCATGCTGC GCGCAAGGAG CTGTTCCTCA AGGCTGGTCG CCTGATCGTG
GAAATCACCA AGCGCTACTA CGAACAGGAT GACGAGAGTG TACTGCCGCG CAGCATTGCC
AACTTCAAGG CATTCGAGAA TGCCATGAGC CTCGACGTCG CCATGGGCGG CTCGACCAAC
ACTGTATTGC ATTTGCTGGC AGCGGCGCAC GAGGCTGAGG TGGATTTCAC CATGGCTGAC
ATTGACCGTA TTTCCCGCGG CGTGCCCTGC ATATGCAAGG TGGCACCTGC TACGGCCAAA
TATCACATGG AGGATGTACA TCGTGCAGGT GGCGTGATGG CCATCCTGTC AGAGTTGAGT
CGGGCCGGGC TGATCCACCG TGACACTCCC ACCGTGCACA GTCCAACCCT GGGTGAGGCG
TTGGATAAGT GGGACATCAT GACCACCAAC GACGAGGATG TGAAGAAATT TTATCGTGCC
GCACCTGGCG GTATTAGCAC GACGATCGCA TTCAGCCAGT CCATGCTCTG GCCGGACCTG
GACACTGACC GCAAGGAAGG CTGCATCCGC GACAAGGATC ATGCGTATTC TCAGGATGGC
GGACTGGCAG TGCTTTACGG CAACATTGCC CTAGACGGCT GTATCGTGAA GACGGCCGGC
GTGGACGACA GCATTCTCAA GTTCACGGGG CGCGCGCGCA TTTTTGAAAG CCAGGATGAT
GCCGTGGCTG CCATTCTTGC AGACAAGATC GAAGCGGGGG ACATCGTCAT CATCCGCTAC
GAAGGCCCGC GCGGCGGACC TGGCATGCAG GAAATGCTTT ACCCGACCAG CTACCTCAAA
TCCAAGGGAC TTGGCAAGGC ATGCGCGCTG CTGACCGATG GCCGCTTTTC CGGCGGTACC
TCCGGTCTGA GTATCGGCCA TGCCTCTCCG GAAGCTGCCG AAGGCGGCGC AATCGGCCTG
GTGGAGGAAA ACGATATCAT CGAGATCGAC ATTCCCAATC GTACTATTCA TCTGGCGGTC
AAGGACGAAG TGCTTGCCCA CCGGCGCGCT GCCATGGAGG CACGCGGCAA AGATGCATGG
AAGCCGGTCA ACCGCCAGCG TCATATATCC GTTGCATTGC GTGCGTATGC TGCCATGAGC
ACGTCCGCGG CCAAGGGGGC TGTGCGTGAC GTCAGCCAGA TCGAGAAGTA G
 
Protein sequence
MPQYRSRTST HGRNMAGARA LWRATGMKED DFQKPIIAIA NSFTQFVPGH VHLKDLGQLV 
AREIERAGGV AKEFNTIAVD DGIAMGHSGM LYSLPSRDLI ADAVEYMVSA HCADALVCIS
NCDKITPGML MAALRLNIPV VFVSGGPMEA GKVNWQGNMH KLDLVDAMVA AADSNVSDAE
SEAIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHAARKE LFLKAGRLIV
EITKRYYEQD DESVLPRSIA NFKAFENAMS LDVAMGGSTN TVLHLLAAAH EAEVDFTMAD
IDRISRGVPC ICKVAPATAK YHMEDVHRAG GVMAILSELS RAGLIHRDTP TVHSPTLGEA
LDKWDIMTTN DEDVKKFYRA APGGISTTIA FSQSMLWPDL DTDRKEGCIR DKDHAYSQDG
GLAVLYGNIA LDGCIVKTAG VDDSILKFTG RARIFESQDD AVAAILADKI EAGDIVIIRY
EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL LTDGRFSGGT SGLSIGHASP EAAEGGAIGL
VEENDIIEID IPNRTIHLAV KDEVLAHRRA AMEARGKDAW KPVNRQRHIS VALRAYAAMS
TSAAKGAVRD VSQIEK