Gene Msil_0124 details

Gene Information

Locus tag: Msil_0124
Symbol:
ID: 7094254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 117749
End bp: 119140
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 64%
IMG OID: 643463458
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_002360468
Protein GI: 217976321
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
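
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(117749, 119140)
print(length)                  # 1392
print(protein_length(length))  # 463
```

Both values agree with the Gene Length and Protein Length fields of the record.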


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTCG AGTCCTGGTC GCCGTCGAGC TGGAGAGCCA AGCCGATCGA GCAGTCGCCC 
GTCTATTCCG ACGCCGCGGC GCTCGCGGAT GTCGAACGGC AGCTCGCCGG CTTTCCTCCG
CTCGTCTTTG CCGGCGAAGC GCGCAAGTTG AAGCGCATGC TCGGCAAGGT CGCCAATGGC
GAAGCTTTTC TGCTTCAGGG CGGCGATTGC GCCGAAAGCT TTGCCGAGCA TTCGGCGGAC
AATATTCGCG ATTTCTTCCG CGTCTTCCTG CAGATGGCGG TGGTGGAAAC CTTCGCCGCC
GCGCTGCCGG TGGTCAAGGT TGGCCGCATC GCCGGCCAGT TCGCCAAGCC CCGTTCGGCG
CCGAACGAGA CCGTCGGCGG CGTGTCGCTG CCGAGCTATC GCGGCGATAT CGTCAATGAC
ATTGCGTTTG AGGCGAGCGC CCGCGTGCCA GACCCCGCGC GCCAGCTCAT GGCCTATCGG
CAGGCGGCGG CGACCTTGAA CCTGCTGCGC GCCTTCGCGA CGGGCGGCTA CGCCAATCTT
GAAAACGCGC ATCAATGGAT GCTGGGCTTC ATCAAGGACA GCCCGCAGTC GGCGCGCTAT
CAGGAGCTTG CCGACCACAT CACCCAAACG CTCGGCTTCA TGCGGGCGAT CGGGCTCGAT
CCCGAGTCCC ATCAGGAGCT GCGGCAGACC GATTTTTACA CCTCGCATGA GGCGCTGCTG
CTCGGCTTCG AGGAGGCGCT GACGCGCGTC GATTCGACGA CCGGGGATTA TTACGCGACC
TCCGGCCATA TGATCTGGAT CGGCGACCGC ACGCGTCAGC CCGGCCACGC CCATATCGAA
TATGCGCGCG GCGTTAAAAA CCCGATCGGC CTCAAATGCG GCCCGACGCT GAACCCTGAT
GAGCTGATCC GGCTGATCGA CATCCTGAAC CCGGACAATG AGGCCGGGCG CCTGACGCTG
ATCTGCCGCT TCGGCGCCGA CAAGGTCGAG GCCAGCCTGC CGACCCTGAT CCGCGCCGTT
CAACAGGAAG GACGCAGCGT CGTGTGGTCT TGCGATCCGA TGCATGGCAA CACGGTCAAG
GCCGCCTCCG GCTACAAGAC GCGGCCGTTC GACAAGATCA TGAGCGAGAT CCGCTCCTTC
TTCGCGGTCC ACCAGGGCGA AGGAACCTAT CCGGGCGGCG TGCATCTCGA AATGACCGGA
AAGAACGTCA CCGAATGCAC CGGCGGCGCG CGCGCGATCT CCGACGCCGA TCTGCATGAT
CGCTATCATA CCTATTGCGA TCCGCGCCTC AATGCAGAGC AGGCGATCGA GGTGGCTTTC
CTGATCGCCG AACTGTTGAA GACCGGCCGT ATGGGCAAGG GCCTGCAGGC CCATCCCGCC
GCCGCCGAAT GA
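
A GC content figure like the one in the record can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips the layout whitespace and returns the fraction of G and C bases.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, copied with its spacing.
first_line = "ATGTCCGTCG AGTCCTGGTC GCCGTCGAGC TGGAGAGCCA AGCCGATCGA GCAGTCGCCC"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of one line gives the value to compare against the GC content field.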
 
Protein sequence
MSVESWSPSS WRAKPIEQSP VYSDAAALAD VERQLAGFPP LVFAGEARKL KRMLGKVANG 
EAFLLQGGDC AESFAEHSAD NIRDFFRVFL QMAVVETFAA ALPVVKVGRI AGQFAKPRSA
PNETVGGVSL PSYRGDIVND IAFEASARVP DPARQLMAYR QAAATLNLLR AFATGGYANL
ENAHQWMLGF IKDSPQSARY QELADHITQT LGFMRAIGLD PESHQELRQT DFYTSHEALL
LGFEEALTRV DSTTGDYYAT SGHMIWIGDR TRQPGHAHIE YARGVKNPIG LKCGPTLNPD
ELIRLIDILN PDNEAGRLTL ICRFGADKVE ASLPTLIRAV QQEGRSVVWS CDPMHGNTVK
AASGYKTRPF DKIMSEIRSF FAVHQGEGTY PGGVHLEMTG KNVTECTGGA RAISDADLHD
RYHTYCDPRL NAEQAIEVAF LIAELLKTGR MGKGLQAHPA AAE
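
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It assumes the codon assignments of table 11 match the standard code (the two tables differ only in which start codons are permitted) and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGTCCGTCG AGTCCTGGTC GCCGTCGAGC"))  # MSVESWSPSS
print(translate("GCCGCCGAAT GA"))                     # AAE
```

The two outputs match the first block and the last three residues of the protein sequence in the record.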