Gene Msil_1479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1479 
Symbol 
ID7091822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1597917 
End bp1598957 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content65% 
IMG OID643464813 
Product4-hydroxy-2-oxovalerate aldolase 
Protein accessionYP_002361799 
Protein GI217977652 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones89 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGCA TCAATCCGAA CAAGCTATAC GTCCAGGATG TCACGCTGCG CGACGGCATG 
CATTCGGTGC GTCACCAGTA CAGCCTTGAC GCGGTGCGCG CCATCGCCCG TGCGCTCGAC
CGCGCCCACG TCGACGCCAT CGAGATCAGC CACGGCGACG GCATCACCGG CTCGACCTTT
AACTACGGCT TCGGCGCGCA TGACGATACC GAATGGATCG CAGCGGTGGC CGGCGAATGC
AAGTTCTCGC GCATCACTGT GCTGCTACTG CCGGGCATCG GCACCGTGCA TGATCTCAAA
TATGCCTGTG AGGCCGGCGC GCGTAGCGTG CGCGTCGCCA CCCATTGCAC GGAAGCGGAC
GTTTCGCGCC AGCATATCGA GGCGGGCCGC AAGCTTGGCA TGGATACGGT CGGCTTTTTG
ATGATGGCGC ACATGGCGCC GGTGGAGAAG CTGGTCGAAC AGGCGCTGCT GATGGAAAGC
TATGGCGCCG AATGCGTCTA TGTGACGGAT TCGGCTGGCG CGCTGCTGCC GAAACAGTAC
GCCGAACGCG TAAAAGCGGT GCGCGGCGCG CTGAAGCCCG AGACGGAAAT CGGCGTGCAC
ACCCACCACA ATCTGACCCT TGGTGTCGCG AACGCCGTGG CGGGAATTGA GGCAGGCGCC
GTTCGCGTCG ACGCCTCGCT CGCCGGCATG GGCGCGGGCG CCGGCAACGC GCCGCTCGAA
GCTCTGATCG CGGTGCTCGA CCGGATGGGA ATCGAGACCG GCTGCGACCT GCACATGTTG
ATGGACGCGG CGGACGATCT CGTGCGGCCC CTGCAGGACC GTCCGGTGCG GGTGGACCGC
GAGTCGCTTT CACTCGGCTA CGCCGGCGTC TATTCGAGCT TCCTGCGCCA TGCGGAAAGC
GCCTCGAAAC TCTATGGCGT CGATACGCGC GACATCCTCA CCGAACTCGG CAAGCGGCGC
ATGGTCGGCG GCCAGGAAGA CATGATTGTC GACGTCGCGC TGGACATTCT CAAATCACAC
GGGGCGGAGG CGGCCCAATG A
 
Protein sequence
MARINPNKLY VQDVTLRDGM HSVRHQYSLD AVRAIARALD RAHVDAIEIS HGDGITGSTF 
NYGFGAHDDT EWIAAVAGEC KFSRITVLLL PGIGTVHDLK YACEAGARSV RVATHCTEAD
VSRQHIEAGR KLGMDTVGFL MMAHMAPVEK LVEQALLMES YGAECVYVTD SAGALLPKQY
AERVKAVRGA LKPETEIGVH THHNLTLGVA NAVAGIEAGA VRVDASLAGM GAGAGNAPLE
ALIAVLDRMG IETGCDLHML MDAADDLVRP LQDRPVRVDR ESLSLGYAGV YSSFLRHAES
ASKLYGVDTR DILTELGKRR MVGGQEDMIV DVALDILKSH GAEAAQ