Gene Msil_0846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0846 
Symbol 
ID7093278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp931463 
End bp932578 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content65% 
IMG OID643464183 
Productchorismate synthase 
Protein accessionYP_002361178 
Protein GI217977031 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.742224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCACA ATAGTTTCGG CCATCTGTTT CGCGTGACGA CCTTTGGCGA GAGCCATGGG 
CCGGCGATCG GCTGCGTCGT CGACGGCTGC CCTCCCGGCC TTCCCCTTGT CGAAGATGAC
ATCCAGATCT TTCTCGACCG GCGGCGGCCG GGCCAGAACC GCTTCATGAC GCAGCGGCAG
GAGGCGGACA AGGTCAAAAT TCTGTCGGGC GTCTTCGAGG ACGCGCAAAC CGGCGGCCAA
GTGACGACCG GAACGCCGAT CGCGCTCCTG ATCGAGAACA CCGATCAGCG TTCCAAGGAT
TATGAGGCGA TCAAGGACGT CTATCGTCCG GGCCACGCGG ATTATGTCTA TGACGCCAAA
TATGGCGTCA GGGACTATCG CGGCGGCGGC CGCTCGTCCG CGCGCGAGAC GGCGAGCCGC
GTCGCCGCCG GCGCGGTGGC GCGCAAGATC ATCGCATCGG TCAAGATCCG CGGCGCCCTC
GTGCAGATGG GGCCGCATAA AATCGACCGC GCTCATTGGG ATTGGGAGGA GGTCGACAAA
AACCCCTTCT TCTGCCCCGA CGCCGCGGCG GCGCGTTTCT TCGAAACCTA TCTCGACGGC
GTGCGCAAAG CCGGCTCCTC GATCGGGGCG GTCATCGAGA TCGTCGCCGA GAATGCGCCG
GCCGGCTGGG GCGCGCCGAT CTACGGCAAG CTCGATTCGG AGATCGCCGC GGCGCTGATG
TCGATCAACG CAGTCAAGGG CGTCGAGATC GGCGAGGGGT TCGCCGCTGC TGAATTGTCG
GGCGAGGACA ACGCCGACGA AATGCGCTCC GGCAATGAGG GCAAGCCGAT TTTCCTCTCG
AACCATGCTG GCGGCGTGCT TGGGGGAATC TCGACCGGCC AGCCGATCGT CGCGCGCTTC
GCGGTCAAGC CGACCTCCTC GATTTTGAAG CCGCGCCAGA GCATCGACCG CTTCGGGCGC
GAGAGCGAGA TCGTCACCAA GGGGCGACAC GATCCTTGCG TCGGCATCCG GGCTGTCCCG
GTCGGCGAAG CCATGGTCGC CTGCGTCCTC GCCGACCAAT TCCTGCGTCA TCGCGGCCAG
GTCGGGTCTG GTCCGGCTTG GCCGTTTCCG GCCTGA
 
Protein sequence
MSHNSFGHLF RVTTFGESHG PAIGCVVDGC PPGLPLVEDD IQIFLDRRRP GQNRFMTQRQ 
EADKVKILSG VFEDAQTGGQ VTTGTPIALL IENTDQRSKD YEAIKDVYRP GHADYVYDAK
YGVRDYRGGG RSSARETASR VAAGAVARKI IASVKIRGAL VQMGPHKIDR AHWDWEEVDK
NPFFCPDAAA ARFFETYLDG VRKAGSSIGA VIEIVAENAP AGWGAPIYGK LDSEIAAALM
SINAVKGVEI GEGFAAAELS GEDNADEMRS GNEGKPIFLS NHAGGVLGGI STGQPIVARF
AVKPTSSILK PRQSIDRFGR ESEIVTKGRH DPCVGIRAVP VGEAMVACVL ADQFLRHRGQ
VGSGPAWPFP A