Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_0846 |
Symbol | |
ID | 7093278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 931463 |
End bp | 932578 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643464183 |
Product | chorismate synthase |
Protein accession | YP_002361178 |
Protein GI | 217977031 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.742224 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCACA ATAGTTTCGG CCATCTGTTT CGCGTGACGA CCTTTGGCGA GAGCCATGGG CCGGCGATCG GCTGCGTCGT CGACGGCTGC CCTCCCGGCC TTCCCCTTGT CGAAGATGAC ATCCAGATCT TTCTCGACCG GCGGCGGCCG GGCCAGAACC GCTTCATGAC GCAGCGGCAG GAGGCGGACA AGGTCAAAAT TCTGTCGGGC GTCTTCGAGG ACGCGCAAAC CGGCGGCCAA GTGACGACCG GAACGCCGAT CGCGCTCCTG ATCGAGAACA CCGATCAGCG TTCCAAGGAT TATGAGGCGA TCAAGGACGT CTATCGTCCG GGCCACGCGG ATTATGTCTA TGACGCCAAA TATGGCGTCA GGGACTATCG CGGCGGCGGC CGCTCGTCCG CGCGCGAGAC GGCGAGCCGC GTCGCCGCCG GCGCGGTGGC GCGCAAGATC ATCGCATCGG TCAAGATCCG CGGCGCCCTC GTGCAGATGG GGCCGCATAA AATCGACCGC GCTCATTGGG ATTGGGAGGA GGTCGACAAA AACCCCTTCT TCTGCCCCGA CGCCGCGGCG GCGCGTTTCT TCGAAACCTA TCTCGACGGC GTGCGCAAAG CCGGCTCCTC GATCGGGGCG GTCATCGAGA TCGTCGCCGA GAATGCGCCG GCCGGCTGGG GCGCGCCGAT CTACGGCAAG CTCGATTCGG AGATCGCCGC GGCGCTGATG TCGATCAACG CAGTCAAGGG CGTCGAGATC GGCGAGGGGT TCGCCGCTGC TGAATTGTCG GGCGAGGACA ACGCCGACGA AATGCGCTCC GGCAATGAGG GCAAGCCGAT TTTCCTCTCG AACCATGCTG GCGGCGTGCT TGGGGGAATC TCGACCGGCC AGCCGATCGT CGCGCGCTTC GCGGTCAAGC CGACCTCCTC GATTTTGAAG CCGCGCCAGA GCATCGACCG CTTCGGGCGC GAGAGCGAGA TCGTCACCAA GGGGCGACAC GATCCTTGCG TCGGCATCCG GGCTGTCCCG GTCGGCGAAG CCATGGTCGC CTGCGTCCTC GCCGACCAAT TCCTGCGTCA TCGCGGCCAG GTCGGGTCTG GTCCGGCTTG GCCGTTTCCG GCCTGA
|
Protein sequence | MSHNSFGHLF RVTTFGESHG PAIGCVVDGC PPGLPLVEDD IQIFLDRRRP GQNRFMTQRQ EADKVKILSG VFEDAQTGGQ VTTGTPIALL IENTDQRSKD YEAIKDVYRP GHADYVYDAK YGVRDYRGGG RSSARETASR VAAGAVARKI IASVKIRGAL VQMGPHKIDR AHWDWEEVDK NPFFCPDAAA ARFFETYLDG VRKAGSSIGA VIEIVAENAP AGWGAPIYGK LDSEIAAALM SINAVKGVEI GEGFAAAELS GEDNADEMRS GNEGKPIFLS NHAGGVLGGI STGQPIVARF AVKPTSSILK PRQSIDRFGR ESEIVTKGRH DPCVGIRAVP VGEAMVACVL ADQFLRHRGQ VGSGPAWPFP A
|
| |