Gene Information |
Locus tag | Msed_1873 |
Symbol | |
ID | 5104141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1816146 |
End bp | 1817318 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640507759 |
Product | chorismate synthase |
Protein accession | YP_001191937 |
Protein GI | 146304621 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
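The start, end, gene length, and protein length reported above agree with one another if the coordinates are read as 1-based and inclusive and the last codon is a stop codon. Both of those conventions are assumptions here, since the record does not state them. The function names below are hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Msed_1873.
length = gene_length(1816146, 1817318)
print(length, protein_length(length))  # prints: 1173 390
```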
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.485609 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGGGAA ATAGCTTTGG CAAGATCTTT AGGATTACCA CTTTCGGGGA AAGCCACGGG CCCGCGGTGG GCGTCGTGGT GGACGGTGTC CCTGCAGGGT TAAGGTTGTC CCAGGAGGAT CTTGAATTCG AGCTCTCCTT CAGGAGACCA GGGAGACTCT ACGTGTCAGG GAGAAGGGAG AAGGACGTTC CCGAGATCCT GAGTGGGGTC TATAATGGAA GGACCACTGG GTCTCCCATT GCCATCGTCG TGAAAAACAC TGATGTTATC TCGTCGTTCT ATGAGGAGGT TAAGGTGAAG CCAAGGCCTG GACACGCTGA CCTTCCCTTC ATCATGAGGT ATGGTTACGA AAACTGGGAC TACAGGGGAG GTGGCAGATC GAGCGCCAGG GAGACCGTTG GGAGAGTAGC TGCAGGGGCG ATAGCCAAGA AGCTGTTAAT GTTCCACGAT ACCTGGATAG CGGGGAGGCT CAAGAGTCTT GGACCTGTGG ACTCACCGCC CGCGGACTTC CTTCAAATCT TATGTTCCAA GTACAGCCCC GTGAGGGCTT CCGACCCAGA GACTGAGGCC AAGTTCCAGG AACTGGTGAA ACAGGCAACG GTTGAGGGAG ATAGTTACGG TGGAGTTGCC GAGATAGTCG TGAAGAACCC ACCTGCAGGG TTAGGTGAAC CTGTCTTCGA CAAGATCAAG GCTGACCTGG CCAAGGCGAT CCTCTCTATC CCGGCAGTGA CGGGGTTTGA GTACGGACTG GGCTTTCAAG CTTCAAGAAT GAAGGGAAGC GAGGCAAACG ACTCCATTGT AAGGAAGGGT GAGAGACTTG GTTGGAGGGA GAACAAGTCT GGCGGAATCC TGGGAGGAAT AACTACGGGT GAGGATATAG TGGTGAGATG TTCCTTCAAG CCCACAAGCT CCATAAGGAA ACCGCAGGGG ACGGTGGATC TAAGAACCGG GGAACCGGCA GAGATCTCTG TCCTGGGAAG GCATGATCCA GCTGTCGCAA TCAGGGGAGT CTCCGTGGCT GAGTCCATGG TGGCCCTGAC CCTGGTGGAT CACTCCCTGA GGTCTGGGGT CATCCCACCT GTCAAGCTCG AGGAGAGGCA GGCTGAGGTC ATTGAGGACA GATGGAGGAG GTACATGGAG GAATGCAGGC CTACGGCGGA ATCTCAGTCG TGA
Protein sequence | MPGNSFGKIF RITTFGESHG PAVGVVVDGV PAGLRLSQED LEFELSFRRP GRLYVSGRRE KDVPEILSGV YNGRTTGSPI AIVVKNTDVI SSFYEEVKVK PRPGHADLPF IMRYGYENWD YRGGGRSSAR ETVGRVAAGA IAKKLLMFHD TWIAGRLKSL GPVDSPPADF LQILCSKYSP VRASDPETEA KFQELVKQAT VEGDSYGGVA EIVVKNPPAG LGEPVFDKIK ADLAKAILSI PAVTGFEYGL GFQASRMKGS EANDSIVRKG ERLGWRENKS GGILGGITTG EDIVVRCSFK PTSSIRKPQG TVDLRTGEPA EISVLGRHDP AVAIRGVSVA ESMVALTLVD HSLRSGVIPP VKLEERQAEV IEDRWRRYME ECRPTAESQS
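The sequences above are printed in space-separated blocks of ten characters, and the gene sequence as displayed begins with ATG and ends with TGA. The following is a minimal sketch, not the database's own method, of how such a block-formatted string could be cleaned, checked for GC content, and translated. The helper names are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares; alternative start codons, which table 11 also allows, are not handled here.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1 and drop a trailing stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First three blocks of the gene sequence from the record.
print(translate("ATGCCGGGAA ATAGCTTTGG CAAGATCTTT"))  # prints: MPGNSFGKIF
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would be one way to check that the two fields of the record correspond.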