Gene Information |
Locus tag | Pars_2038 |
Symbol | |
ID | 5055909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1822178 |
End bp | 1823065 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640469587 |
Product | chorismate mutase |
Protein accession | YP_001154236 |
Protein GI | 145592234 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0710] 3-dehydroquinate dehydratase; [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01093] 3-dehydroquinate dehydratase, type I; [TIGR01808] monofunctional chorismate mutase, high GC gram positive type |
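The length fields in this table can be cross-checked against the coordinates. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon, which is consistent with the gene being unspliced. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 1822178
END_BP = 1823065


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 888
print(protein_length(length_bp))  # 295
```

Both results agree with the Gene Length (888 bp) and Protein Length (295 aa) fields.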
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATATGCG GAGCGGTCCC CGTCAGGAGA CCGAGAGACG TGGAAAGGGC GCTGGAGGCG CCGCTTACGT GCCTTGAGCT TAGACTCGAC TACCTAGAGG CGCCGCTGTC TGAGGCATGG CCTGTGCTGG AGGAGGCGGC GGCGCGCCGC ACGGTTATAG TCACGGTGAG GAGGAGGGAG GAGGGCGGGC ACTGGCGGGG CGGCGAGGAG GAGAGAGAGG CGTTGTACAG AAAGCTCCTC GACCTCAACC CCCACTACGT CGACGTCGAG GCGGAGTCCC CCATCGCCCC GAGAATTGCC GAGGTAAAGG GCAGGGCCAA GCTCATAGCC AGCAGACACG ACTTCGGGGG GACGCCCCCG CTGGAGGTTC TCAGAAGCTG GGCGGAGAAG GCGGCGGCGC TGGGCGACGT GGTAAAGGTG GTTACCTACG CCCGGGAGCC GGCCGACGGG CTTAGAGTCC TCTCGCTTAT AGGAGCCGTG GAGAAGCTTG TTGTCGCCTT CGCCATGGGC CCCGCCGGGA CGTACACCAG AGTGGCGGCG GCGGCGCTGG GCAGCCCCAT TATGTACGTC TCGCTGGGCG AAGCCACGGC GCCGGGGCAG ATCGCCGCAG ATGCCTACTT CGCCGCCCTC ACCGCGCTGG GCATCGCCCC GGCGGGGGAG GGCCTCCCCT CGCTTAGAGA GGCGCTGGAC TGGATAGACG GCGGCCTCAT GTACCTGCTT AGGAAGAGGC TAGAGATCTG CCGCGACATG GGCAAGCTCA AGAAGGCAGC CGGCCTGCCT GTGTACGACG ACGTGAGGGA GGCCCAAGTG TTAAGACGAT CCGGCGACTT CAAGCAGATC TTCGAGCTCG TCGTCCAGAT GTGCAAGGCT GTGCAACTGG TGGCATAG
|
Protein sequence | MICGAVPVRR PRDVERALEA PLTCLELRLD YLEAPLSEAW PVLEEAAARR TVIVTVRRRE EGGHWRGGEE EREALYRKLL DLNPHYVDVE AESPIAPRIA EVKGRAKLIA SRHDFGGTPP LEVLRSWAEK AAALGDVVKV VTYAREPADG LRVLSLIGAV EKLVVAFAMG PAGTYTRVAA AALGSPIMYV SLGEATAPGQ IAADAYFAAL TALGIAPAGE GLPSLREALD WIDGGLMYLL RKRLEICRDM GKLKKAAGLP VYDDVREAQV LRRSGDFKQI FELVVQMCKA VQLVA
|
| |
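The two sequences can be related to each other with a short translation routine. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with TTG and ends with the stop codon TAG), that translation table 11 uses the standard codon assignments, and that an alternative start codon such as TTG is read as methionine at the first position. The helper names and the start-codon set are assumptions made for illustration.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (same assignments as table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed set of initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases, for comparison with the GC content field."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "TTGATATGCG GAGCGGTCCC CGTCAGGAGA"
print(translate(first_blocks))  # MICGAVPVRR
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be checked against the Protein sequence and GC content fields.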