Gene Information |
Locus tag | Hhal_0568 |
Symbol | |
ID | 4710227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 642859 |
End bp | 643947 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855026 |
Product | chorismate mutase |
Protein accession | YP_001002156 |
Protein GI | 121997369 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase; [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 |
|
|
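The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the Gene Information fields for Hhal_0568.
gene_len = feature_length(642859, 643947)
aa_len = protein_length(gene_len)
print(gene_len, aa_len)  # 1089 362
```

Both results match the Gene Length (1089 bp) and Protein Length (362 aa) fields.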
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTCG ACAATGACGA CCTCGCCGCG CTGCGGCAAC GGATCGACGG GATCGACGAC CAGATCCTCG AGCTGGTCAG TGAGCGGGCC CGCCTGGCCG AGGAGGTGGC CCGCGCCAAG GCCCGCCGCG GCGAGGCGCT CAAGGTCTAC CGCCCGGAGC GCGAGGCGGA GATCCTGCGC CGGTTGACCG AGCACAACCG TGGGCCGCTG ACCCGTGAGC AGGTCACGGT GCTCTTCCGC GAGGTGATGT CCGCCTGCCG GGCGCTCCAG CAGCCGCTGA CCGTCGCCTT TCTCGGCCCC CAGGGGACCT TCACCGAAGA GGCGGCCAGC AAGCACTTCG GCCACGATGC CGGGATGTCG CCCCAGGCCA CCATCGGCGG GGTCTTCCGC GAGGTGGAGT CGGGCGCGGC GCACTACGGG GTGGTGCCGG TGGAGAACTC CTCCGAGGGC GTCGTCAGTC ATACCCTGGA TCGCTTCCTG GATTCGGAGC TGGCCATCGT CGGTGAGGTG GAGCTACGCA TCCACCACGC CCTGGCCAGT CACGCCGGGG GGCTGACCTC GATTGACCGG GTCTACTCCC ACCAGCAGGG GCTGTCGCAG TGCCGCGCGT GGCTGGAGAC CCACCTCCCG CAGGCCGAGC GCCACCCGGT CTCCAGCACG GCCGAAGCGG CCCGGCTGGC GGCCCTGGAG CCGAGCGCGG CGGCCATCGC CAGCGAGGCG GCCGCGGAGC GCTACGGCGT GCCGCTGCTT CAGGAGCGCA TCGAGGACTA CCACGGCAAC ACCACCCGTT TTCTGGTTCT GGGCTACCAG TCCCCGCCGC CCAGCGGCCA CGACAAGACC TCGCTGGTGG TCTCCAGCGC CAACCGCTCC GGGTTGCTCT TTCAGCTGCT CGAGCCGCTG GCGCGCAACG GCATCGACAT GACCCGGATC GAGTCGCGTC CGGCACGCCA GCGCGGGGTT TGGGAGTACG TCTTCTTCAT CGATATCCTC GGGCACGCCG AGGACGAGAG CCTGCGCGGT CCCCTTGCGG AGATGCGCGA GCGTGCCAGC CTCTTCCGTA TCCTCGGCTC CTATCCAAGG GCGATCTGA
|
Protein sequence | MSVDNDDLAA LRQRIDGIDD QILELVSERA RLAEEVARAK ARRGEALKVY RPEREAEILR RLTEHNRGPL TREQVTVLFR EVMSACRALQ QPLTVAFLGP QGTFTEEAAS KHFGHDAGMS PQATIGGVFR EVESGAAHYG VVPVENSSEG VVSHTLDRFL DSELAIVGEV ELRIHHALAS HAGGLTSIDR VYSHQQGLSQ CRAWLETHLP QAERHPVSST AEAARLAALE PSAAAIASEA AAERYGVPLL QERIEDYHGN TTRFLVLGYQ SPPPSGHDKT SLVVSSANRS GLLFQLLEPL ARNGIDMTRI ESRPARQRGV WEYVFFIDIL GHAEDESLRG PLAEMRERAS LFRILGSYPR AI
|
| |
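The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch, not the portal's own tooling, of stripping the spacing, computing a GC fraction, and translating the coding strand. It assumes the gene sequence is given 5' to 3' on the coding strand (so no reverse complement is applied even though the gene lies on the minus strand) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence as printed in the record.
prefix = clean_sequence("ATGTCCGTCG ACAATGACGA")
print(translate(prefix))  # MSVDND
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared against the GC content field, and `translate` gives a string that can be compared against the cleaned protein sequence.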