Gene Information |
Locus tag | Csal_2166 |
Symbol | |
ID | 4026660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2436635 |
End bp | 2437756 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967371 |
Product | chorismate mutase / prephenate dehydratase |
Protein accession | YP_574216 |
Protein GI | 92114288 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 |
|
|
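The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Csal_2166.
length = gene_length(2436635, 2437756)
print(length)                  # 1122
print(protein_length(length))  # 373
```

Both results agree with the Gene Length (1122 bp) and Protein Length (373 aa) fields of the record.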
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0178196 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATC ACGACATGTC CGATAACAAC GTCCCCGTCA CTTCGGCCGA CCTGCCGGCA TTGCGCGAGC GGATCGATGC GCTGGATAGC CAGATCCTCG AGCTGATCAG CGAGCGTGCC CATTGCGCGC AGCAGGTGGC GCAGGTGAAG ACCGATTCCG ATCCTCAGGC GACCTTCTAT CGACCCGAGC GCGAGGCCCA GGTCCTGCGG CGCATCATGG CGCTCAACAA AGGGCCGCTC GACGACGAGG AAATGGCCCG CCTGTTTCGC GAGATCATGT CGGCGTGCCT GGCGCTGGAG CGGCCGGTCA AGGTCGCGTA TCTGGGGCCC GAGGGTACCT TCACCCAGCA GGCAGCGCTC AAGCATTTCG GCGATAGCGC GGTGAGTCTG CCGATGGCCG CCATCGACGA GGTCTTCCGC GAGGTGGAAG CCGGCGCCGC GCATTTCGGG GTGGTGCCGG TGGAAAACTC CACCGAGGGG ATCGTCAACA GCACGCTGGA TACCTTCATG GACGCCAGCC TGCGAATCTG CGGCGAGGTG GTGCTGCGCA TTCACCACCA TCTGCTGGTT TCCGATACCA CGCGTCGCGA CAAGATCTCG CGGATCTATT CACATCCCCA GTCGCTGGCG CAGTGCCGCA AGTGGTTGGA TGCGCATTAC CCCAATGCCG AGCGAGTGCC GGTGTCCTCC AACGCCGAAG CGGCGCGCCT GATCAAGAGC GAATGGCACA GCGCCGCGAT CGCCGGCGAC ATGGCCGCCA AGCGCTACGC GCTGGACAAG GTCGCCGAGA AGATCGAGGA TCGACCCGAC AACTCGACGC GCTTTCTGAT CATCGGCCAC CAGGACACGC CGATCTCAGG CGACGACAAG ACATCCATCG TCGTCGCCAT GCGCAACCAG CCCGGGGCGC TGCACGATCT GCTCGAGCCG TTCCATCGCC ACAAGATCGA CCTGACCCGC GTCGAGACCC GGCCATCGCG CACGGGGGTC TGGAACTACG TATTCTTCAT CGACTTCAAG GGCCACCGCG ACGACCCGCA GGTGGCGGCG GTGCTCGAGG AGATCACCCT GCGTGCCGCC GAGCTCAAGG TGCTGGGGTC CTATCCGGTG GGTGTGCTGT AA
|
Protein sequence | MADHDMSDNN VPVTSADLPA LRERIDALDS QILELISERA HCAQQVAQVK TDSDPQATFY RPEREAQVLR RIMALNKGPL DDEEMARLFR EIMSACLALE RPVKVAYLGP EGTFTQQAAL KHFGDSAVSL PMAAIDEVFR EVEAGAAHFG VVPVENSTEG IVNSTLDTFM DASLRICGEV VLRIHHHLLV SDTTRRDKIS RIYSHPQSLA QCRKWLDAHY PNAERVPVSS NAEAARLIKS EWHSAAIAGD MAAKRYALDK VAEKIEDRPD NSTRFLIIGH QDTPISGDDK TSIVVAMRNQ PGALHDLLEP FHRHKIDLTR VETRPSRTGV WNYVFFIDFK GHRDDPQVAA VLEEITLRAA ELKVLGSYPV GVL
|
| |
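The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before further use. The sketch below removes the spacing, computes GC content, and translates a coding sequence. It uses the codon-to-amino-acid assignments of the standard genetic code. Translation table 11 uses the same assignments and differs only in which codons are permitted as start codons, so the simplification is adequate here because this gene starts with ATG. All function and variable names are hypothetical, and only the first three and last two blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
# (standard codon assignments; "*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in the record.
head = clean_sequence("ATGGCTGATC ACGACATGTC CGATAACAAC")
print(len(head))        # 30
print(translate(head))  # MADHDMSDNN

# Last nine bases of the gene sequence in the record (two codons plus the stop).
tail = clean_sequence("GTGCTGT AA")
print(translate(tail))  # VL
```

The translated fragments match the first ten residues (MADHDMSDNN) and the last two residues (VL) of the protein sequence in the record. Running `gc_percent` on the full cleaned gene sequence gives a value to compare against the listed GC content of 64%.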