Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1538 |
Symbol | |
ID | 7316565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1651118 |
End bp | 1652206 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643616429 |
Product | chorismate mutase |
Protein accession | YP_002513609 |
Protein GI | 220934710 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAC AGGAATCACT GGAACAGATC CGGGCGCGCA TCGATGCCCT GGACGATGAA CTGCTCAGGC TGATCAGCGA GCGGGCCCGC TGTGCCCAGG CGGTGGCCAA GGTCAAGCGC GACGCGGATC CCAACGCCGA GTTCTATCGT CCCGAGCGCG AGGCGCAGAT CCTGCGCAAG ATCCAGCAGC GCAATCCAGG CCCTCTGGAT GCCGAGGAGA TGGCGCGCCT GTTCCGGGAG ATCATGTCCG CCTGCCTCGC CCTGGAAGAA CCCCTCAACG TCGCCTTCCT GGGTCCCGAG GGCACTTTCA CCCAGGCGGC GGCCCTCAAG CACTTCGGCC ACTCGGTGCA CACCGTGCCC CTGGGTGCCA TCGACGAGGT GTTCCGGGAG GTGGAGTCCG GCGCCGCCCA TTACGGCGTG GTGCCGGTGG AAAATTCCAC CGAGGGCGTG GTCACCCACA CCCTGGACCG CTTCATGCAG TCGCCGCTCA AGATCTGCGG CGAGGTGGCC CTGCGCATCC ATCATCACCT GATGGCGAAA CCCGGCCTCG CCCGGGAACA GGTCAAGCGC ATCTATTCCC ACCAGCAGTC TCTCGCCCAG TGCCGGGAGT GGCTGGACGC CAACCTGCCC CAGGCCGAGC GCATCCCGGT GAGCAGCAAC GCCGTGGCCG CGCGGCGTGC CGCCGAGGAG GAGGGCGCCG GTGCCATCGC CAGCCAGGCG GCCGCCGAAC GTTACGTGCT CAATGTCCTC AACGCCAACA TCGAGGATGC CCCGGACAAC ACCACCCGTT TCCTGGTGAT TGGTCAGCGG GCCAGCGGGC CGTCAGGCCG GGACAAGACT TCCCTGCTGC TGTCCACCCG AAACCGCCCC GGTTCCCTGT ACCGCCTGCT GGAGCCCTTC GCCCGCGCGG ACGTGAGCCT GACGCGCATC GAATCCCGTC CCTCCCACTG CGTGAACTGG GATTACGTGT TCTTCATCGA CGTGGAAGGT CACGAGGAAG ACGACAAGGT GCGCCAGGCC ATCGCGGCCC TGGAACAGGA AGCGGATCTG GTCAAGGTGC TGGGGTCCTA TCCCAGAGCG GTGCTTTGA
|
Protein sequence | MSEQESLEQI RARIDALDDE LLRLISERAR CAQAVAKVKR DADPNAEFYR PEREAQILRK IQQRNPGPLD AEEMARLFRE IMSACLALEE PLNVAFLGPE GTFTQAAALK HFGHSVHTVP LGAIDEVFRE VESGAAHYGV VPVENSTEGV VTHTLDRFMQ SPLKICGEVA LRIHHHLMAK PGLAREQVKR IYSHQQSLAQ CREWLDANLP QAERIPVSSN AVAARRAAEE EGAGAIASQA AAERYVLNVL NANIEDAPDN TTRFLVIGQR ASGPSGRDKT SLLLSTRNRP GSLYRLLEPF ARADVSLTRI ESRPSHCVNW DYVFFIDVEG HEEDDKVRQA IAALEQEADL VKVLGSYPRA VL
|
| |