Gene Tgr7_1538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTgr7_1538 
Symbol 
ID7316565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. HL-EbGR7 
KingdomBacteria 
Replicon accessionNC_011901 
Strand
Start bp1651118 
End bp1652206 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content66% 
IMG OID643616429 
Productchorismate mutase 
Protein accessionYP_002513609 
Protein GI220934710 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC AGGAATCACT GGAACAGATC CGGGCGCGCA TCGATGCCCT GGACGATGAA 
CTGCTCAGGC TGATCAGCGA GCGGGCCCGC TGTGCCCAGG CGGTGGCCAA GGTCAAGCGC
GACGCGGATC CCAACGCCGA GTTCTATCGT CCCGAGCGCG AGGCGCAGAT CCTGCGCAAG
ATCCAGCAGC GCAATCCAGG CCCTCTGGAT GCCGAGGAGA TGGCGCGCCT GTTCCGGGAG
ATCATGTCCG CCTGCCTCGC CCTGGAAGAA CCCCTCAACG TCGCCTTCCT GGGTCCCGAG
GGCACTTTCA CCCAGGCGGC GGCCCTCAAG CACTTCGGCC ACTCGGTGCA CACCGTGCCC
CTGGGTGCCA TCGACGAGGT GTTCCGGGAG GTGGAGTCCG GCGCCGCCCA TTACGGCGTG
GTGCCGGTGG AAAATTCCAC CGAGGGCGTG GTCACCCACA CCCTGGACCG CTTCATGCAG
TCGCCGCTCA AGATCTGCGG CGAGGTGGCC CTGCGCATCC ATCATCACCT GATGGCGAAA
CCCGGCCTCG CCCGGGAACA GGTCAAGCGC ATCTATTCCC ACCAGCAGTC TCTCGCCCAG
TGCCGGGAGT GGCTGGACGC CAACCTGCCC CAGGCCGAGC GCATCCCGGT GAGCAGCAAC
GCCGTGGCCG CGCGGCGTGC CGCCGAGGAG GAGGGCGCCG GTGCCATCGC CAGCCAGGCG
GCCGCCGAAC GTTACGTGCT CAATGTCCTC AACGCCAACA TCGAGGATGC CCCGGACAAC
ACCACCCGTT TCCTGGTGAT TGGTCAGCGG GCCAGCGGGC CGTCAGGCCG GGACAAGACT
TCCCTGCTGC TGTCCACCCG AAACCGCCCC GGTTCCCTGT ACCGCCTGCT GGAGCCCTTC
GCCCGCGCGG ACGTGAGCCT GACGCGCATC GAATCCCGTC CCTCCCACTG CGTGAACTGG
GATTACGTGT TCTTCATCGA CGTGGAAGGT CACGAGGAAG ACGACAAGGT GCGCCAGGCC
ATCGCGGCCC TGGAACAGGA AGCGGATCTG GTCAAGGTGC TGGGGTCCTA TCCCAGAGCG
GTGCTTTGA
 
Protein sequence
MSEQESLEQI RARIDALDDE LLRLISERAR CAQAVAKVKR DADPNAEFYR PEREAQILRK 
IQQRNPGPLD AEEMARLFRE IMSACLALEE PLNVAFLGPE GTFTQAAALK HFGHSVHTVP
LGAIDEVFRE VESGAAHYGV VPVENSTEGV VTHTLDRFMQ SPLKICGEVA LRIHHHLMAK
PGLAREQVKR IYSHQQSLAQ CREWLDANLP QAERIPVSSN AVAARRAAEE EGAGAIASQA
AAERYVLNVL NANIEDAPDN TTRFLVIGQR ASGPSGRDKT SLLLSTRNRP GSLYRLLEPF
ARADVSLTRI ESRPSHCVNW DYVFFIDVEG HEEDDKVRQA IAALEQEADL VKVLGSYPRA
VL