Gene MCA1418 details

Gene Information

Locus tag: MCA1418
Symbol: pheA
ID: 3102688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1504421
End bp: 1505509
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 64%
IMG OID: 637170593
Product: chorismate mutase/prephenate dehydratase
Protein accession: YP_113875
Protein GI: 53804245
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase; [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2
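
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. Neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information record.
START_BP = 1504421
END_BP = 1505509
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 1505509 - 1504421 + 1 gives 1089 bp, and 1089 / 3 - 1 gives 362 aa, matching the listed lengths.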


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAACG ATCCTTCTCT CGCGGAACTG CGCAAGCGCA TCGACGAACT CGACGACCGG 
GTGCTGGAAC TGCTCAACCA GCGGGCGAGG TGTGCCCAGC GGGTGGCCGA CATCAAGGTG
GCGGCAGGCG AGACCGACTG CTTCTACCGT CCCGAACGGG AAGCGGAAAT CCTGCAGCGG
TTGACAGCGC ACAATCCCGG CCCGCTCGGC CGAGAGGCCG TGGTCCGCTT TTTCCGCGAA
GTGATGTCGG AATGCCTGGC CCTCGAAAAG CCGCTGAGCG TCGCCTTCCT CGGACCGGAA
GGAACCTTCA CCCAACAGGC GGCCTACAGG CATTTCGGTC ATGCCATCCA GGCCGTCCCG
ATGCCGGCCA TCGACGAAAT CTTCCGGGCT GTGGAGAGCG GTGCCTGTCA TTACGGTGTG
GTGCCGGTCG AGAATTCGAC TGAAGGCGTC ATCACCCACA CCCTGGATAG CTTCGTGCGC
TTCAGCCTGA TCATCGCCGG GGAGGTGCAG CTGCGCATCC ACCACAACCT GCTGTGCAGG
ACACCGACCG CGCTGACCGA GCTGACCGAA GTGTTCTCCC ATCCGCAGTC GCTGGCGCAA
TGCCGGGGCT GGCTGGACCG TTTTCTGCCG GGTGTACGCC GCACCCCCCT CGGCAGCAAC
GCCGAAGCCG CCCGGCGGGC GGCGGAAACC GCCGGTACGG CGGCGATCGC CGGCGAAGTG
GCGGCGGGAC TCTATGGCCT GGAGATCCTG AACCGCAACA TCGAAGACGA ACCCGACAAT
ACCACCCGGT TCCTGGTCAT CGGCGGCCAG CCGGTGGGAC CGACTGGCCA CGACAAAACT
TCGCTGTTAC TGTCCACCCG CAATGACCCG GGTGCGCTTT TCCGCCTCAT CGAGCCATTC
GCGCGCCTGG GGATCAGCAT GACCAAGATC GAATCGCGGC CTTCGCGGCG CGGCATGTGG
GACTACTTTT TTTTCATCGA CGTGGAAGGG CATCAGGCTG ATCCCACCCT GGCGCAGGCC
CTCGCCGAGG TGCGTGAACA CTGCTGCATG ATGCGTATCC TCGGTTCCTA TCCACGCGCA
CTGAGCTGA
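
The gene sequence is printed in blocks of ten bases, so it has to be joined before any calculation. The sketch below shows one way to do that and to recompute the GC content field from the record. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tool; the short input in the example is just the first two blocks of the sequence above.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for empty input)."""
    seq = clean_sequence(sequence)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as printed in the record.
    sample = "ATGGCGAACG ATCCTTCTCT"
    print(clean_sequence(sample))
    print(round(gc_content(sample) * 100), "%")
```

Passing the full gene sequence instead of the sample lets the result be compared with the 64% reported in the record.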
 
Protein sequence
MANDPSLAEL RKRIDELDDR VLELLNQRAR CAQRVADIKV AAGETDCFYR PEREAEILQR 
LTAHNPGPLG REAVVRFFRE VMSECLALEK PLSVAFLGPE GTFTQQAAYR HFGHAIQAVP
MPAIDEIFRA VESGACHYGV VPVENSTEGV ITHTLDSFVR FSLIIAGEVQ LRIHHNLLCR
TPTALTELTE VFSHPQSLAQ CRGWLDRFLP GVRRTPLGSN AEAARRAAET AGTAAIAGEV
AAGLYGLEIL NRNIEDEPDN TTRFLVIGGQ PVGPTGHDKT SLLLSTRNDP GALFRLIEPF
ARLGISMTKI ESRPSRRGMW DYFFFIDVEG HQADPTLAQA LAEVREHCCM MRILGSYPRA
LS
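
The protein sequence can be reproduced from the gene sequence by codon translation. The record lists translation table 11, whose codon-to-amino-acid assignments are the same as the standard code (the two tables differ in permitted start codons). The sketch below is a minimal pure-Python translation under that assumption. The function name `translate` is hypothetical, and the code expects only A, C, G and T, raising a `KeyError` on any other base.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    print(translate("ATGGCGAACG ATCCTTCTCT CGCGGAACTG"))  # MANDPSLAEL
    # Last nine bases of the gene sequence, ending in the TGA stop codon.
    print(translate("CTGAGCTGA"))  # LS
```

The two examples match the first ten and last two residues of the protein sequence listed above.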