Gene DET0461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDET0461 
SymboltyrA 
ID3230182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides ethenogenes 195 
KingdomBacteria 
Replicon accessionNC_002936 
Strand
Start bp434281 
End bp435357 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content50% 
IMG OID637120027 
Productchorismate mutase/prephenate dehydratase 
Protein accessionYP_181205 
Protein GI57234713 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01808] monofunctional chorismate mutase, high GC gram positive type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.123975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTTT CAGACCTGCG CAAACAGATA GATGAACTGG ATGCCGAGCT GGTCAAGCTG 
ATGGCAAAGA GGCTTGAGGT ATCCGACCAG ATAGGCAAAG TCAAAGAAGA AACTAACTCC
CCTGTTCAGG ACCTCTCACG TGAATCAGAG GTTTTAAACA GGGTTCAGTC ACTGGCCCGG
TCTCTGGGTC TTGACCCGCA GGATATTGAA TCCCTGTACC AGGAAATACT GTTTATATCC
AAGAAACAGC AGCGGTTCAC CGTAGCCTTT CAGGGAGCGG CCGGTGCTTA CAGTGAGGAA
ACTGCCCTGA AAATATTCGG CCCCAACACC CTCGCCCTGC CTTACGAACA GCTGGACGGG
GCTTTTGAGG CAGTGGAAAA AGGAATGGCC CGCTTTGCGG TAGTGCCGGT GGAAAACTCA
CTTGAGGGCT CTATTTCCCG CACCTATGAC CTGCTGTTTG ATTCTAACCT TATGGTTGCC
GCCGAACATG AGCTAAGGGT TTCCCACTGT CTGATAGCCA ACCCCGAAAC CACTCTGGAA
GGGGTAAAAA CCATTTATTC CCACCCCCAG GCACTGGGGC AATGCCAGTC ATTTTTAAAA
CACCTGCGGG CAGAGCTGAT ACCGGCCTAC GATACCGCCG GCAGTGTCAA AATGATTAAA
GAAAAACACC TTTTAGACGG GGCGGCCATC GCCTCTGAAA GAGCGGCCGT AATTTATAAT
ATGAAAGTGC TGGAACGGGA AATAGAGGAT AATATAAACA ACTACACCCG TTTCTTCGTC
CTTGCCAAGC AGGATTCCGC ACCCAGCGGC AATGATAAAA CTTCGGTGGT CTTCGCCGTC
AAACATGAGG CCGGGGCGCT GTATGACTTC ATAAAGGAAC TGGCCTCCAG AAAAATAAAT
ATGACCAAGC TGGAATCCCG TCCCACCCGC CTTAAACCCT GGGAGTACAA CTTTTACCTG
GATATAGAAG GCCACCGCCA GGACGAAAAC ATTAAACAGG CGCTGGCAAA AGCCGAAGAC
CATGTTATAT TTATGAAAGT GCTTGGGTCT TACCCCAAAA TGAAAAAACG AATATGA
 
Protein sequence
MNLSDLRKQI DELDAELVKL MAKRLEVSDQ IGKVKEETNS PVQDLSRESE VLNRVQSLAR 
SLGLDPQDIE SLYQEILFIS KKQQRFTVAF QGAAGAYSEE TALKIFGPNT LALPYEQLDG
AFEAVEKGMA RFAVVPVENS LEGSISRTYD LLFDSNLMVA AEHELRVSHC LIANPETTLE
GVKTIYSHPQ ALGQCQSFLK HLRAELIPAY DTAGSVKMIK EKHLLDGAAI ASERAAVIYN
MKVLEREIED NINNYTRFFV LAKQDSAPSG NDKTSVVFAV KHEAGALYDF IKELASRKIN
MTKLESRPTR LKPWEYNFYL DIEGHRQDEN IKQALAKAED HVIFMKVLGS YPKMKKRI