Gene SeD_A2996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2996 
SymboltyrA 
ID6871990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2889292 
End bp2890413 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content55% 
IMG OID642786032 
Productbifunctional chorismate mutase/prephenate dehydrogenase 
Protein accessionYP_002216678 
Protein GI198246229 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0287] Prephenate dehydrogenase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01799] chorismate mutase domain of T-protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGCTG AATTGACCGC GTTACGCGAT CAAATAGATG ATGTCGATAA AGCGTTGTTG 
AATTTACTGG CTAAGCGCCT GGAACTGGTT GCCAAAGTCG GCGAGGTGAA AAGCCGTTTT
GGCCTGCCTA TTTACGTGCC GGAGCGTGAG GCCTCTATGC TGGCTTCACG ACGGGCGGAA
GCAGAAGCGA TCGGTGTCCC GCCCGATCTC ATTGAAGATG TCCTGCGCCG GGTAATGCGT
GAATCTTACT CCAGCGAAAA TGATAAGGGG TTCAAAACGC TTTGTCCTTC TCTGCGTCCG
GTCGTCATTG TGGGCGGCGG CGGACAGATG GGGCGTCTGT TTGAAAAAAT GCTCACGCTG
TCGGGCTATC AGGTCCGTAT TCTGGAACAG CAGGACTGGC CGCGCGCCAG GGACATTGTC
GCCGATGCCG GAATGGTGAT CGTCAGCGTG CCGATTCATG TTACTGAACA GGTCATAGCG
CAACTGCCGC CCCTGCCGTC CGACTGTATT CTGGTCGATC TGGCATCGGT GAAAAGCGAT
CCGTTACAGG CAATGTTGGC GGCCCATGAT GGCCCCGTGT TGGGCTTGCA TCCGATGTTT
GGCCCGGACA GCGGGAGCCT GGCGAAGCAG GTGGTGGTCT GGTGTGACGG GCGTCAACCG
GAAGCGTATC AGTGGTTCCT TGAGCAAATC CAGGTGTGGG GCGCTCGGTT GCACCGAATT
AGCGCTGTCG AGCACGATCA GAACATGGCT TTTATCCAGG CGTTGCGCCA TTTTGCTACC
TTCGCTTATG GGCTGCATCT GGCGGAAGAG AACGTCCAGC TTGAGCAGCT TCTGGCGCTA
TCATCGCCGA TTTATCGACT GGAGCTGGCG ATGGTCGGGC GTCTGTTCGC CCAGGACCCG
CAGCTGTATG CGGACATTAT TATGTCGTCG GAGCGCAATC TGGCGCTTAT CAAGCGTTAC
TATAAACGTT TTGGCGATGC GATCGGGTTA CTGGAACAAG GTGATAAGCA GGCTTTTATC
GACAGTTTTC GCAAAGTTGA ACACTGGTTT GGCGATTATG CCAGACGCTT CCAGAATGAA
AGCCGTGTGT TATTGCGTCA GGCGAATGAC AGCCGACCAT AA
 
Protein sequence
MVAELTALRD QIDDVDKALL NLLAKRLELV AKVGEVKSRF GLPIYVPERE ASMLASRRAE 
AEAIGVPPDL IEDVLRRVMR ESYSSENDKG FKTLCPSLRP VVIVGGGGQM GRLFEKMLTL
SGYQVRILEQ QDWPRARDIV ADAGMVIVSV PIHVTEQVIA QLPPLPSDCI LVDLASVKSD
PLQAMLAAHD GPVLGLHPMF GPDSGSLAKQ VVVWCDGRQP EAYQWFLEQI QVWGARLHRI
SAVEHDQNMA FIQALRHFAT FAYGLHLAEE NVQLEQLLAL SSPIYRLELA MVGRLFAQDP
QLYADIIMSS ERNLALIKRY YKRFGDAIGL LEQGDKQAFI DSFRKVEHWF GDYARRFQNE
SRVLLRQAND SRP