Gene ECD_02489 details


Gene Information

Locus tag: ECD_02489
Symbol: tyrA
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 2605462
End bp: 2606583
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: fused chorismate mutase T/prephenate dehydrogenase
Protein accession: ACT44308
Protein GI: 253978638
COG category:
COG ID:
TIGRFAM ID:
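
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2605462
end_bp = 2606583


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1122 373
```

Under these assumptions the computed values agree with the 1122 bp and 373 aa reported in the record.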


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.555514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGCTG AATTGACCGC ATTACGCGAT CAAATTGATG AAGTCGATAA AGCGCTGCTG 
AATTTATTAG CGAAGCGTCT GGAACTGGTT GCTGAAGTGG GCGAGGTGAA AAGCCGCTTT
GGACTGCCTA TTTATGTTCC GGAGCGCGAG GCATCTATGT TGGCCTCGCG GCGCGCAGAG
GCGGAAGCTC TGGGTGTACC GCCAGATCTG ATTGAGGATG TTTTGCGTCG GGTGATGCGT
GAATCTTACT CCAGTGAAAA CGACAAAGGA TTTAAAACAC TTTGTCCGTC ACTGCGTCCG
GTGGTTATCG TCGGCGGTGG CGGTCAGATG GGACGCCTGT TCGAGAAGAT GCTGACACTA
TCGGGTTATC AGGTGCGGAT TCTGGAGCAA CATGACTGGG ATCGAGCGGC TGATATTGTT
GCCGATGCCG GAATGGTGAT TGTTAGTGTG CCAATCCACG TTACTGAGCA AGTTATCGGC
AAATTACCGC CTTTACCGAA AGATTGTATT CTGGTTGATC TGGCATCAGT GAAAAATGGA
CCATTACAGG CCATGCTGGC GGCGCACGAT GGCCCGGTAC TGGGGTTACA CCCGATGTTC
GGCCCGGACA GCGGTAGCCT GGCAAAGCAA GTTGTGGTCT GGTGTGATGG ACGTAAGCCG
GAAGCATACC AATGGTTTCT GGAGCAAATT CAGGTCTGGG GCGCTCGGCT GCATCGTATT
AGCGCTGTCG AGCACGATCA GAATATGGCG TTTATTCAGG CTCTGCGCCA CTTTGCTACT
TTTGCTTATG GGCTGCATCT GGCAGAAGAA AATGTTCAGC TTGAGCAACT TCTGGCGCTC
TCTTCGCCGA TTTACCGCCT TGAGCTGGCG ATGGTCGGGC GACTGTTTGC TCAGGATCCG
CAGCTTTATG CCGACATTAT TATGTCGTCA GAGCGTAATC TGGCGTTAAT CAAACGTTAC
TATAAGCGTT TCGGCGAGGC GATTGAGTTG CTGGAGCAGG GCGATAAGCA GGCGTTTATT
GACAGTTTCC GCAAGGTGGA GCACTGGTTC GGCGATTACG CACAGCGTTT TCAGAGTGAA
AGCCGCGTGT TATTGCGTCA GGCGAATGAC AACCGCCAGT AA
 
Protein sequence
MVAELTALRD QIDEVDKALL NLLAKRLELV AEVGEVKSRF GLPIYVPERE ASMLASRRAE 
AEALGVPPDL IEDVLRRVMR ESYSSENDKG FKTLCPSLRP VVIVGGGGQM GRLFEKMLTL
SGYQVRILEQ HDWDRAADIV ADAGMVIVSV PIHVTEQVIG KLPPLPKDCI LVDLASVKNG
PLQAMLAAHD GPVLGLHPMF GPDSGSLAKQ VVVWCDGRKP EAYQWFLEQI QVWGARLHRI
SAVEHDQNMA FIQALRHFAT FAYGLHLAEE NVQLEQLLAL SSPIYRLELA MVGRLFAQDP
QLYADIIMSS ERNLALIKRY YKRFGEAIEL LEQGDKQAFI DSFRKVEHWF GDYAQRFQSE
SRVLLRQAND NRQ
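
Both sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The sketch below shows one way to clean a sequence, estimate its GC content, and translate it. It assumes the amino acid assignments of translation table 11 match the standard code (table 11 differs mainly in which codons may act as starts, which does not matter here because the gene begins with ATG). All function and variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
first_blocks = "ATGGTTGCTG AATTGACCGC ATTACGCGAT"
print(translate(first_blocks))  # MVAELTALRD
```

The ten residues printed match the first block of the protein sequence above. Passing the full gene sequence to `gc_percent` and `translate` would allow a comparison with the reported GC content and protein sequence.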