Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02453 |
Symbol | tyrA |
ID | 8114415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2604670 |
End bp | 2605791 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644848653 |
Product | hypothetical protein |
Protein accession | YP_003000226 |
Protein GI | 251785922 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0287] Prephenate dehydrogenase |
TIGRFAM ID | [TIGR01799] chorismate mutase domain of T-protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.58394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGCTG AATTGACCGC ATTACGCGAT CAAATTGATG AAGTCGATAA AGCGCTGCTG AATTTATTAG CGAAGCGTCT GGAACTGGTT GCTGAAGTGG GCGAGGTGAA AAGCCGCTTT GGACTGCCTA TTTATGTTCC GGAGCGCGAG GCATCTATGT TGGCCTCGCG GCGCGCAGAG GCGGAAGCTC TGGGTGTACC GCCAGATCTG ATTGAGGATG TTTTGCGTCG GGTGATGCGT GAATCTTACT CCAGTGAAAA CGACAAAGGA TTTAAAACAC TTTGTCCGTC ACTGCGTCCG GTGGTTATCG TCGGCGGTGG CGGTCAGATG GGACGCCTGT TCGAGAAGAT GCTGACACTA TCGGGTTATC AGGTGCGGAT TCTGGAGCAA CATGACTGGG ATCGAGCGGC TGATATTGTT GCCGATGCCG GAATGGTGAT TGTTAGTGTG CCAATCCACG TTACTGAGCA AGTTATCGGC AAATTACCGC CTTTACCGAA AGATTGTATT CTGGTTGATC TGGCATCAGT GAAAAATGGA CCATTACAGG CCATGCTGGC GGCGCACGAT GGCCCGGTAC TGGGGTTACA CCCGATGTTC GGCCCGGACA GCGGTAGCCT GGCAAAGCAA GTTGTGGTCT GGTGTGATGG ACGTAAGCCG GAAGCATACC AATGGTTTCT GGAGCAAATT CAGGTCTGGG GCGCTCGGCT GCATCGTATT AGCGCTGTCG AGCACGATCA GAATATGGCG TTTATTCAGG CTCTGCGCCA CTTTGCTACT TTTGCTTATG GGCTGCATCT GGCAGAAGAA AATGTTCAGC TTGAGCAACT TCTGGCGCTC TCTTCGCCGA TTTACCGCCT TGAGCTGGCG ATGGTCGGGC GACTGTTTGC TCAGGATCCG CAGCTTTATG CCGACATTAT TATGTCGTCA GAGCGTAATC TGGCGTTAAT CAAACGTTAC TATAAGCGTT TCGGCGAGGC GATTGAGTTG CTGGAGCAGG GCGATAAGCA GGCGTTTATT GACAGTTTCC GCAAGGTGGA GCACTGGTTC GGCGATTACG CACAGCGTTT TCAGAGTGAA AGCCGCGTGT TATTGCGTCA GGCGAATGAC AACCGCCAGT AA
|
Protein sequence | MVAELTALRD QIDEVDKALL NLLAKRLELV AEVGEVKSRF GLPIYVPERE ASMLASRRAE AEALGVPPDL IEDVLRRVMR ESYSSENDKG FKTLCPSLRP VVIVGGGGQM GRLFEKMLTL SGYQVRILEQ HDWDRAADIV ADAGMVIVSV PIHVTEQVIG KLPPLPKDCI LVDLASVKNG PLQAMLAAHD GPVLGLHPMF GPDSGSLAKQ VVVWCDGRKP EAYQWFLEQI QVWGARLHRI SAVEHDQNMA FIQALRHFAT FAYGLHLAEE NVQLEQLLAL SSPIYRLELA MVGRLFAQDP QLYADIIMSS ERNLALIKRY YKRFGEAIEL LEQGDKQAFI DSFRKVEHWF GDYAQRFQSE SRVLLRQAND NRQ
|
| |