Gene ECH74115_5556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5556 
SymboltyrB 
ID6968947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5195427 
End bp5196662 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content53% 
IMG OID643389197 
Productaromatic amino acid aminotransferase 
Protein accessionYP_002273594 
Protein GI209395724 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1448] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.187921 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTGTTTTA ACCACCTGCC CGTAAACCTA GAGAACCATC GCGTGTTTCA AAAAGTTGAC 
GCCTACGCTG GCGACCCGAT TCTTACGCTT ATGGAGCGTT TTAAAGAAGA CCCTCGCAGC
GACAAAGTGA ATTTAAGTAT CGGTCTGTAC TACAACGAAG ACGGAATTAT TCCACAATTG
AAAGCCGTGG CGGAGGCGGA AGCGCGCCTG AATGCGGTGC CTCATGGCGC TTCGCTTTAT
TTACCGATGG AAGGGCTTAA CAGCTATCGC CATGCCATTG CGCCGCTGCT GTTTGGTGCC
GACCATCCGG TACTGCAACA ACAGCGCGTA GCAACCATTC AAACCCTTGG CGGCTCAGGG
GCATTGAAAG TGGGCGCGGA TTTCCTGAAA CGCTACTTCC CGGAATCAGG CGTCTGGGTC
AGCGATCCTA CCTGGGAAAA CCACGTAGCA ATATTCGCCG GGGCTGGATT CGAAGTAAGC
ACTTACCCCT GGTATGACGA AGCGACTAAC GGCGTGCGCT TTAATGACCT GTTGGCGATG
CTGAAAACAT TACCTGCCCG CAGTATTGTG TTGCTGCATC CATGTTGCCA CAACCCAACG
GGTGCCGATC TCACTAATGA CCAGTGGGAT GCGGTGATTG AAATTCTCAA AGCCCGCGAG
CTTATCCCAT TCCTTGATAT TGCCTATCAA GGATTTGGTG CCGGTATGGA AGAGGATGCC
TACGCCATTC GCGCCATTGC CAGCGCTGGA TTACCCGCTC TGGTGAGCAA TTCGTTCTCG
AAAATTTTCT CCCTTTACGG CGAGCGCGTC GGCGGACTTT CTGTTCTGTG TGAAGATGCC
GAAGCTGCAG GCCGCGTACT GGGGCAATTG AAAGCAACAG TTCGCCGCAA CTACTCCAGC
CCGCCGAATT TTGGTGCGCA GGTGGTGGCT GCAGTGCTGA ATGACGAGGC ATTGAAAGCC
AGCTGGCTGG CGGAAGTAGA AGAGATGCGT ACTCGCATTC TGGCAATGCG TCAGGAACTG
GTGAAGGTAT TAAGCACAGA GATGCCAGAA CGCAATTTCG ATTATCTGCT TAATCAGCGC
GGCATGTTCA GTTATACCGG TTTAAGTGCC GCTCAGGTTG ACCGACTACG TGAAGAATTT
GGTGTCTATC TCATCGCCAG CGGTCGCATG TGTGTCGCCG GGTTAAATAC GGCAAATGTG
CAACGTGTGG CAAAGGCGTT TGCTGCGGTG ATGTAA
 
Protein sequence
MCFNHLPVNL ENHRVFQKVD AYAGDPILTL MERFKEDPRS DKVNLSIGLY YNEDGIIPQL 
KAVAEAEARL NAVPHGASLY LPMEGLNSYR HAIAPLLFGA DHPVLQQQRV ATIQTLGGSG
ALKVGADFLK RYFPESGVWV SDPTWENHVA IFAGAGFEVS TYPWYDEATN GVRFNDLLAM
LKTLPARSIV LLHPCCHNPT GADLTNDQWD AVIEILKARE LIPFLDIAYQ GFGAGMEEDA
YAIRAIASAG LPALVSNSFS KIFSLYGERV GGLSVLCEDA EAAGRVLGQL KATVRRNYSS
PPNFGAQVVA AVLNDEALKA SWLAEVEEMR TRILAMRQEL VKVLSTEMPE RNFDYLLNQR
GMFSYTGLSA AQVDRLREEF GVYLIASGRM CVAGLNTANV QRVAKAFAAV M