Gene ECH74115_5139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5139 
SymboltnaA 
ID6972188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4780534 
End bp4781949 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content51% 
IMG OID643388810 
Producttryptophanase 
Protein accessionYP_002273236 
Protein GI209397514 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3033] Tryptophanase 
TIGRFAM ID[TIGR02617] tryptophanase, leader peptide-associated 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.685998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.24086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAACT TTAAACATCT CCCTGAACCG TTCCGCATTC GTGTTATTGA GCCAGTAAAA 
CGTACCACTC GCGCTTATCG TGAAGAGGCA ATTATTAAAT CCGGTATGAA CCCGTTCCTG
CTGGATAGCG AAGATGTGTT TATCGATTTA CTGACCGACA GCGGCACCGG GGCGGTGACG
CAGAGTATGC AGGCCGCGAT GATGCGCGGC GACGAAGCTT ACAGCGGCAG CCGCAGCTAC
TATGCGTTAG CCGAGTCAGT GAAAAATATC TTTGGTTATC AATACACCAT TCCGACTCAC
CAGGGCCGTG GTGCAGAGCA AATCTATATT CCAGTACTGA TTAAAAAGCG CGAGCAGGAA
AAAGGCCTGG ATCGCAGCAA AATGGTGGCG TTCTCTAACT ATTTCTTTGA TACCACGCAG
GGCCATAGCC AGATTAACGG CTGTACCGTG CGTAACGTCT ATATCAAAGA AGCCTTCGAT
ACGGGCGTGC GTTACGACTT TAAAGGCAAC TTTGACCTTG AGGGATTAGA ACGCGGTATT
GAAGAAGTTG GTCCGAATAA CGTGCCGTAT ATCGTTGCAA CCATCACCAG TAACTCCGCA
GGCGGTCAGC CGGTTTCACT GGCAAACTTA AAAGCGATGT ACAGCATCGC GAAGAAATAT
GATATTCCGG TGGTAATGGA CTCCGCGCGC TTTGCTGAAA ACGCCTATTT CATCAAGCAG
CGTGAAGCAG AATACAAAGA CTGGACCATC GAGCAGATCA CCCGCGAAAC CTACAAATAT
GCCGATATGC TGGCGATGTC CGCCAAGAAA GATGCGATGG TGCCGATGGG CGGCTTGCTG
TGCATGAAAG ACGACAGTTT CTTTGATGTG TACACCGAGT GCAGAACCCT TTGCGTAGTT
CAGGAAGGCT TCCCGACGTA TGGCGGCCTG GAAGGCGGCG CGATGGAGCG TCTGGCCGTA
GGTCTGTATG ACGGCATGAA TCTGGACTGG CTGGCTTATC GTATTGCGCA GGTGCAGTAT
CTGGTCGATG GTCTGGAAGA GATTGGCGTT GTCTGCCAGC AGGCGGGCGG TCACGCGGCA
TTCGTTGATG CCGGTAAACT GCTTCCGCAT ATCCCGGCAG ATCAGTTCCC GGCACAGGCG
CTGGCCTGCG AGCTGTATAA AGTCGCCGGT ATCCGCGCGG TAGAAATTGG CTCTTTCCTG
TTAGGCCGCG ATCCGAAAAC CGGTAAACAA CTGCCATGCC CGGCTGAACT GCTGCGTTTA
ACCATTCCGC GCGCAACATA TACTCAAACA CATATGGACT TCATTATTGA AGCCTTCAAA
CATGTGAAAG AGAACGCGTC GAATATTAAA GGGTTAACCT TTACCTACGA ACCAAAAGTA
TTGCGTCACT TCACCGCAAA ACTGAAAGAA GTTTAA
 
Protein sequence
MENFKHLPEP FRIRVIEPVK RTTRAYREEA IIKSGMNPFL LDSEDVFIDL LTDSGTGAVT 
QSMQAAMMRG DEAYSGSRSY YALAESVKNI FGYQYTIPTH QGRGAEQIYI PVLIKKREQE
KGLDRSKMVA FSNYFFDTTQ GHSQINGCTV RNVYIKEAFD TGVRYDFKGN FDLEGLERGI
EEVGPNNVPY IVATITSNSA GGQPVSLANL KAMYSIAKKY DIPVVMDSAR FAENAYFIKQ
REAEYKDWTI EQITRETYKY ADMLAMSAKK DAMVPMGGLL CMKDDSFFDV YTECRTLCVV
QEGFPTYGGL EGGAMERLAV GLYDGMNLDW LAYRIAQVQY LVDGLEEIGV VCQQAGGHAA
FVDAGKLLPH IPADQFPAQA LACELYKVAG IRAVEIGSFL LGRDPKTGKQ LPCPAELLRL
TIPRATYTQT HMDFIIEAFK HVKENASNIK GLTFTYEPKV LRHFTAKLKE V