Gene EcDH1_4259 details

Gene Information

Locus tag: EcDH1_4259
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4624324
End bp: 4625739
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: tryptophanase
Protein accession: ACX41857
Protein GI: 260451435
COG category:
COG ID:
TIGRFAM ID:
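
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length(4624324, 4625739)
print(length)                  # 1416
print(protein_length(length))  # 471
```

Under those assumptions the values reproduce the record's Gene Length (1416 bp) and Protein Length (471 aa).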


Plasmid Coverage information

Num covering plasmid clones: 55
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAACT TTAAACATCT CCCTGAACCG TTCCGCATTC GTGTTATTGA GCCAGTAAAA 
CGTACCACTC GCGCTTATCG TGAAGAGGCA ATTATTAAAT CCGGTATGAA CCCGTTCCTG
CTGGATAGCG AAGATGTTTT TATCGATTTA CTGACCGACA GCGGCACCGG GGCGGTGACG
CAGAGCATGC AGGCTGCGAT GATGCGCGGC GACGAAGCCT ACAGCGGCAG TCGTAGCTAC
TATGCGTTAG CCGAGTCAGT GAAAAATATC TTTGGTTATC AATACACCAT TCCGACTCAC
CAGGGCCGTG GCGCAGAGCA AATCTATATT CCGGTACTGA TTAAAAAACG CGAGCAGGAA
AAAGGCCTGG ATCGCAGCAA AATGGTGGCG TTCTCTAACT ATTTCTTTGA TACCACGCAG
GGCCATAGCC AGATCAACGG CTGTACCGTG CGTAACGTCT ATATCAAAGA AGCCTTCGAT
ACGGGCGTGC GTTACGACTT TAAAGGCAAC TTTGACCTTG AGGGATTAGA ACGCGGTATT
GAAGAAGTTG GTCCGAATAA CGTGCCGTAT ATCGTTGCAA CCATCACCAG TAACTCTGCA
GGTGGTCAGC CGGTTTCACT GGCAAACTTA AAAGCGATGT ACAGCATCGC GAAGAAATAC
GATATTCCGG TGGTAATGGA CTCCGCGCGC TTTGCTGAAA ACGCCTATTT CATCAAGCAG
CGTGAAGCAG AATACAAAGA CTGGACCATC GAGCAGATCA CCCGCGAAAC CTACAAATAT
GCCGATATGC TGGCGATGTC CGCCAAGAAA GATGCGATGG TGCCGATGGG CGGCCTGCTG
TGCATGAAAG ACGACAGCTT CTTTGATGTG TACACCGAGT GCAGAACCCT TTGCGTGGTG
CAGGAAGGCT TCCCGACATA TGGCGGCCTG GAAGGCGGCG CGATGGAGCG TCTGGCGGTA
GGTCTGTATG ACGGCATGAA TCTCGACTGG CTGGCTTATC GTATCGCGCA GGTACAGTAT
CTGGTCGATG GTCTGGAAGA GATTGGCGTT GTCTGCCAGC AGGCGGGCGG TCACGCGGCA
TTCGTTGATG CCGGTAAACT GTTGCCGCAT ATCCCGGCAG ACCAGTTCCC GGCACAGGCG
CTGGCCTGCG AGCTGTATAA AGTCGCCGGT ATCCGTGCGG TAGAAATTGG CTCTTTCCTG
TTAGGCCGCG ATCCGAAAAC CGGTAAACAA CTGCCATGCC CGGCTGAACT GCTGCGTTTA
ACCATTCCGC GCGCAACATA TACTCAAACA CATATGGACT TCATTATTGA AGCCTTTAAA
CATGTGAAAG AGAACGCGGC GAATATTAAA GGATTAACCT TTACGTACGA ACCGAAAGTA
TTGCGTCACT TCACCGCAAA ACTTAAAGAA GTTTAA
 
Protein sequence
MENFKHLPEP FRIRVIEPVK RTTRAYREEA IIKSGMNPFL LDSEDVFIDL LTDSGTGAVT 
QSMQAAMMRG DEAYSGSRSY YALAESVKNI FGYQYTIPTH QGRGAEQIYI PVLIKKREQE
KGLDRSKMVA FSNYFFDTTQ GHSQINGCTV RNVYIKEAFD TGVRYDFKGN FDLEGLERGI
EEVGPNNVPY IVATITSNSA GGQPVSLANL KAMYSIAKKY DIPVVMDSAR FAENAYFIKQ
REAEYKDWTI EQITRETYKY ADMLAMSAKK DAMVPMGGLL CMKDDSFFDV YTECRTLCVV
QEGFPTYGGL EGGAMERLAV GLYDGMNLDW LAYRIAQVQY LVDGLEEIGV VCQQAGGHAA
FVDAGKLLPH IPADQFPAQA LACELYKVAG IRAVEIGSFL LGRDPKTGKQ LPCPAELLRL
TIPRATYTQT HMDFIIEAFK HVKENAANIK GLTFTYEPKV LRHFTAKLKE V
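
The gene and protein sequences above can be cross-checked with a short script. The sketch below is one possible approach, not the tool the database itself uses: it strips the 10-base display grouping, translates codon by codon until the first stop, and computes GC content. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does (the tables differ in permitted start codons); the helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGGAAAACT TTAAACATCT CCCTGAACCG TTCCGCATTC GTGTTATTGA GCCAGTAAAA"
print(translate(first_line))  # MENFKHLPEPFRIRVIEPVK
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check; `gc_content` on the full sequence can likewise be compared with the GC content field.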