Gene EcSMS35_4075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4075 
SymboltnaA 
ID6145087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4165397 
End bp4166812 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content51% 
IMG OID641618900 
Producttryptophanase 
Protein accessionYP_001746038 
Protein GI170681687 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3033] Tryptophanase 
TIGRFAM ID[TIGR02617] tryptophanase, leader peptide-associated 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.420944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAACT TTAAACATCT CCCTGAACCA TTCCGCATTC GTGTTATTGA GCCAGTAAAA 
CGTACGACTC GCGCTTACCG TGAAGAGGCA ATTATTAAAT CCGGTATGAA CCCGTTCCTG
CTGGATAGCG AAGATGTGTT TATCGATTTA CTGACCGACA GCGGCACCGG GGCGGTGACG
CAGAGCATGC AGGCCGCGAT GATGCGCGGC GACGAAGCCT ACAGCGGCAG CCGCAGCTAC
TATGCGTTAG CCGAGTCAGT GAAAAATATC TTTGGTTATC AATACACCAT TCCGACTCAC
CAGGGCCGTG GCGCAGAACA AATCTATATT CCGGTACTGA TTAAAAAACG CGAGCAGGAA
AAAGGCCTGG ATCGCAGCAA AATGGTGGCG TTCTCTAACT ATTTCTTTGA TACCACGCAG
GGCCATAGCC AGATTAACGG CTGTACCGTG CGTAACGTCT ATATCAAAGA AGCCTTCGAT
ACTGGCGTGC GTTACGACTT TAAAGGCAAC TTTGACCTTG AGGGATTAGA ACGCGGTATT
GAAGAAGTTG GCCCGAATAA CGTGCCGTAT ATCGTTGCAA CCATCACCAG TAACTCCGCA
GGTGGTCAGC CGGTTTCACT GGCAAACTTA AAAGCGATGT ACAGCATCGC GAAGAAATAC
GATATTCCGG TGGTCATGGA CTCCGCACGC TTTGCCGAAA ACGCCTATTT CATCAAGCAG
CGTGAAGCAG AATACAAAGA CTGGACCATC GAGCAGATCA CCCGCGAAAC CTACAAATAT
GCCGATATGC TGGCGATGTC CGCCAAGAAA GATGCAATGG TGCCGATGGG CGGCCTGCTG
TGCGTGAAAG ACGACAGCTT CTTTGATGTG TACACCGAGT GCAGAACCCT TTGCGTGGTA
CAGGAAGGCT TCCCGACATA TGGCGGCCTG GAAGGCGGCG CGATGGAGCG TCTGGCGGTA
GGTCTGTATG ACGGCATGAA TCTGGACTGG CTGGCTTATC GTATCGCGCA GGTGCAGTAT
CTGGTCGATG GTCTGGAAGA GATTGGCGTT GTCTGCCAGC AGGCGGGCGG TCACGCTGCA
TTCGTTGATG CCGGTAAACT GCTGCCGCAT ATCCCGGCAG ATCAGTTCCC GGCACAGGCG
CTGGCCTGCG AGCTGTATAA AGTCGCCGGT ATCCGTGCGG TAGAAATTGG CTCTTTCCTG
TTAGGCCGCG ATCCGAAAAC CGGTAAACAA CTGCCATGCC CGGCTGAACT GCTGCGTTTA
ACCATTCCGC GCGCAACATA TACTCAAACA CATATGGACT TCATTATTGA AGCCTTCAAA
CATGTGAAAG AGAACGCGGC GAATATTAAA GGGTTAACCT TTACCTACGA ACCGAAAGTA
TTGCGTCACT TCACCGCAAA ACTGAAAGAA GTTTAA
 
Protein sequence
MENFKHLPEP FRIRVIEPVK RTTRAYREEA IIKSGMNPFL LDSEDVFIDL LTDSGTGAVT 
QSMQAAMMRG DEAYSGSRSY YALAESVKNI FGYQYTIPTH QGRGAEQIYI PVLIKKREQE
KGLDRSKMVA FSNYFFDTTQ GHSQINGCTV RNVYIKEAFD TGVRYDFKGN FDLEGLERGI
EEVGPNNVPY IVATITSNSA GGQPVSLANL KAMYSIAKKY DIPVVMDSAR FAENAYFIKQ
REAEYKDWTI EQITRETYKY ADMLAMSAKK DAMVPMGGLL CVKDDSFFDV YTECRTLCVV
QEGFPTYGGL EGGAMERLAV GLYDGMNLDW LAYRIAQVQY LVDGLEEIGV VCQQAGGHAA
FVDAGKLLPH IPADQFPAQA LACELYKVAG IRAVEIGSFL LGRDPKTGKQ LPCPAELLRL
TIPRATYTQT HMDFIIEAFK HVKENAANIK GLTFTYEPKV LRHFTAKLKE V