Gene ECD_01943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01943 
Symbolwzx 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2007194 
End bp2008621 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content30% 
IMG OID 
ProductO antigen translocase (Wzx) 
Protein accessionACT43794 
Protein GI253978124 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATGAATA ACAAACTAGC CAGGCAAGGA TTATTATGGA GCACAATTGA AAGGCTAGGA 
ACGCAAGCCA TACAATTATT ATTAATGTTG TATTTGGGTA GGGTTCTTGG ACCAAGCTCT
TTTGGTTATG TTGGTATGCT AACTTTATTT CTATCATTAG CTCAGGTTCT AATTGATAGT
GGATTTAGTG CAGCATTAAT TAGAAAATCG GAGCGTACTG AAAAAGATTA TTCGACAGTA
TTTATATTTA ATATTTGTCT ATCAATACTA ATATATATTG TACTTTATTG TTTATCTTCT
TATATTGCTT TATTCTATCA AATACCTCTG CTGGAATCTT TACTTGATAT TTTAGCATTA
ACAGTTATAG CAAATGGTCT AACTTTAATC CCCAAAATTC AACTCACTGT AGACGTGAAT
TTTAAGATCC AAGCTAAAAG TTCACTTATC GCTATTACCA GTAGTAGTGT AATAGCGATA
ACGTTAGCTG CTTTGGGTTA TGGAGTATGG ACATTGGTTT TCCAATCGTT ATCCTATAGT
GTCATAAATT GTATTGTACT GAATATTTAT AATCCATGGT ATCCAAAAGA GAAATTTTGC
TATGAAACAT TCAAAAAGCT ATTTTCATTT AGCAGTAACT TACTAATTGC TGGAGTACTT
GAAGCTTTTT ATTCAAATAT ATATCAATTA ATTATAGGGA AATTTTTTAC ACCACAGCTT
GTAGGGCAAT TTACACAGGC TATTCAGATC TCTAGCGTAC CTGCAATGAC ACTTACTAAT
ATTATTCAGA GAGTAACTTA TCCACTTTTT TGTAATATTT ATAATGGCAA AGGGAAAATA
GATGATGCTT ATTTAAACAC ACTTAAAATT GCAGGCGTTG TAGTTTTTCC AATTCTATTA
GGAATAGGTT TAATCTCGAA GCCATTAGTC TCTGTATTAT TAGGAAGTGA GTGGAAATTT
ACATCTGATA TTCTGTTTCT ACTTTGTATT GCATTTATGA TTTATCCTAT ACATGCAATA
AATATTAATA TCCTACAGGT TCATGGGCGA AGTGATCTTT TTTTGCGTTT GGAAATACTA
AAAAAGACAT TGATGACAAT TATTCTTGTC ATTACGATGC AAATAAATAT AAATGCAATG
GTGATTGGTT TAGTTTTACA CTCATATCTT TCGTGGTTTT TAAATGGATT ATTTAGCCAG
AAAGTATCAA CGATCACAAT AAGGAAGCAG TTAAAAGAGT TACTACCTAT ATGGTGTCTA
TGTTTAATTA GTGGGTTAGG TTCTAACTAT ATTATAAATG AAATAAGTAT GGAACCTTTA
TTAAATATAA TATTTACTAT TTTATTAAAT GCCATTATAT ACATCACACT AATTCGAATT
ATGTACAAAG AAATATTTAT GACTATTATT TCTCTTATAA AAAAATAA
 
Protein sequence
MMNNKLARQG LLWSTIERLG TQAIQLLLML YLGRVLGPSS FGYVGMLTLF LSLAQVLIDS 
GFSAALIRKS ERTEKDYSTV FIFNICLSIL IYIVLYCLSS YIALFYQIPL LESLLDILAL
TVIANGLTLI PKIQLTVDVN FKIQAKSSLI AITSSSVIAI TLAALGYGVW TLVFQSLSYS
VINCIVLNIY NPWYPKEKFC YETFKKLFSF SSNLLIAGVL EAFYSNIYQL IIGKFFTPQL
VGQFTQAIQI SSVPAMTLTN IIQRVTYPLF CNIYNGKGKI DDAYLNTLKI AGVVVFPILL
GIGLISKPLV SVLLGSEWKF TSDILFLLCI AFMIYPIHAI NINILQVHGR SDLFLRLEIL
KKTLMTIILV ITMQININAM VIGLVLHSYL SWFLNGLFSQ KVSTITIRKQ LKELLPIWCL
CLISGLGSNY IINEISMEPL LNIIFTILLN AIIYITLIRI MYKEIFMTII SLIKK