Gene ECH74115_A0017 details

Gene Information

Locus tag: ECH74115_A0017
Symbol:
ID: 6966518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011351
Strand:
Start bp: 11883
End bp: 13850
Gene Length: 1968 bp
Protein Length: 655 aa
Translation table: 11
GC content: 39%
IMG OID: 643384048
Product: conjugal transfer protein TraK
Protein accession: YP_002268527
Protein GI: 209395641
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID:
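
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the helper names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (an assumption)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(11883, 13850)
print(length_bp)                  # 1968
print(protein_length(length_bp))  # 655
```

Under these assumptions the computed values agree with the listed Gene Length (1968 bp) and Protein Length (655 aa).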


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAA AAATCATATT AGTGGTTTTT TTATTGCTTA TTCTCGTTGC TTGTCTTATT 
GGCGGTAATT ATCTTGGGGG ATATGCAGCA TTAAAATATT CAGGATTGAG TATTGACATG
TTGCAATGGA ATACATTTAA TAATGTTATT ACACAATTTA GCGGAAAACC TGAATATAAA
AAATTGGTCG CGATTACATG GGCTGGATTT GCTGCACCAT GTGCAATATT TATTGGTTTT
GTTGTTATTG TCATAGCCGG ACTGATGCCA AAAAAAGTTA TTTATGGCAA TGCTCGTCTT
GCTACTGATA TGGACCTGGC TAAATCAACC TTTTTCCCAA CAGATAAAGA GTTGAAAGAA
GCAAGATTAA AGAACAGTAA ACCATACTGC TATCCACCTA TTCTTATTGG TAAACAGTTT
AAAGGAAGGT TTAAAAACAA ATATATATAT TTCTTCGGGC AACAGTTTTT AATCCTTTAT
GCTCCTACGC GTTCTGGTAA AGGTGTTGGT ATTGTTATAC CTAACTGTGT GAATTACCCG
GATTCGATGG TTGTTCTTGA TATAAAACTT GAAAACTGGT TTTTGTCTTC CGGATACAGA
CAGAATGTGT TAGGGCAGGA ATGTTTTCTG TTTGCTCCTG CGGGATATGC AGAAAACCAG
CAGGAGGCGA AAAAAGGTAA TATCAGATCA CACCGCTGGA ATCCTCTGGA CTGCGTTAAT
CGTTCTGATA TTCAGCGTTC TGGAGATTTG GAAAAAATTG CCGCTATGTT GATTCCGGCA
AGTGATGATC CTATCTGGTC TGATTCTGCA CGGAAGCTCT TTTGTGGGCT TGCTTTATAT
CTTCTGGATA AGGAACGATT CCATCTTCAG CAAAAGAAAA AGGGTGTTGC TGATGTACCT
GATGTTCTTG TGTCCATGTC GGCTATTATG AAACTCTCTG TCCCTGAAAG TGGTCAGAAA
CTATCAGCCT GGATGGGTAC GGAAATAGAT CAGAAAGAAT ACCTGAGTGA TGAAACAAAG
AGTCTGTTTC GTGAATTTAT GGCAGCGCCG GAGAAGACAC AAGGAAGTAT TATTACCAAC
TTCTCGTCTC CATTGAGTAT TTTTAAGAAC CCAATCACAG CAGCAGCTAC TGATGCAAGT
GATTTTGATG TGCGAATGGT CAGGAAAAAG CCTATGTCGA TATATCTTGG GCTGACTCCT
GATGCATTAA TTACTCACTC ACGGCTTGTG AATCTATTTT TCTCTATGCT AGTGAATGAG
AATACAAAGG AGCTTCCAGA ACAGAATCCT GAGTTGAAAT ATCAGTGTCT TGTCCTTCTT
GATGAATTTA CATCAATGGG TAAGTCCGAA ATTATTGAAA AAGGTGTTGG CTTTACCGCT
GGTTTTAACC TGCGATTCGT TATTATTTTG CAGAACGAAG GGCAGGGTAA AAAAGATGAT
ATGTATGGCA CTCACGGTTG GGAAACTTTC GTTGAAAACT CGGCTGTAGT TCTTTATTAC
CCGCCAAAGG CTAAAAATGA ACTGGCGAAA AAGATCTCTG AGGAAATTGG AGTCAGGGAT
ATGAAAATTA TCAAGAAATC TGACTCACGG AGTGGTGGCA AAGGTGGTAG TTCTCGTTCC
CGAAATCATG AAATAGTCAG CCGTGCTGTA TTACTGCCAG AAGAGATTGT CGAGCTTCGG
GATTATAAAA ATAAAATGGG TAATATGGCT GTTCGTGAAA TCGTAATGAG TGAATACACT
CGCCCATTTG TAGCTAACAA AATAATCTGG TTTGAAGAGG AGGAGTTTAA AAAACGAGTA
GATATAGCTA AAGAAAATCC GGTAGAAATG CCTGTATTGT TCGATGAAGA AATGAGAAAT
CAGATTCTGG AACAGGCAAA AATATATTCA GATGATATTA GTATGTTTTC TGATGTTATG
ACTAAACCAG ATCTTGAGAC ACACGATCTG GAGGATGTTT CTGAATAA
 
Protein sequence
MKKKIILVVF LLLILVACLI GGNYLGGYAA LKYSGLSIDM LQWNTFNNVI TQFSGKPEYK 
KLVAITWAGF AAPCAIFIGF VVIVIAGLMP KKVIYGNARL ATDMDLAKST FFPTDKELKE
ARLKNSKPYC YPPILIGKQF KGRFKNKYIY FFGQQFLILY APTRSGKGVG IVIPNCVNYP
DSMVVLDIKL ENWFLSSGYR QNVLGQECFL FAPAGYAENQ QEAKKGNIRS HRWNPLDCVN
RSDIQRSGDL EKIAAMLIPA SDDPIWSDSA RKLFCGLALY LLDKERFHLQ QKKKGVADVP
DVLVSMSAIM KLSVPESGQK LSAWMGTEID QKEYLSDETK SLFREFMAAP EKTQGSIITN
FSSPLSIFKN PITAAATDAS DFDVRMVRKK PMSIYLGLTP DALITHSRLV NLFFSMLVNE
NTKELPEQNP ELKYQCLVLL DEFTSMGKSE IIEKGVGFTA GFNLRFVIIL QNEGQGKKDD
MYGTHGWETF VENSAVVLYY PPKAKNELAK KISEEIGVRD MKIIKKSDSR SGGKGGSSRS
RNHEIVSRAV LLPEEIVELR DYKNKMGNMA VREIVMSEYT RPFVANKIIW FEEEEFKKRV
DIAKENPVEM PVLFDEEMRN QILEQAKIYS DDISMFSDVM TKPDLETHDL EDVSE
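
The relationship between the gene sequence and the protein sequence can be illustrated by translating a fragment of the nucleotide record. This is a minimal sketch, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons, and the function name `translate` is a hypothetical helper. The spaces that split the sequence into blocks of ten are stripped before translation.

```python
from itertools import product

# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence shown above.
first_line = "ATGAAGAAAA AAATCATATT AGTGGTTTTT TTATTGCTTA TTCTCGTTGC TTGTCTTATT"
last_line = "ACTAAACCAG ATCTTGAGAC ACACGATCTG GAGGATGTTT CTGAATAA"

print(translate(first_line))  # MKKKIILVVFLLLILVACLI
print(translate(last_line))   # TKPDLETHDLEDVSE*
```

The first fragment reproduces the opening 20 residues of the protein sequence, and the last fragment reproduces its final 15 residues followed by the stop codon.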