Gene ECH74115_A0006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_A0006 
Symbol 
ID6966540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011351 
Strand
Start bp2450 
End bp3451 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content37% 
IMG OID643384037 
Productconjugal transfer protein 
Protein accessionYP_002268516 
Protein GI209395686 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3704] Type IV secretory pathway, VirB6 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.623118 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATTTTT ATGTGGAAAT AAAAATGAAT ATAATATCTA CGTTTTTAAG TACTGTTACA 
ACACTAGTTG AATCAGGGGC AGCTAGTAAC GCAGCAAAAA TTGCAAATGC AATATCACCT
GTATTTTTTG CAGCCATTGG TGTTTATATT ATTTTTGTTG CTTATGAAAT AATATATTCA
CAACGTGATG TGGTTATGTC GGAAGTTACT AAAAATATAA TGAAATTGGC TCTGGTTGGT
GTTTTCACAT ATAGCTCAAC ATATTATTCA CAATATGTAA TTCCTTTCGT AATGCATTCT
GGAGATGAAT TGTCTTCTGC TTTGACTGGA CAGTCTGATA TAGCCAATTC AATAGATAAT
CTTTGGCAAG CATTGTCAGA TACAATGGAG CAATTTTGGA GTGATGCTAC GGGACAGCTT
GGTATGACTG ATTTTGGTCT ATGGATTAAA GCTGGTCTTA TATGGATCAC AGGGTATGCA
GGAGGATTTT TGCTTGTTTT TTATACTACC GTTTTCCTTT GCGTATCAAA GTTTATGGTC
GGAATGGTGT TATCGGTAGG TATTTTATTT ATATGTTTTT CTGCATTCTC CCCAACGCGT
GGTATGTTTA CTGCATGGTG TGGTAGTTGT TTAAACTACA TTTTGTTAAA TGTATTTTAT
ACCATATCTT TTGGCTTCGT ATTGTCACTA ATACAACAAA CGGCTAACTT GGATGCAAAG
ACCATTACAT TCATGTCTGT GGCTACTTTA TTAGCGGTTG TGTTAATATC TGTTTACCTG
ATAGAGCAAA TTGGGACGTT GTGCTCATCT CTAACAGGTG GAGTAGGTAT TAATGGATTG
ACTGCCTCAG CTAACGGCGC AGCAAATAAA TTAGCTGCTG TTTCAGGTGT AAGGGCTATG
AGCAATGCAG GTAAAGGATT GGCTGCCTCT GTTGCCAAAA AAGCTGGGAA TAGAGTAGTA
AGCCTGGCAG GTCAATTAGG AAAGAATGTA TTAGGAGGGT AA
 
Protein sequence
MHFYVEIKMN IISTFLSTVT TLVESGAASN AAKIANAISP VFFAAIGVYI IFVAYEIIYS 
QRDVVMSEVT KNIMKLALVG VFTYSSTYYS QYVIPFVMHS GDELSSALTG QSDIANSIDN
LWQALSDTME QFWSDATGQL GMTDFGLWIK AGLIWITGYA GGFLLVFYTT VFLCVSKFMV
GMVLSVGILF ICFSAFSPTR GMFTAWCGSC LNYILLNVFY TISFGFVLSL IQQTANLDAK
TITFMSVATL LAVVLISVYL IEQIGTLCSS LTGGVGINGL TASANGAANK LAAVSGVRAM
SNAGKGLAAS VAKKAGNRVV SLAGQLGKNV LGG