Gene ECH74115_0262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0262 
Symbol 
ID6970319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp278758 
End bp279930 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content38% 
IMG OID643384330 
Productputative transposase, IS4 family 
Protein accessionYP_002268846 
Protein GI209397232 
COG category[L] Replication, recombination and repair 
COG ID[COG5433] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.400165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTATTC AAAGTTTGCT TGATTATATT TCAGTGATCC CTGATATACG ACAACAAGGA 
AAGGTTAAAC ATAAATTATC TGATATTTTG TTTCTCACCG TATGTGCAGT AATTGCAGGT
GCCGATGAGT GGCAGGAAAT TGAAGATTTT GGACATGAAA GACTTGAATG GCTAAAGAAA
TATGGTGATT TTGATAATGG CATTCCGGTC GATGACACCA TTGCACGCGT TGTGAGTAAC
ATTGACAGTT TGGCCTTTGA AAAGATATTT ATTGAATGGA TGCAGGAGTG CCATGAAATC
ACTGATGGTG AAATTATAGC AATAGATGGA AAGACCATAA GAGGCTCCTT TGATAAGGGA
AAAAGAAAAG GAGCAATCCA TATGGTGAGT GCATTCTCGA ACGAAAATGG TGTTGTACTG
GGGCAGGTAA AAACGGAAGC CAAAAGTAAT GAGATTACAG CCATTCCAGA GTTGCTTAAC
CTACTGGATT TAAAGAAAAA TTTGATAACC ATTGATGCTA TGGGCTGTCA GAAAGATATC
GCTTCGAAGA TCAAAGATAA AAAAGCAGAT TATCTTCTGG CAGTAAAAGG CAATCAGGGG
AAATTACATC ATGCATTCGA GGAAAAATTT CCTGTAAATG TGTTTTCTAA TTATAAAGGC
GATTCGTTTA GTACGCAGGA GATAAGTCAT GGAAGAAAAG AAACACGTTT GCATATTGTC
AGTAACGTAA CGCCTGAATT TTGTGATTTT GAATTCGAAT GGAAGGGATT AAAAAAGCTT
TGTGTAGCAT TGTCATTCAG GCAGAAGAAA GAAGATAAAT CAGCAGAAGG TGTAAGCATC
CGATATTATA TTTCATCAAA GGATATGGAT GCTAAAGAAT TTGCACATGC TATCAGAGCG
CACTGGCTGA TCGAGCACAG TCTTCATTGG GTGTTAGATG TAAAAATGAA TGAAGATGCC
AGCCGGATAA GAAGAGGAAA CGCAGCCGAA ATAATATCTG GAATAAAGAA GATGGCACTG
AATTTATTAA GAGATTGCAA AGACATTAAG GGTGGGGTGA AGAGGAAAAG AAAGAAGGTT
GCGTTAAACA CATGTTATAT AGAAGAAGTG CTTGCATCCT GCTCAGAGCT TGGGTTTCGA
ACTGACAAAA TGAAAAATTT AACACAGATT TAA
 
Protein sequence
MSIQSLLDYI SVIPDIRQQG KVKHKLSDIL FLTVCAVIAG ADEWQEIEDF GHERLEWLKK 
YGDFDNGIPV DDTIARVVSN IDSLAFEKIF IEWMQECHEI TDGEIIAIDG KTIRGSFDKG
KRKGAIHMVS AFSNENGVVL GQVKTEAKSN EITAIPELLN LLDLKKNLIT IDAMGCQKDI
ASKIKDKKAD YLLAVKGNQG KLHHAFEEKF PVNVFSNYKG DSFSTQEISH GRKETRLHIV
SNVTPEFCDF EFEWKGLKKL CVALSFRQKK EDKSAEGVSI RYYISSKDMD AKEFAHAIRA
HWLIEHSLHW VLDVKMNEDA SRIRRGNAAE IISGIKKMAL NLLRDCKDIK GGVKRKRKKV
ALNTCYIEEV LASCSELGFR TDKMKNLTQI