Gene ECH74115_4292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4292 
Symbol 
ID6971324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3974258 
End bp3975796 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content58% 
IMG OID643388023 
ProductIS66 family element, transposase 
Protein accessionYP_002272461 
Protein GI209400142 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACA TCTCTTCTGA CGACATCTTC CTGCTGAAAC AGCGCCTGGC CGAACAGGAA 
GCGCTGATCC ACGCCCTGCA GGAAAAGCTG AGCAACCGGG AGCGCGAAAT AGACCATCTG
CAGGCGCAGC TGGATAAACT CCGCCGGATG AACTTCGGCA GTCGTTCCGA AAAAGTCTCC
CGCCGTATCG CACAAATGGA AGCCGATCTG AACCGGCTTC AGAAAGAGAG CGATACGCTG
ACTGGTAGGG TGTATGACCC GGCAGTACAG CGTCCGTTGC GTCAGACCCG CACCCGTAAG
CCGTTCCCTG AATCACTACC CCGTGACGAA AAGCGACTGT TGCCTGCGGC GCCGTGCTGC
CCGAACTGCG GCGGTTCACT GAGCTATCTG GGCGAGGATA CCGCCGAACA GCTGGAGTTG
ATGCGTAGCG CCTTCCGGGT TATCCGGACG GTACGGGAAA AACATGCCTG TACTCAGTGC
GATGCCATCG TGCAGGCACC TGCACCTTCG CGGCCCATCG AGCGGGGTAT CGCCGGACCG
GGGCTGCTGG CCCGCGTGCT GACCTCGAAG TATGCAGAGC ACACCCCGCT GTATCGCCAG
TCAGAAATAT ACGGCCGGCA AGGTGTGGAG CTGAGGCGTT CACTGCTGTC GGGCTGGGTG
GATGCATGCT GCCGGCTGCT GTCTCCGCTG GAAGAGGCGC TTCATGGCTA TGTCATGACT
GACGGCAAAC TCCATGCCGA TGATACCCCG GTCCAGGTAC TGCTGCCGGG TAATAAGAAG
ACGAAGACCG GGCGGTTGTG GGCGTATGTT CGTGATGACC GCAATGCAGG GTCAGCGTTG
GCACCTGCAG TGTGGTTCGC TTACAGCCCG GACAGAAAAG GCATCCATCC GCAGACTCAT
CTTGCCTGCT TCAGCGGTGT GCTGCAAGCG GATGCGTACG CCGGGTTCAA CGAGCTGTAT
CGCAATGGTG GGATAACGGA AGCTGCCTGC TGGGCTCATG CCCGCCGAAA GATCCACGAT
GTGCACGTCC GCATCCCGTC AGCACTGACG GAAGAAGCCC TGGAGCAGAT CGGTCAGTTG
TACGCCATAG AGGCGGATAT AAGGGGAATG CCGGCAGAGC AGCGGCTTGC TGAACGTCAG
CGAAAAACGA AACCGTTGTT GAAATCCCTG GAAAGCTGGT TGCGTGAAAA GATGAAGACC
CTGTCGCGAC ACTCAGAGTT GGCGAAGGCG TTCGCGTACG CACTTAACCA GTGGCCGGCA
CTGACGTACT ATGCGAACGA TGGCTGGGTG GAAATCGACA ACAACATCGC TGAAAATGCC
CTGCGGGCGG TCAGTCTGGG TCGTAAAAAC TTCCTGTTCT TCGGCTCTGA TCATGGTGGT
GAGCGGGGAG CGCTACTGTA CAGCCTGATC GGGACGTGCA AACTGAATGA CGTGGATCCA
GAAAGCTACC TTCGCCATGT GCTTGGCGTC ATAGCAGACT GGCCGGTCAA CCGGGTCAGC
GAACTGCTTC CGTGGCGCAT AGCACTGCCA GCTGAATAA
 
Protein sequence
MNDISSDDIF LLKQRLAEQE ALIHALQEKL SNREREIDHL QAQLDKLRRM NFGSRSEKVS 
RRIAQMEADL NRLQKESDTL TGRVYDPAVQ RPLRQTRTRK PFPESLPRDE KRLLPAAPCC
PNCGGSLSYL GEDTAEQLEL MRSAFRVIRT VREKHACTQC DAIVQAPAPS RPIERGIAGP
GLLARVLTSK YAEHTPLYRQ SEIYGRQGVE LRRSLLSGWV DACCRLLSPL EEALHGYVMT
DGKLHADDTP VQVLLPGNKK TKTGRLWAYV RDDRNAGSAL APAVWFAYSP DRKGIHPQTH
LACFSGVLQA DAYAGFNELY RNGGITEAAC WAHARRKIHD VHVRIPSALT EEALEQIGQL
YAIEADIRGM PAEQRLAERQ RKTKPLLKSL ESWLREKMKT LSRHSELAKA FAYALNQWPA
LTYYANDGWV EIDNNIAENA LRAVSLGRKN FLFFGSDHGG ERGALLYSLI GTCKLNDVDP
ESYLRHVLGV IADWPVNRVS ELLPWRIALP AE