Gene ECH74115_2224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2224 
Symbol 
ID6970911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2114108 
End bp2115646 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content58% 
IMG OID643386112 
ProductIS66 family element, transposase 
Protein accessionYP_002270599 
Protein GI209400332 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.101822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000000000417229 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGACA TCTCTTCTGA CGACATCTTC CTGCTGAAAC AGCGCCTGGC CGAACAGGAA 
GCGCTGATCC ACGCCCTGCA GGAAAAGCTG AGCAACCGGG AGCGCGAAAT AGACCATCTG
CAGGCGCAGC TGGATAAACT CCGCCGGATG AACTTCGGCA GTCGTTCCGA AAAAGTCTCC
CGCCGTATCG CACAAATGGA AGCCGATCTG AACCGGCTTC AGAAAGAGAG CGATACGCTG
ACTGGTAGGG TGTATGACCC GGCAGTACAG CGTCCGTTGC GTCAGACCCG CACCCGTAAG
CCGTTCCCTG AATCACTACC CCGTGACGAA AAGCGACTGT TGCCTGCGGC GCCGTGCTGC
CCGAACTGCG GCGGTTCACT GAGCTATCTG GGCGAGGATA CCGCCGAACA GCTGGAGTTG
ATGCGTAGCG CCTTCCGGGT TATCCGGACG GTACGGGAAA AACATGCCTG TACTCAGTGC
GATGCCATCG TGCAGGCACC TGCACCTTCG CGGCCCATCG AGCGGGGTAT CGCCGGACCG
GGGCTGCTGG CCCGCGTGCT GACCTCGAAG TATGCAGAGC ACACCCCGCT GTATCGCCAG
TCAGAAATAT ACGGCCGGCA AGGTGTGGAG CTGAGGCGTT CACTGCTGTC GGGCTGGGTG
GATGCATGCT GCCGGCTGCT GTCTCCGCTG GAAGAGGCGC TTCATGGCTA TGTCATGACT
GACGGCAAAC TCCATGCCGA TGATACCCCG GTCCAGGTAC TGCTGCCGGG TAATAAGAAG
ACGAAGACCG GGCGGTTGTG GGCGTATGTT CGTGATGACC GCAATGCAGG GTCAGCGTTG
GCACCTGCAG TGTGGTTCGC TTACAGCCCG GACAGAAAAG GCATCCATCC GCAGACTCAT
CTTGCCTGCT TCAGCGGTGT GCTGCAAGCG GATGCGTACG CCGGGTTCAA CGAGCTGTAT
CGCAATGGTG GGATAACGGA AGCTGCCTGC TGGGCTCATG CCCGCCGAAA GATCCACGAT
GTGCACGTCC GCATCCCGTC AGCACTGACG GAAGAAGCCC TGGAGCAGAT CGGTCAGTTG
TACGCCATAG AGGCGGATAT AAGGGGAATG CCGGCAGAGC AGCGGCTTGC TGAACGTCAG
CGAAAAACGA AACCGTTGTT GAAATCCCTG GAAAGCTGGT TGCGTGAAAA GATGAAGACC
CTGTCGCGAC ACTCAGAGTT GGCGAAGGCG TTCGCGTACG CACTTAACCA GTGGCCGGCA
CTGACGTACT ATGCGAACGA TGGCTGGGTG GAAATCGACA ACAACATCGC TGAAAATGCC
CTGCGGGCGG TCAGTCTGGG TCGTAAAAAC TTCCTGTTCT TCGGCTCTGA TCATGGTGGT
GAGCGGGGAG CGCTACTGTA CAGCCTGATC GGGACGTGCA AACTGAATGA CGTGGATCCA
GAAAGCTACC TTCGCCATGT GCTTGGCGTC ATAGCAGACT GGCCGGTCAA CCGGGTCAGC
GAACTGCTTC CGTGGCGCAT AGCACTGCCA GCTGAATAA
 
Protein sequence
MNDISSDDIF LLKQRLAEQE ALIHALQEKL SNREREIDHL QAQLDKLRRM NFGSRSEKVS 
RRIAQMEADL NRLQKESDTL TGRVYDPAVQ RPLRQTRTRK PFPESLPRDE KRLLPAAPCC
PNCGGSLSYL GEDTAEQLEL MRSAFRVIRT VREKHACTQC DAIVQAPAPS RPIERGIAGP
GLLARVLTSK YAEHTPLYRQ SEIYGRQGVE LRRSLLSGWV DACCRLLSPL EEALHGYVMT
DGKLHADDTP VQVLLPGNKK TKTGRLWAYV RDDRNAGSAL APAVWFAYSP DRKGIHPQTH
LACFSGVLQA DAYAGFNELY RNGGITEAAC WAHARRKIHD VHVRIPSALT EEALEQIGQL
YAIEADIRGM PAEQRLAERQ RKTKPLLKSL ESWLREKMKT LSRHSELAKA FAYALNQWPA
LTYYANDGWV EIDNNIAENA LRAVSLGRKN FLFFGSDHGG ERGALLYSLI GTCKLNDVDP
ESYLRHVLGV IADWPVNRVS ELLPWRIALP AE