Gene ECH74115_2841 details

Gene Information

Locus tag: ECH74115_2841
Symbol:
ID: 6971629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2643994
End bp: 2645532
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 58%
IMG OID: 643386689
Product: IS66 family element, transposase
Protein accession: YP_002271160
Protein GI: 209396341
COG category: [L] Replication, recombination and repair
COG ID: [COG3436] Transposase and inactivated derivatives
TIGRFAM ID:
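
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes one terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2643994, 2645532)
print(length)                     # 1539
print(protein_length_aa(length))  # 512
```

Both values agree with the Gene Length and Protein Length fields of this record.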


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0678669
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACA TCTCTTCTGA CGACATCTTC CTGCTGAAAC AGCGCCTGGC CGAACAGGAA 
GCGCTGATCC ACGCCCTGCA GGAAAAGCTG AGCAACCGGG AGCGCGAAAT AGACCATCTG
CAGGCGCAGC TGGATAAACT CCGCCGGATG AACTTCGGCA GTCGTTCCGA AAAAGTCTCC
CGCCGTATCG CACAAATGGA AGCCGATCTG AACCGGCTTC AGAAAGAGAG CGATACGCTG
ACTGGTAGGG TGTATGACCC GGCAGTACAG CGTCCGTTGC GTCAGACCCG CACCCGTAAG
CCGTTCCCTG AATCACTACC CCGTGACGAA AAGCGACTGT TGCCTGCGGC GCCGTGCTGC
CCGAACTGCG GCGGTTCACT GAGCTATCTG GGCGAGGATA CCGCCGAACA GCTGGAGTTG
ATGCGTAGCG CCTTCCGGGT TATCCGGACG GTACGGGAAA AACATGCCTG TACTCAGTGC
GATGCCATCG TGCAGGCACC TGCACCTTCG CGGCCCATCG AGCGGGGTAT CGCCGGACCG
GGGCTGCTGG CCCGCGTGCT GACCTCGAAG TATGCAGAGC ACACCCCGCT GTATCGCCAG
TCAGAAATAT ACGGCCGGCA AGGTGTGGAG CTGAGGCGTT CACTGCTGTC GGGCTGGGTG
GATGCATGCT GCCGGCTGCT GTCTCCGCTG GAAGAGGCGC TTCATGGCTA TGTCATGACT
GACGGCAAAC TCCATGCCGA TGATACCCCG GTCCAGGTAC TGCTGCCGGG TAATAAGAAG
ACGAAGACCG GGCGGTTGTG GGCGTATGTT CGTGATGACC GCAATGCAGG GTCAGCGTTG
GCACCTGCAG TGTGGTTCGC TTACAGCCCG GACAGAAAAG GCATCCATCC GCAGACTCAT
CTTGCCTGCT TCAGCGGTGT GCTGCAAGCG GATGCGTACG CCGGGTTCAA CGAGCTGTAT
CGCAATGGTG GGATAACGGA AGCTGCCTGC TGGGCTCATG CCCGCCGAAA GATCCACGAT
GTGCACGTCC GCATCCCGTC AGCACTGACG GAAGAAGCCC TGGAGCAGAT CGGTCAGTTG
TACGCCATAG AGGCGGATAT AAGGGGAATG CCGGCAGAGC AGCGGCTTGC TGAACGTCAG
CGAAAAACGA AACCGTTGTT GAAATCCCTG GAAAGCTGGT TGCGTGAAAA GATGAAGACC
CTGTCGCGAC ACTCAGAGTT GGCGAAGGCG TTCGCGTACG CACTTAACCA GTGGCCGGCA
CTGACGTACT ATGCGAACGA TGGCTGGGTG GAAATCGACA ACAACATCGC TGAAAATGCC
CTGCGGGCGG TCAGTCTGGG TCGTAAAAAC TTCCTGTTCT TCGGCTCTGA TCATGGTGGT
GAGCGGGGAG CGCTACTGTA CAGCCTGATC GGGACGTGCA AACTGAATGA CGTGGATCCA
GAAAGCTACC TTCGCCATGT GCTTGGCGTC ATAGCAGACT GGCCGGTCAA CCGGGTCAGC
GAACTGCTTC CGTGGCGCAT AGCACTGCCA GCTGAATAA
 
Protein sequence
MNDISSDDIF LLKQRLAEQE ALIHALQEKL SNREREIDHL QAQLDKLRRM NFGSRSEKVS 
RRIAQMEADL NRLQKESDTL TGRVYDPAVQ RPLRQTRTRK PFPESLPRDE KRLLPAAPCC
PNCGGSLSYL GEDTAEQLEL MRSAFRVIRT VREKHACTQC DAIVQAPAPS RPIERGIAGP
GLLARVLTSK YAEHTPLYRQ SEIYGRQGVE LRRSLLSGWV DACCRLLSPL EEALHGYVMT
DGKLHADDTP VQVLLPGNKK TKTGRLWAYV RDDRNAGSAL APAVWFAYSP DRKGIHPQTH
LACFSGVLQA DAYAGFNELY RNGGITEAAC WAHARRKIHD VHVRIPSALT EEALEQIGQL
YAIEADIRGM PAEQRLAERQ RKTKPLLKSL ESWLREKMKT LSRHSELAKA FAYALNQWPA
LTYYANDGWV EIDNNIAENA LRAVSLGRKN FLFFGSDHGG ERGALLYSLI GTCKLNDVDP
ESYLRHVLGV IADWPVNRVS ELLPWRIALP AE
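
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first row of the gene sequence. It builds the codon table from the standard TCAG ordering, which table 11 shares with the standard code for internal codons; the table-11 rule that alternative start codons are read as M at initiation is not modelled, which does not matter here because the gene starts with ATG. `translate` and `CODON_TABLE` are hypothetical names chosen for this example.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acids of the
# standard code (identical to table 11 for internal codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGAACGACA TCTCTTCTGA CGACATCTTC CTGCTGAAAC AGCGCCTGGC CGAACAGGAA"
print(translate(first_row))  # MNDISSDDIFLLKQRLAEQE
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should yield the full protein, with the final TAA codon ending translation.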