Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2841 |
Symbol | |
ID | 6971629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2643994 |
End bp | 2645532 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643386689 |
Product | IS66 family element, transposase |
Protein accession | YP_002271160 |
Protein GI | 209396341 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0678669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACA TCTCTTCTGA CGACATCTTC CTGCTGAAAC AGCGCCTGGC CGAACAGGAA GCGCTGATCC ACGCCCTGCA GGAAAAGCTG AGCAACCGGG AGCGCGAAAT AGACCATCTG CAGGCGCAGC TGGATAAACT CCGCCGGATG AACTTCGGCA GTCGTTCCGA AAAAGTCTCC CGCCGTATCG CACAAATGGA AGCCGATCTG AACCGGCTTC AGAAAGAGAG CGATACGCTG ACTGGTAGGG TGTATGACCC GGCAGTACAG CGTCCGTTGC GTCAGACCCG CACCCGTAAG CCGTTCCCTG AATCACTACC CCGTGACGAA AAGCGACTGT TGCCTGCGGC GCCGTGCTGC CCGAACTGCG GCGGTTCACT GAGCTATCTG GGCGAGGATA CCGCCGAACA GCTGGAGTTG ATGCGTAGCG CCTTCCGGGT TATCCGGACG GTACGGGAAA AACATGCCTG TACTCAGTGC GATGCCATCG TGCAGGCACC TGCACCTTCG CGGCCCATCG AGCGGGGTAT CGCCGGACCG GGGCTGCTGG CCCGCGTGCT GACCTCGAAG TATGCAGAGC ACACCCCGCT GTATCGCCAG TCAGAAATAT ACGGCCGGCA AGGTGTGGAG CTGAGGCGTT CACTGCTGTC GGGCTGGGTG GATGCATGCT GCCGGCTGCT GTCTCCGCTG GAAGAGGCGC TTCATGGCTA TGTCATGACT GACGGCAAAC TCCATGCCGA TGATACCCCG GTCCAGGTAC TGCTGCCGGG TAATAAGAAG ACGAAGACCG GGCGGTTGTG GGCGTATGTT CGTGATGACC GCAATGCAGG GTCAGCGTTG GCACCTGCAG TGTGGTTCGC TTACAGCCCG GACAGAAAAG GCATCCATCC GCAGACTCAT CTTGCCTGCT TCAGCGGTGT GCTGCAAGCG GATGCGTACG CCGGGTTCAA CGAGCTGTAT CGCAATGGTG GGATAACGGA AGCTGCCTGC TGGGCTCATG CCCGCCGAAA GATCCACGAT GTGCACGTCC GCATCCCGTC AGCACTGACG GAAGAAGCCC TGGAGCAGAT CGGTCAGTTG TACGCCATAG AGGCGGATAT AAGGGGAATG CCGGCAGAGC AGCGGCTTGC TGAACGTCAG CGAAAAACGA AACCGTTGTT GAAATCCCTG GAAAGCTGGT TGCGTGAAAA GATGAAGACC CTGTCGCGAC ACTCAGAGTT GGCGAAGGCG TTCGCGTACG CACTTAACCA GTGGCCGGCA CTGACGTACT ATGCGAACGA TGGCTGGGTG GAAATCGACA ACAACATCGC TGAAAATGCC CTGCGGGCGG TCAGTCTGGG TCGTAAAAAC TTCCTGTTCT TCGGCTCTGA TCATGGTGGT GAGCGGGGAG CGCTACTGTA CAGCCTGATC GGGACGTGCA AACTGAATGA CGTGGATCCA GAAAGCTACC TTCGCCATGT GCTTGGCGTC ATAGCAGACT GGCCGGTCAA CCGGGTCAGC GAACTGCTTC CGTGGCGCAT AGCACTGCCA GCTGAATAA
|
Protein sequence | MNDISSDDIF LLKQRLAEQE ALIHALQEKL SNREREIDHL QAQLDKLRRM NFGSRSEKVS RRIAQMEADL NRLQKESDTL TGRVYDPAVQ RPLRQTRTRK PFPESLPRDE KRLLPAAPCC PNCGGSLSYL GEDTAEQLEL MRSAFRVIRT VREKHACTQC DAIVQAPAPS RPIERGIAGP GLLARVLTSK YAEHTPLYRQ SEIYGRQGVE LRRSLLSGWV DACCRLLSPL EEALHGYVMT DGKLHADDTP VQVLLPGNKK TKTGRLWAYV RDDRNAGSAL APAVWFAYSP DRKGIHPQTH LACFSGVLQA DAYAGFNELY RNGGITEAAC WAHARRKIHD VHVRIPSALT EEALEQIGQL YAIEADIRGM PAEQRLAERQ RKTKPLLKSL ESWLREKMKT LSRHSELAKA FAYALNQWPA LTYYANDGWV EIDNNIAENA LRAVSLGRKN FLFFGSDHGG ERGALLYSLI GTCKLNDVDP ESYLRHVLGV IADWPVNRVS ELLPWRIALP AE
|
| |