Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1339 |
Symbol | |
ID | 6969663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1345569 |
End bp | 1347182 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385323 |
Product | IS66 family element, transposase |
Protein accession | YP_002269818 |
Protein GI | 209396105 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGGA AATACCTCAT TCGTATCACT GAGCTGGAAA GGCTGCTCTC TGAGCAGGCT GAAGCCCTCC GTCAGAAAGA TCAGCAACTG AGTCTGGTTG AAGAGACGGA GGCCTTCCTG CGCTCTGCAC TGGCACGTGC CGAGGAAAAG ATCGAAGAGG AGGAGCGGGA AATAGAACAT CTGAGGGCCC AGATAGAAAA ACTGCGCCGG ATGTTGTTCG GTACCCGTTC TGAAAAACTG CAGCGTGAGG TTGAACAGGC TGAGGCCCAA CTGAAACAAC GCGAACAGGA AAGCGATCGT TACAGTGGGC GTGAGGATGA CCCGCAGGTT CCCCGCCAGT TGCGACAGTC ACGTCATCGT CGTCCGTTAC CGGCACATCT TCCCCGTGAA ATACACCGTC TGGAGCCTGA AGAAAGTTGT TGCCCGGAGT GTGGCAGTGA GCTGGATTAT CTGGGGGAAG TCAGTGCTGA GCAGCTGGAA CTGGTGAGTA GCGCCCTGAA AGTGATCCGC ACTGTTCGGG TAAAAAAAGC CTGTACAAAA TGTGACTGTA TAGTTGAAGC GCCAGCGCCG TCCCGCCCGA TAGAGCGCGG CATCGCGGGC TCCGGATTAC TTGCCCGCGT GTTAACGGGA AAATACTGCG AACACCTGCC ACTGTATCGT CAGAGTGAAA TCTTTGCCCG CCAGGGCATC GAACTGAGCC GGGCATTACT CTCCAACTGG GTTGACGCAT GTTGTCAGTT AATGACTCTG CTGAATGATA CCCTGTACCG TTACGTGATG AACACCCGCA AGGTTCACAC TGACGACACA CCAGTAAAAG TGCTGGCACC GGGCAGAAAA AAGGCAAAAA CAGGACGCAT CTGGACGTAT GTCCGGGATG ACCGGAATGC GGGCTCATCA GAGCCACCGG CGGTCTGGTT CGCCTACTCA CCAGACAGGC AGGGAAAACA TCCGGTACAA CACCTTCGTC CCTTCCGGGG TATCCTGCAG GCGGATGCGT TCAGCGGTTA CGATCGGCTG TTCAGTGCAG AACGTGAAGG TGGTGCACTG ACAGAAGTTG CGTGCTGGGC CCATGCCCGG CGAAAAATCC ACGATGTATA CATCAGCAGC AAAAGTGCGA CGGCAGAAGA AGCCCTGAAG CGAATCAGTG AACTGTACGC CATCGAGGAT GAAATACGGG GATTACCGGA GTCAGAGCGT CTTGCCGTCA GGCAGCAGCG AAGCAAAGTG TTACTGACGT CACTGCATGA ATGGATGGTG GAGAAGAATG GTACGCTGTC GAAAAAATCC AGACTGGGCG AAGCGTGCAG CTATGTACTG AATCAGTGGG ATGCCCTCTG TTATTACAGT GATGACGGTC TGGCGGAGGC GGATAATAAT GCTGCGGAAA GAGCGCTTCG TGCAGTCTGT CTCGGAAAGA AAAACTTTAT GTTCTTTGGC AGCGATCACG GCGGCGAGCG TGGAGCACTG TTGTACGGGC TGATCGGCAC CTGCCGTCTG AACGGTATCG ATCCGGAAGC GTATCTGCGC CATATCCTGA GCGTACTGCC AGAATGGCCT TCCAACCGAG TTGATGAACT CCTGCCATGG AACGTAGTAC TCACCAATAA ATAA
|
Protein sequence | MSRKYLIRIT ELERLLSEQA EALRQKDQQL SLVEETEAFL RSALARAEEK IEEEEREIEH LRAQIEKLRR MLFGTRSEKL QREVEQAEAQ LKQREQESDR YSGREDDPQV PRQLRQSRHR RPLPAHLPRE IHRLEPEESC CPECGSELDY LGEVSAEQLE LVSSALKVIR TVRVKKACTK CDCIVEAPAP SRPIERGIAG SGLLARVLTG KYCEHLPLYR QSEIFARQGI ELSRALLSNW VDACCQLMTL LNDTLYRYVM NTRKVHTDDT PVKVLAPGRK KAKTGRIWTY VRDDRNAGSS EPPAVWFAYS PDRQGKHPVQ HLRPFRGILQ ADAFSGYDRL FSAEREGGAL TEVACWAHAR RKIHDVYISS KSATAEEALK RISELYAIED EIRGLPESER LAVRQQRSKV LLTSLHEWMV EKNGTLSKKS RLGEACSYVL NQWDALCYYS DDGLAEADNN AAERALRAVC LGKKNFMFFG SDHGGERGAL LYGLIGTCRL NGIDPEAYLR HILSVLPEWP SNRVDELLPW NVVLTNK
|
| |