Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3630 |
Symbol | |
ID | 6967253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3350144 |
End bp | 3351352 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387425 |
Product | transposase, IS605 orfB family |
Protein accession | YP_002271882 |
Protein GI | 209400602 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000427387 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGAC TACAGGCATT TAAATTCCAG TTAAGACCAG GTGGTCAACA GGAGCGTGAA ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC GCTTCAGAAT GAGAATCATG AGGCCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT GAGTGGAAAA ACGCCACTGA AACGCAATGG CTTAAAGATG CCCCGTCACA GCCATTGCAA CAGTCACTGA AAGAGCTTGA GCGGGCTTAC AAAAACTTCT TCCGGAAGCG GGCTGCTTTT CCCCGATTCA AAAAGCGGGG ACAGAATGAT GCATTCCGCT ACCCGCAGGG CGTTAAGCTC GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTTGGCT GGATGCGCTA CCTGAACAGC CGTCAGGTCA CGGGTGTTGT GAAAAATGTC ACTGTCAGCC AGTCCTGCGG TAAGTGGTAC ATCAGTATTC AGACAGAAAG TGAAGTATCA ACTCCGGTTC ACCCTTCAGC ATCAATGGTC GGACTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGT CTTTGAGCCT GTAAACAGTT TTCAGAAAAA CCAGAAGAAG TTGGCGAGAC TTCAGCGACA GTTAAGCCGC AAGGTCAAAT TCAGCAATAA CTGGCAGAAG CAGAAACGCA AAATACAGCG ACTGCATTCC TGTACCGCAA ATATCCGCAG GGACTACCTT CACAAAGTCA CAACGACCGT CAGCAAAAAC CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT ACGGTCAGCC AGCCGGGGCG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGATACTG GATCAGGGCT GGTATGAAAT GCGCCGCCAG CTTGAGTACA AGCAGCTCTG GCGTGGCGGT CAGGTACTGG CTGTTCCGCC AGCGTACACA AGCCAGCGTT GCGCGTGCTG TGGTCATACA GCGAAAGAAA ACCGCCTGTC ACAAAGTAAA TTCAGATGCC AGGTATGTGG ATATACAGCG AACGCCGATG TAAACGGCGC TCGTAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT GGAGAGATGG TGCAGTCAGG CCGCTCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG ACAGCCTGA
|
Protein sequence | MKRLQAFKFQ LRPGGQQERE MRRFAGACRF VFNRALALQN ENHEAGNKYI PYGKMASWLV EWKNATETQW LKDAPSQPLQ QSLKELERAY KNFFRKRAAF PRFKKRGQND AFRYPQGVKL DQENSRIFLP KLGWMRYLNS RQVTGVVKNV TVSQSCGKWY ISIQTESEVS TPVHPSASMV GLDAGVAKLA TLSDGTVFEP VNSFQKNQKK LARLQRQLSR KVKFSNNWQK QKRKIQRLHS CTANIRRDYL HKVTTTVSKN HAMIVIEDLK VSNMSKSAAG TVSQPGRNVR AKSGLNRSIL DQGWYEMRRQ LEYKQLWRGG QVLAVPPAYT SQRCACCGHT AKENRLSQSK FRCQVCGYTA NADVNGARNI LAAGHAVLAC GEMVQSGRSL KQEPTEMIQA TA
|
| |