Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5722 |
Symbol | |
ID | 6967121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5357410 |
End bp | 5358618 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643389355 |
Product | transposase, IS605 orfB family |
Protein accession | YP_002273748 |
Protein GI | 209397707 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000324341 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGAC TACAGGCATT TAAATTCCAG TTAAGACCCG GTGGTCAACA GGAGCGTGAA ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC GCTTCAGAAT GAGAATCATG AGGCCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT GAGTGGAAAA ATGCCACTGA AACGCAATGG CTTAAAGATG CCCCGTCACA GCCATTGCAA CAGTCACTGA AAGACCTTGA GCGGGCTTAC AAAAACTTCT TCCGGAAGCG GGCTGCTTTT CCCCGATTCA AAAAGCGGGG ACAGAATGAT GCATTCCGCT ACCCGCAGGG CGTTAAGCTC GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTTGGCT GGATGCGCTA CCGGAATAGC CGTCAGGTCA CGGGTGTTGT GAAAAATGTC ACTGCCAGCC AGTCCTGCGG TAAGTGGTAC ATCAGTATTC AGACTGAAAA TGAAGTATCA ACTCCGGTTC ACCCTTCAGC ATTAATGGTC GGACTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGT CTTTGGACCT GTAAACAGTT TTCAGAAAAA CCAGAAGACG CTGGCGAGAC TTCAGCGACA GTTAAGCCGC AAGGTCAAAT TCAGCAACAA CTGGCAGAAG CAGAAACGCA AAATACAGCG ACTGCATTCC TGTATCGCAA ATATCCGCAG GGACTACCTT CACAAAGTCA CAACGACCGT CAGCAAAAAC CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT ACGGTCAGCC AGCCGGGGCG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGATACTG GATCAGGGCT GGTATGAAAT GCGCCGCCAG CTTGAGTACA AGCAGCTCTG GCGTGGCGGT CAGGTGCTTG CTGTTCCGCC AGCGTACACA AGCCAGCGTT GCGCGTGCTG TGGTCATACA GCGAAAGAAA ATCGCCTGTC ACAAAGTAAA TTCAGATGCC AGGCATGTGG ATATACAGCG AACGCTGATG TAAATGGCGC TCGTAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT GGGGAGATGG TGCAGTCAGG CCGCCCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG ACAGCCTGA
|
Protein sequence | MKRLQAFKFQ LRPGGQQERE MRRFAGACRF VFNRALALQN ENHEAGNKYI PYGKMASWLV EWKNATETQW LKDAPSQPLQ QSLKDLERAY KNFFRKRAAF PRFKKRGQND AFRYPQGVKL DQENSRIFLP KLGWMRYRNS RQVTGVVKNV TASQSCGKWY ISIQTENEVS TPVHPSALMV GLDAGVAKLA TLSDGTVFGP VNSFQKNQKT LARLQRQLSR KVKFSNNWQK QKRKIQRLHS CIANIRRDYL HKVTTTVSKN HAMIVIEDLK VSNMSKSAAG TVSQPGRNVR AKSGLNRSIL DQGWYEMRRQ LEYKQLWRGG QVLAVPPAYT SQRCACCGHT AKENRLSQSK FRCQACGYTA NADVNGARNI LAAGHAVLAC GEMVQSGRPL KQEPTEMIQA TA
|
| |