Gene Information |
Locus tag | EcHS_A1516 |
Symbol | |
ID | 5594564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1524087 |
End bp | 1525295 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640920671 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001458227 |
Protein GI | 157160909 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
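The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts one terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1524087
END_BP = 1525295
GENE_LENGTH_BP = 1209
PROTEIN_LENGTH_AA = 402


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(cds_length_bp: int) -> int:
    """Codons that encode residues, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1525295 - 1524087 + 1 gives 1209 bp, and 1209 / 3 - 1 gives 402 residues, which agrees with the reported gene and protein lengths.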
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACGAT TACAGGCATT TAAATTCCAG TTAAGACCAG GTGGTCAACA GGAGTGTGAA ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC ACGTCAGAAT GAGAATCATG AGGCCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT GAGTGGAAAA ATGCCACTGA AACGCAATGG CTTAAACATT CTCCCTCACA GCCATTGCAA CAGTCACTGA AAGACCTTGA GCGGGCTTAC AAAAACTTCT TCCGGAAGCG GGCTGCTTTT CCCCGATTCA AAAAGCGGGG ACAGAATGAT GCATTCCGCT ACCCGCAGGG CGTTAAGCTC GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTGGGCT GGATGCGCTA CCGGAACAGC CGTCAGGTCA CGGGTGTTGT GAAAAATGTC ACTGTCAGCC AGTCCTGCGG TAAGTGGTAC ATCAGTATTC AGACAGAAAG TGAAGTATCA ACTCCGGTTC ACCCTTCAGC ATCAATGGTC GGGCTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGT CTTTGAGCCT GTAAACAGTT TTCAGAAAAA CCAGAAGAAG CTGGCGAGAC TTCAGCGACA GTTAAGCCGC AAGGTCAAAT TCAGCAACAA CTGGCAAAAG CAGAAACGCA AAATACAGCG ACAGCATTCC TGTATCGCAA ATATCCGCAG GGACTACCTT CACAAAGTCA CAACGGCCGT CAGCAAAAAC CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT ACGGTCAGCC AGCCGGGGCG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGATACTG GATCAGGGCT GGTATGAAAT GCGCCGCCAG CTTGCGTACA AGCAGCTCTG GCGTGGCGGT CAGGTGCTTG CTGTTCCGCC AGCGTATACA AGCCAGCGTT GCGCGTACTG TGGTCATACA GCGAAAGAGA ACCGCCTGTC ACAAAGTAAA TTCAGATGCC AGGTATGTGG ATATACAGCG AACGCCGATG TAAATGGCGC TCGCAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT GGAGAGATGG TGCAGTCAGG CCGCCCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG ACAGCCTGA
|
Protein sequence | MKRLQAFKFQ LRPGGQQECE MRRFAGACRF VFNRALARQN ENHEAGNKYI PYGKMASWLV EWKNATETQW LKHSPSQPLQ QSLKDLERAY KNFFRKRAAF PRFKKRGQND AFRYPQGVKL DQENSRIFLP KLGWMRYRNS RQVTGVVKNV TVSQSCGKWY ISIQTESEVS TPVHPSASMV GLDAGVAKLA TLSDGTVFEP VNSFQKNQKK LARLQRQLSR KVKFSNNWQK QKRKIQRQHS CIANIRRDYL HKVTTAVSKN HAMIVIEDLK VSNMSKSAAG TVSQPGRNVR AKSGLNRSIL DQGWYEMRRQ LAYKQLWRGG QVLAVPPAYT SQRCAYCGHT AKENRLSQSK FRCQVCGYTA NADVNGARNI LAAGHAVLAC GEMVQSGRPL KQEPTEMIQA TA
|
| |
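The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG can act as a start codon and is read as methionine at the initiation position, although it encodes valine elsewhere. The following is a minimal sketch of that rule, together with a GC-content helper. The helper names are hypothetical, and `START_CODONS` is only an illustrative subset of the start codons allowed by table 11.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (shared by NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: illustrative subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and stopping at '*'."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First seven codons of the gene sequence in this record.
    print(translate("GTGAAACGAT TACAGGCATT T"))
```

Applied to the first seven codons of the gene sequence, `translate` returns `MKRLQAF`, matching the start of the listed protein sequence. Passing the full gene sequence to `gc_percent` is one way to compare against the GC content reported in the table.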