Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3890 |
Symbol | |
ID | 5595081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3886058 |
End bp | 3887266 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640923000 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001460477 |
Protein GI | 157163159 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACGAT TACAGGCATT TAAATTCCAG TTAAGACCAG GTGGTCAACA GGAGTGTGAA ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC ACGTCAGAAT GAGAATCATG AGGCCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT GAGTGGAAAA ATGCCACTGA AACGCAATGG CTTAAAGATT CTCCCTCACA GCCATTGCAA CAGTCACTGA AAGACCTTGA GCGGGCTTAC AAAAACTTCT TCCGGAAGCG GGCTGCTTTT CCCCGATTCA AAAAGCGGGG ACAGAATGAT GCATTCCGCT ACCCGCAGGG CGTTAAGCTC GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTGGGCT GGATGCGCTA CCGGAATAGC CGTCAGGTCA CGGGTGTTGT GAAAAATGTC ACTGTCAGCC AGTCCTGCGG TAAGTGGTAC ATCAGTATTC AGACAGAAAG TGAAGTATCC ACTCCGGCTC ACCCTTCAGC ATCAATGGTC AGGCTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGT CTTTGAGCCT GTAAACAGTT TTCAGAAAAA CCAGAAGACG CTAGCGAGAC TTCAGCGACA GTTAAGCCGC AAGGTCAAAT TCAGCAACAA CTGGCAGAAG CAGAAACGCA AAATACAGCG ACTGCATTCC TGTATCGCAA ATATCCGCAG GGACTACCTT CACAAAGTCA CAACGGCCGT CAGCAAAAAC CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT ACGGTCAGCC AGCCGGGGCG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGATACTG GATCAGGGCT GGTATGAAAT GCACCGCCAG CTTGAGTACA AGCAGCTCTG GCGTGGCGGT CAGGTGCTTG CTGTTCCGCC AGCGTATACA AGCCAGCGTT GCGCGTACTG TGGTCATACA GCGAAAGAGA ACCGCCTGTC ACAAAGTAAA TTCAGATGCC AGGTATGTGG ATATACAGCG AACGCCGATG TAAATGGCGC TCGTAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT GGAGAGATGG TGCAGTCAGG CCGCCCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG ACAGCCTGA
|
Protein sequence | MKRLQAFKFQ LRPGGQQECE MRRFAGACRF VFNRALARQN ENHEAGNKYI PYGKMASWLV EWKNATETQW LKDSPSQPLQ QSLKDLERAY KNFFRKRAAF PRFKKRGQND AFRYPQGVKL DQENSRIFLP KLGWMRYRNS RQVTGVVKNV TVSQSCGKWY ISIQTESEVS TPAHPSASMV RLDAGVAKLA TLSDGTVFEP VNSFQKNQKT LARLQRQLSR KVKFSNNWQK QKRKIQRLHS CIANIRRDYL HKVTTAVSKN HAMIVIEDLK VSNMSKSAAG TVSQPGRNVR AKSGLNRSIL DQGWYEMHRQ LEYKQLWRGG QVLAVPPAYT SQRCAYCGHT AKENRLSQSK FRCQVCGYTA NADVNGARNI LAAGHAVLAC GEMVQSGRPL KQEPTEMIQA TA
|
| |