Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0023 |
Symbol | |
ID | 6068477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 23681 |
End bp | 24889 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641599427 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001723037 |
Protein GI | 170018083 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGAT TACAGGCATT TAAATTCCAG TTAAGACCCG GTGGTCAACA GGAGCGTGAA ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC GCTTCAGAAT GAGAATCATG AGGCCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT GAGTGGAAAA ACGCCACTGA AACGCAATGG CTTAAAGATT CGCCCTCACA GCCATTGCAA CAGTCACTGA AAGGCCTTGA GCGGGCTTAC AAAAACTTCT TCCAGAAGCG GGCTGCTTTT CCCCGATTCA AAAAGCGGGG ACAGAATGAT GCATTCCGCT ACCCGCAGGG CGTTAAGCTC GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTGGGCT GGATGCGCTA CCGGAACAGC CGTCAGGTCA CGGGTGTTGT GAAAAATGTC ACTGTCAGCC AGTCCTGCGG TAAGTGGTAC ATCAGTATTC AGACAGAAAG TGAAGTATCA ACTCCGGTTC ACCCTTCAGC ATCAATGATC GGGCTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGT CTTTGAGCCT GTAAACAGTT TTCAGAAAAA CCAGAAGACG CTGGCGAGAC TTCAGCGACA GTTAAGCCGC AAGGTCAAAT TCAGCAACAA CTGGCAGAAG CAGAAACGCA AAATACAGCG ACTGCATTCC TGTATCGCAA ATATCCGCAG GGACTACCTT CACAAAGTCA CAACGACCGT CAGCAAAAAC CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT ACGGTCAGCC TGCCGGGGCG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGATACTG GATCAGGGCT GGTATGAAAT ACGCCGCCAG CTTGCGTACA AGCAGCTCTG GCGTGGTGGT CAGGTGCTTG CTGTTCCGCC AGCGTACACA AGCCAGCGTT GCGTGTGCTG TGGTCATACA GCGAAAGAAA ATCGCCTGTC ACAAAGTAAA TTCAGATGCC AGGTATGTGG ATATACAGCG AACGCCGATG TAAATGGCGC TCGTAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT GGAGAGATGG TGCAGTCAGG CCGCTCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG ACAGCCTGA
|
Protein sequence | MKRLQAFKFQ LRPGGQQERE MRRFAGACRF VFNRALALQN ENHEAGNKYI PYGKMASWLV EWKNATETQW LKDSPSQPLQ QSLKGLERAY KNFFQKRAAF PRFKKRGQND AFRYPQGVKL DQENSRIFLP KLGWMRYRNS RQVTGVVKNV TVSQSCGKWY ISIQTESEVS TPVHPSASMI GLDAGVAKLA TLSDGTVFEP VNSFQKNQKT LARLQRQLSR KVKFSNNWQK QKRKIQRLHS CIANIRRDYL HKVTTTVSKN HAMIVIEDLK VSNMSKSAAG TVSLPGRNVR AKSGLNRSIL DQGWYEIRRQ LAYKQLWRGG QVLAVPPAYT SQRCVCCGHT AKENRLSQSK FRCQVCGYTA NADVNGARNI LAAGHAVLAC GEMVQSGRSL KQEPTEMIQA TA
|
| |