Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0448 |
Symbol | |
ID | 6142930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 457626 |
End bp | 458834 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615342 |
Product | IS605 family transposase orfB |
Protein accession | YP_001742549 |
Protein GI | 170682659 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGAC TACAGGCATT TAAATTCCAG TTAAGACCCG GTGGTCAACA GGAGCATGAA ATGAGGCGCT TCGCAGGCGC ATGTCGTTTC GTTTTCAATC GTGCTCTGGC GCTTCAGAAT GAGAATCATG AGGCCGGTAA TAAATACATC CCTTACGGGA AAATGGCTTC CTGGCTGGTT GAGTGGAAAA ACGCCACTGG AACGCAATGG CTTAAAGATT CGCCCTCACA GCCATTGCAA CAGTCACTGA AAGACCTTGA GCGGGCTTAC AAAAACTTCT TCCAGAAGCG GGCTGCTTTT CCCCGATTCA AAAAGCGGGG ACAGAATGAT GCATTCCGCT ACCCGCAGGG CGTTAAGCTC GATCAGGAAA ACAGCCGTAT TTTTCTGCCG AAACTGGGCT GGATGCGCTA CCGGAACAGC CGCCAGGTCA CGGGTGTTGT GAAAAATGTC ACTGTCAGCC AGTCCTGCGG TAAGTGGTAC ATCAGTATTC AGACAGAAAG TGAAGTATCC ACTCCGGTTC ACCCTTCAGC ATCAATGGTC GGGCTGGATG CTGGCGTGGC TAAACTCGCC ACGCTGTCAG ATGGCACAGT CTTTGAGCCT GTAAACAGTT TTCAGAAAAA CCAGAAGACG CTGGCGAGAC TTCAGCGACA GTTAAGCCGC AAGGTCGAAT TCAGCAATAA CTGGCAGAAG CAGAAACGCA AAATACAGCG ACTGCATTCC CGTATCGCAA ATATCCGCAG GGACTACCTT CACAAAGTCA CAACGACCGT CAGCAAAAAC CACGCAATGA TAGTCATTGA GGATTTGAAG GTCAGCAACA TGTCAAAGTC AGCAGCGGGT ACGGTCAGCC AGCCGGGACG CAATGTCCGG GCAAAATCAG GTTTAAACCG TTCGGTACTG GATCAGGGCT GGTATGAAAT GCGCCGCCAG CTTGAGTACA AGCAGCTCTG GCGTGGCGGT CAGGTGCTTG CTGTTCCGCC AGCGTACACA AGCCAGCGTT GCACGTGCTG TGGTCATACA GCGAAAGAAA ATCGCCTGTC ACAAAGTCAA TTCAGATGCC AGGTATGTGG ATATACAGCG AACGCCGATG TAAATGGCGC TCGTAACATT TTAGCGGCGG GGCACGCCGT TCTTGCCTGT GGAGGGATGG TGCAGTCAGG CCGCCCGTTG AAGCAGGAAC CCACCGAAAT GATTCAGGCG ACAGCCTGA
|
Protein sequence | MKRLQAFKFQ LRPGGQQEHE MRRFAGACRF VFNRALALQN ENHEAGNKYI PYGKMASWLV EWKNATGTQW LKDSPSQPLQ QSLKDLERAY KNFFQKRAAF PRFKKRGQND AFRYPQGVKL DQENSRIFLP KLGWMRYRNS RQVTGVVKNV TVSQSCGKWY ISIQTESEVS TPVHPSASMV GLDAGVAKLA TLSDGTVFEP VNSFQKNQKT LARLQRQLSR KVEFSNNWQK QKRKIQRLHS RIANIRRDYL HKVTTTVSKN HAMIVIEDLK VSNMSKSAAG TVSQPGRNVR AKSGLNRSVL DQGWYEMRRQ LEYKQLWRGG QVLAVPPAYT SQRCTCCGHT AKENRLSQSQ FRCQVCGYTA NADVNGARNI LAAGHAVLAC GGMVQSGRPL KQEPTEMIQA TA
|
| |