Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2272 |
Symbol | |
ID | 6144701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2292586 |
End bp | 2294205 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617147 |
Product | ISL3 family transposase |
Protein accession | YP_001744320 |
Protein GI | 170679893 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00373654 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.467747 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAACG CAATGCACTC TCTTAAGACA CTTCTACAGT TACCTTGCGG ATGGCGATGC AGTCGACAAA TTATTAGCTC TGACGGTATC ACCCTCCATC TCCACGGAAA ACGCAAAACA GCACAATGTC CTGAATGCTC TAAGCGTAGC GACTCTGTTC ATAGTTCTCG TCGGCGCCGG ATACAGCATC TACCCTGCTC CGGGCAGACG CTATGGCTTG TATTTTCCGT CCGCCACTGG TACTGCCGTA ACCCTGTTTG TTCACGAAAA ATTTTTGCCG AGTCGCTTGC TCCCTTCGCC GGTTCACACC AGCAGTCTTC ACAGGCGTTA CAAAATTTAC AACGTCAACT GGGATTAATA GCCGGAGGTG AGGCTGGAAA ACGGGCTGCA ACGGCAGTGG GTCTCCGTTG CAGTGCAGAT ACTCTTCTTC GCAGGGTTAT CAATACCCCG GGGACGAAAC AGTCAGGCGC GCCTCATGTC GGTATTGATG AGTGGGCGTG GCATCGGGGC CACCGTTACG GTAAGTTAAT CGTCAATCTT GATACTCACC GTCCCCTCGT CCTGCTTCCC GGTCGTGATC AGCGTACGCT GGCGACCTGG TTCAGAAAAT ATCCGGAAAT ACAGGTTGTC TCGCGTGATC GCAGTGGAGT CTATGCAACA GCAGCACGTG AAGGTGCACC TCAGGCCAGA CAGGTGGCCG ATCGATGGCA CCTGCTAAAA AATATTGGCG ATGCGCTTGA ACGAATGATG TACAGACATA TACCTCTGAT ACGTCTTGTT GCCAGTGAGT TGTCACTAAA GAAATCACCT GAGCCAGAAC TGTCTGTGCC TGCAGTATCG CTCCGTCGTC CGGAACGCCT TAAACAGCAA ACCCGCAAAA AACGGCATCA GCGTTGGACA GAGGTTATGG CCCTGCATAA CAAGGGATGT AGTTTCAGGG AAATATCCCG TATTACAGGC CTGTCGCGTG TGACAGTCAG TCGCTGGGTG CGTTCAGGAA CATTCCCTGA AATGTCAACC CGACCTCCAA AGCGAGGGCT TCTGGACCCA TGGAGGGAGT GGTTAAAAGA GCAACGAGAA AGCGGTAATT ATAACGCCAG CCGGATATGG CGGGAAATGG TGGCCCGGGG GTTTACAGGC AGTGAAACCA TCGTCAGGGA TGCTGTTGCC AAATGGCGTA AAGGCTGGAT CCCACCGGTT ACTACTGCCG CCAGACTTCC TTCAGTGTCC CGGGTAAGCC GGTGGTTGAT GCCCTGGAGA ATAATCAGGG GGGAAGAAAA TTATGCTTCC CGATTTATTA GTCTGATGTG TGAAAAAGAA CCGGAGCTGA AAATAGCGCA GCAACTGGTA CTCGAGTTCT ACCGTATTCT GAAAACCCAA AATAAATCAC AGCTTAGCAG CTGGTTCACT CGAGTCCACG AAAGCGGCTC AGCAGAACTT CGGCGCGTGG CTGCGGGGAT GGAAGCTGAT GCTGCGGCTA TATGTGAGGC AATCAGCAGT CGCTGGAGTA ATGGTGTTGT CGAAGGTCAT GTAAATCGCC TGAAGATGTT GAAACGCCAG ATGTATGGTC GAGCCGGATT TGAACTGCTC AGGCAGAGGG TCATGAGTCC ACTGGCATGA
|
Protein sequence | MGNAMHSLKT LLQLPCGWRC SRQIISSDGI TLHLHGKRKT AQCPECSKRS DSVHSSRRRR IQHLPCSGQT LWLVFSVRHW YCRNPVCSRK IFAESLAPFA GSHQQSSQAL QNLQRQLGLI AGGEAGKRAA TAVGLRCSAD TLLRRVINTP GTKQSGAPHV GIDEWAWHRG HRYGKLIVNL DTHRPLVLLP GRDQRTLATW FRKYPEIQVV SRDRSGVYAT AAREGAPQAR QVADRWHLLK NIGDALERMM YRHIPLIRLV ASELSLKKSP EPELSVPAVS LRRPERLKQQ TRKKRHQRWT EVMALHNKGC SFREISRITG LSRVTVSRWV RSGTFPEMST RPPKRGLLDP WREWLKEQRE SGNYNASRIW REMVARGFTG SETIVRDAVA KWRKGWIPPV TTAARLPSVS RVSRWLMPWR IIRGEENYAS RFISLMCEKE PELKIAQQLV LEFYRILKTQ NKSQLSSWFT RVHESGSAEL RRVAAGMEAD AAAICEAISS RWSNGVVEGH VNRLKMLKRQ MYGRAGFELL RQRVMSPLA
|
| |