Gene Information |
Locus tag | EcSMS35_1079 |
Symbol | |
ID | 6143142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1094513 |
End bp | 1095496 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615966 |
Product | ISL3 family transposase |
Protein accession | YP_001743158 |
Protein GI | 170680811 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
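The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which matches the reported 984 bp) and that the last codon of the CDS is a stop codon. The function names `gene_length` and `coding_codons` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1094513
END_BP = 1095496


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                 # 984, matching "Gene Length"
    print(coding_codons(length_bp))  # 327, matching "Protein Length"
```

Under these assumptions the three fields agree: 984 bp is 328 codons, and dropping the stop codon leaves 327 amino acids.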
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.464356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTCTCC GTTGCAGTGC TGATACTCTT CTTCGCAGGC TTATCAATAC CCCGGAGACG AAACAGTCAG GCGCGCCTCA TGTCGGTATT GATGAGTGGG CGTGGCATCG GGGCCACTGC TGCGGTATGT TAATAGTCAA TCCTGATACT CACCGTCCCC TCGTCCTGCT TCCCGGCCGT GATCAGCGTA CGCTGGCGAC CTGGTTCAGA AAATATCCGG AAATACAGGT TGTCTCGCGT GATCGCAGTG GAGTCTATGC GACAGCAGCA CGTGAAGGTG CACCTCAGGC CAGACAGGTG GCCGATCGAT GGCACCTGCT AAAAAATATT GGTGATGAGC CTGAACGAAT GATGTACAGA CATATGCCTC TGATACGTCT TGTTGTCAGA GAGTTATCAC TGAAGAAATC ACCTGAGCCA GAAATATCTG TGCCTGTAGC ATCGCTCCGT CGTCTGGAAC GCCTTAAACA GCACATCCGC AAAAAACGGC ATCAGCGTTG GACAGAGGTT ATGGCCCTGC ATAACAAGGG ATGTAGTTTC AGGGAAATAT CCCGTATTAC AGGCCTGTCG CGAGTGACAG TCAGTCGCTG GGTGGGTTCA GGAACATTCC CTGAAATGTC AACCAGGCCT CCAAAGCGAG GGCTTCTGGA CCCATGGAGG GAGTGGTTAA AAGAGCAACG AGAATGTGGT AATTATAACT CCGGCCGGAT ATGGCGGGAA ATGGTGGCCA GGGGGGTTAC AGGCAGTGAA ACCATCGTCA GGGATGCTGT TGCCAAATGG CATAAAGGCT GGATCCCACC GGTTACTACT GCCGCAAGAC TTCCTTCAGT GTCCCGGGTA AGCCGCTGGT TGATGCCCTG GAGAATAATC AGGGGTGAAG AAAATTATGC TTTCCGATTT ATTAGTCTGA TGTGTGAAAA AGAACCGGAG TTGAAAATAG CGCAGCAACT GGTACTCGAG TTCTACCGTA TTCTGAAAAC CTAA
|
Protein sequence | MGLRCSADTL LRRLINTPET KQSGAPHVGI DEWAWHRGHC CGMLIVNPDT HRPLVLLPGR DQRTLATWFR KYPEIQVVSR DRSGVYATAA REGAPQARQV ADRWHLLKNI GDEPERMMYR HMPLIRLVVR ELSLKKSPEP EISVPVASLR RLERLKQHIR KKRHQRWTEV MALHNKGCSF REISRITGLS RVTVSRWVGS GTFPEMSTRP PKRGLLDPWR EWLKEQRECG NYNSGRIWRE MVARGVTGSE TIVRDAVAKW HKGWIPPVTT AARLPSVSRV SRWLMPWRII RGEENYAFRF ISLMCEKEPE LKIAQQLVLE FYRILKT
|
| |
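The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute GC content and inspect the first and last codons. The helper names `clean`, `gc_percent` and `terminal_codons` are hypothetical, and the short strings are excerpts from the Gene sequence field, not the full record. Note that this gene starts with GTG, an alternative start codon under translation table 11, which is why the protein still begins with M.

```python
def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def terminal_codons(seq: str) -> tuple[str, str]:
    """Return the first and last three bases of the cleaned sequence."""
    s = clean(seq)
    return s[:3], s[-3:]


if __name__ == "__main__":
    # Excerpts only: the first three blocks and the last two blocks
    # of the Gene sequence field.
    head = "GTGGGTCTCC GTTGCAGTGC TGATACTCTT"
    tail = "TTCTGAAAAC CTAA"
    print(clean(head))
    print(round(gc_percent(head), 1))
    print(terminal_codons(head + " " + tail))
```

Running `gc_percent` on the complete Gene sequence field should give a value comparable to the GC content reported in the table, assuming that figure was computed over the same span.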