Gene Information |
Locus tag | EcSMS35_A0067 |
Symbol | |
ID | 6106569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010488 |
Strand | + |
Start bp | 50338 |
End bp | 51309 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641614814 |
Product | IS110 family transposase |
Protein accession | YP_001739955 |
Protein GI | 170650843 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
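
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 50338
end_bp = 51309

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 972 323
```

Under those assumptions the result agrees with the listed Gene Length (972 bp) and Protein Length (323 aa).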
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0110214 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAAC CAAATCTGCA ATGCATGGGT ATTGATGTTG CCAAACTATC GCTGGACATC GCCACCACCG ACACGATTGA GCCATTCACT GTGGGTAACG ATGAGGATGG TTTCGCTGTT ATCACAGATA AACTGAAGCA CACCAAAATT AACCTGATTC TCATGGAAGC TACCGGTGGC CTTGAAGCAG CCATTGCCTG TAAGCTTCAG TCAGAAGGAT ACGATGTGGT TGTGATCAAC CCACGACAGG CTAGGGATTT TGCCCGTTCA ATGGGATATC TGGCTAAAAC AGATAAACTT GACGCCGCCA TGCTAGCACA ACTGGCCCTG GTCATTGATC GCCATCCGGA CCGCAGTCGT TATATACGGC ATCTGCCAGA TGAGGCACGA GCAGTACTTG CCGCAATGGT CGTCCGTCGT CGTCAGTTGA ACCATATGCT GGTCGCTGAG CGTAATCGTC TCTATCCTTC TCATCCCCAA AGCAGGAAGA GTATCGATAA CATTATTGAT GCGCTTCAAA ACGAGCTCGA CCGGATCAAT GAGCAAATGA AACAACACAT GACAGCATTC TTCCAGGAGC AGGCCAGACT GATAGGCAGC GTGAAAGGCG TCGGCGATAT CACCGTCGCG TCGCTGATTG CCGAACTACC GGAACAGGGG AAACTCAATC GACGGGAGAT TAGTGCTCTA ACTGGCGTCG CTCCTCTAAA CAGAGACTCC GGGAAAATGC GAGGGAAACG GACCACGTTT GGTGGCAGAG CCGGAGTGAG AGCAACGTTG AACATGGCAG CTCTGGTGGC TACGCAGTTT AATCCTGCCA TAAAGCTGTT CTACCAGCGT TTGCTTGCCG CCGGAAAACC CAAAAAACTT GCTCTGGTCG CCTGCATGCG CAAACTCATC ACCATTCTGA ATACCATGCT CAGAAAAGGG GAAGAGTGGA ACGCCTCATT TCAATCACAG GTAATCTCAT GA
|
Protein sequence | MSQPNLQCMG IDVAKLSLDI ATTDTIEPFT VGNDEDGFAV ITDKLKHTKI NLILMEATGG LEAAIACKLQ SEGYDVVVIN PRQARDFARS MGYLAKTDKL DAAMLAQLAL VIDRHPDRSR YIRHLPDEAR AVLAAMVVRR RQLNHMLVAE RNRLYPSHPQ SRKSIDNIID ALQNELDRIN EQMKQHMTAF FQEQARLIGS VKGVGDITVA SLIAELPEQG KLNRREISAL TGVAPLNRDS GKMRGKRTTF GGRAGVRATL NMAALVATQF NPAIKLFYQR LLAAGKPKKL ALVACMRKLI TILNTMLRKG EEWNASFQSQ VIS
|
| |
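
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. This is not the tool that produced the record; the helper `translate` and the constant names are hypothetical. It assumes the elongation codon assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG), and it raises `KeyError` on any codon containing characters other than A, C, G, or T.

```python
from itertools import product

# Codon-to-amino-acid assignments listed in TCAG order (first base varies
# slowest, third base fastest). Assumption: table 11 shares these
# assignments with the standard code for non-start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
gene_start = "ATGAGTCAAC CAAATCTGCA ATGCATGGGT"
print(translate(gene_start))  # prints: MSQPNLQCMG
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.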