Gene Information |
Locus tag | EcSMS35_0904 |
Symbol | |
ID | 6143518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 910096 |
End bp | 911121 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641615792 |
Product | IS110 family transposase |
Protein accession | YP_001742984 |
Protein GI | 170681747 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
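The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 910096
end_bp = 911121

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1026
print(protein_length)  # 341
```

Both results agree with the reported Gene Length (1026 bp) and Protein Length (341 aa).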
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.20367 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAT CAACTCTTGG TATCGACCTG GCAAAGAACG TTTTTCAGCT TCATGGTGTC GATCATGAAG GCCATACTAT TTTGCGTAAA AAGCTCACCC GGGCTAAGTT TGTTCAGTTT GTGATTCAAC TGGAACCTTG TCTGATTGGC ATGGAAGCCT GCTCATCCAG TCATTATTTT GCGCGATTAT TCACCCGCTA TGGTCATGAG GTAAAACTCA TACCTCCGCA GTATGTGAAG CCTTATGTGA AAACGAACAA GACGGATGCA GCAGATGCTG AAGCAATCTG CGAAGCGGTA ACCCGTCCGA ATATGCGTTT TGTTCAGATA AAAACCGAAG AGCAGCAGGC CGTTTTAGCG TTACACACTG AACGGGGAAT ACTTATCCGT GAGCGGATTG CCTGTGCCAA TAGTTTAAGA GCCACACTTG CTGAGTTTGG TATTACGATT GCGGCCGGAC AAAGCCATTT AACACGTGAG CTGCCAGCCA TTCTGGAGGA TGGCGAAAAT GGTTTATCTC CCTTTGTCAG AACCAGCATC TACAGACAGT CTAAACATAT CCGGGAACTT GAAGAACAAG TTAAACAGGT AGAAGAAGCT CTGGCCTCCT GGTATAGAAC GCAGGAAGCC TGCCAGAGAA TGGCCAAGAT CCCGGGGGTT GGCATGCTAA CGGCCACTTA TGTGGTAGCA GCAGTGGGTA ATGCCCGACA ATTCAGTACC GCAAAGCAGT TCGCTTCATG GCTGGGGCTG ACACCAAAGG AACATTCCAG CGGCGGGAAA CAGCAACTGG GAGGGATCAG CAAACGTGGA GATGGATATT TCCGATACCT GCTGGTTCAC GGCGCACGCG CACTTACCGC CTGGGTCAAC CGAAACGGCG CGGTTGAGGA GAATTCCTGG CTTCAGGGGC TCCTTGAGCG GAAGCACTAC AATGTAGCTG TTGTCGCCAT GGCGGCAAAA ACAGCGAGGA TCATGTGGTC AATGTTGTCA CACAATACTG AATATCAACC TCGGCAGCTC GCCTGA
|
Protein sequence | MKVSTLGIDL AKNVFQLHGV DHEGHTILRK KLTRAKFVQF VIQLEPCLIG MEACSSSHYF ARLFTRYGHE VKLIPPQYVK PYVKTNKTDA ADAEAICEAV TRPNMRFVQI KTEEQQAVLA LHTERGILIR ERIACANSLR ATLAEFGITI AAGQSHLTRE LPAILEDGEN GLSPFVRTSI YRQSKHIREL EEQVKQVEEA LASWYRTQEA CQRMAKIPGV GMLTATYVVA AVGNARQFST AKQFASWLGL TPKEHSSGGK QQLGGISKRG DGYFRYLLVH GARALTAWVN RNGAVEENSW LQGLLERKHY NVAVVAMAAK TARIMWSMLS HNTEYQPRQL A
|
| |
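The relationship between the gene and protein sequences can be spot-checked by translating a few codons. The sketch below is a minimal, dependency-free illustration, not the database's own method. It builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). The helper name `translate` and the variable names are hypothetical; only the first 30 and last 12 nucleotides of the gene sequence are used.

```python
# Codon table built in TCAG order; amino acid assignments are the same
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 12 nt of the gene sequence shown above.
first_30 = "ATGAAAGTAT CAACTCTTGG TATCGACCTG"
last_12 = "CAGCTCGCCTGA"

print(translate(first_30))  # MKVSTLGIDL
print(translate(last_12))   # QLA*
```

The first ten residues match the start of the protein sequence (MKVSTLGIDL), and the final codons give Q, L, A followed by the TGA stop, matching its end.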