Gene EcSMS35_2286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2286 
Symbol 
ID6143500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2314068 
End bp2315093 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content49% 
IMG OID641617160 
ProductIS110 family transposase 
Protein accessionYP_001744333 
Protein GI170682362 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.438286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00653664 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGTAT CAACTCTTGG TATCGACCTG GCAAAGAACG TTTTTCAGCT TCATGGTGTC 
GATCATGAAG GCCATACTAT TTTGCGTAAA AAGCTCACCC GGGCTAAGTT TGTTCAGTTT
GTGATTCAAC TGGAACCTTG TCTGATTGGC ATGGAAGCCT GCTCATCCAG TCATTATTTT
GCGCGATTAT TCACCCGCTA TGGTCATGAG GTAAAACTCA TACCTCCGCA GTATGTGAAG
CCTTATGTGA AAACGAACAA GACGGATGCA ACAGATGCTG AAGCAATCTG CGAAGCGGTA
ACACGTCCGA ATATGCGTTT TGTTCAGATA AAAACCGAAG AGCAGCAGGC CGTTTTAGCG
TTACACACTG AACGGGGAAT ACTTATCCGT GAGCGGATTG CCTGTGCCAA TAGTTTAAGA
GCCACACTTG CTGAGTTTGG TATTACGATT GCGGCCGGAC AAAACCATTT AACCCGTGAG
CTGCCAGCCA TTCTGGAGGA TGGCGAAAAT GGTTTATCTC CCTTTGTCAG AACCAGCATC
TACAGACAGT CTAAACATAT CCGGGAACTT GAAGAACAAG TTAAACAGGT AGAAGAAGCT
CTGGCCTCCT GGTATAGAAC GCAGGAAGCC TGCCAGAGAA TGGCCAAGAT CCCGGGGGTT
GGCATGCTAA CGGCCACTTA TGTGGTAGCA GCAGTGGGTA ATGCCCGACA ATTCAGTACC
GCAAAGCAGT TCGCTTCATG GCTGGGGCTG ACACCAAAGG AACATTCCAG CGGCGGGAAA
CAGCAACTGG GAGGGATCAG CAAACGTGGA GATGGATATT TCCGATACCT GCTGGTTCAC
GGCGCACGCG CACTTACCGC CAGGGTCAAC CGAAACGGCG CGGTTGAGAA GAATTCCTGG
CTTCAGGGAC TCCTTGAGCG GAAGCACTAC AATGTAGCTG TTGTCGCCAT GGCGGCAAAA
ACAGCGAGGA TCATGTGGTC AATGTTGTCA CACAATACTG AATATCAACC TCGGCAGCTC
GCCTGA
 
Protein sequence
MKVSTLGIDL AKNVFQLHGV DHEGHTILRK KLTRAKFVQF VIQLEPCLIG MEACSSSHYF 
ARLFTRYGHE VKLIPPQYVK PYVKTNKTDA TDAEAICEAV TRPNMRFVQI KTEEQQAVLA
LHTERGILIR ERIACANSLR ATLAEFGITI AAGQNHLTRE LPAILEDGEN GLSPFVRTSI
YRQSKHIREL EEQVKQVEEA LASWYRTQEA CQRMAKIPGV GMLTATYVVA AVGNARQFST
AKQFASWLGL TPKEHSSGGK QQLGGISKRG DGYFRYLLVH GARALTARVN RNGAVEKNSW
LQGLLERKHY NVAVVAMAAK TARIMWSMLS HNTEYQPRQL A