Gene EcSMS35_0904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0904 
Symbol 
ID6143518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp910096 
End bp911121 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content50% 
IMG OID641615792 
ProductIS110 family transposase 
Protein accessionYP_001742984 
Protein GI170681747 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.20367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAT CAACTCTTGG TATCGACCTG GCAAAGAACG TTTTTCAGCT TCATGGTGTC 
GATCATGAAG GCCATACTAT TTTGCGTAAA AAGCTCACCC GGGCTAAGTT TGTTCAGTTT
GTGATTCAAC TGGAACCTTG TCTGATTGGC ATGGAAGCCT GCTCATCCAG TCATTATTTT
GCGCGATTAT TCACCCGCTA TGGTCATGAG GTAAAACTCA TACCTCCGCA GTATGTGAAG
CCTTATGTGA AAACGAACAA GACGGATGCA GCAGATGCTG AAGCAATCTG CGAAGCGGTA
ACCCGTCCGA ATATGCGTTT TGTTCAGATA AAAACCGAAG AGCAGCAGGC CGTTTTAGCG
TTACACACTG AACGGGGAAT ACTTATCCGT GAGCGGATTG CCTGTGCCAA TAGTTTAAGA
GCCACACTTG CTGAGTTTGG TATTACGATT GCGGCCGGAC AAAGCCATTT AACACGTGAG
CTGCCAGCCA TTCTGGAGGA TGGCGAAAAT GGTTTATCTC CCTTTGTCAG AACCAGCATC
TACAGACAGT CTAAACATAT CCGGGAACTT GAAGAACAAG TTAAACAGGT AGAAGAAGCT
CTGGCCTCCT GGTATAGAAC GCAGGAAGCC TGCCAGAGAA TGGCCAAGAT CCCGGGGGTT
GGCATGCTAA CGGCCACTTA TGTGGTAGCA GCAGTGGGTA ATGCCCGACA ATTCAGTACC
GCAAAGCAGT TCGCTTCATG GCTGGGGCTG ACACCAAAGG AACATTCCAG CGGCGGGAAA
CAGCAACTGG GAGGGATCAG CAAACGTGGA GATGGATATT TCCGATACCT GCTGGTTCAC
GGCGCACGCG CACTTACCGC CTGGGTCAAC CGAAACGGCG CGGTTGAGGA GAATTCCTGG
CTTCAGGGGC TCCTTGAGCG GAAGCACTAC AATGTAGCTG TTGTCGCCAT GGCGGCAAAA
ACAGCGAGGA TCATGTGGTC AATGTTGTCA CACAATACTG AATATCAACC TCGGCAGCTC
GCCTGA
 
Protein sequence
MKVSTLGIDL AKNVFQLHGV DHEGHTILRK KLTRAKFVQF VIQLEPCLIG MEACSSSHYF 
ARLFTRYGHE VKLIPPQYVK PYVKTNKTDA ADAEAICEAV TRPNMRFVQI KTEEQQAVLA
LHTERGILIR ERIACANSLR ATLAEFGITI AAGQSHLTRE LPAILEDGEN GLSPFVRTSI
YRQSKHIREL EEQVKQVEEA LASWYRTQEA CQRMAKIPGV GMLTATYVVA AVGNARQFST
AKQFASWLGL TPKEHSSGGK QQLGGISKRG DGYFRYLLVH GARALTAWVN RNGAVEENSW
LQGLLERKHY NVAVVAMAAK TARIMWSMLS HNTEYQPRQL A