Gene EcSMS35_2272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2272 
Symbol 
ID6144701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2292586 
End bp2294205 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content52% 
IMG OID641617147 
ProductISL3 family transposase 
Protein accessionYP_001744320 
Protein GI170679893 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00373654 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.467747 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAACG CAATGCACTC TCTTAAGACA CTTCTACAGT TACCTTGCGG ATGGCGATGC 
AGTCGACAAA TTATTAGCTC TGACGGTATC ACCCTCCATC TCCACGGAAA ACGCAAAACA
GCACAATGTC CTGAATGCTC TAAGCGTAGC GACTCTGTTC ATAGTTCTCG TCGGCGCCGG
ATACAGCATC TACCCTGCTC CGGGCAGACG CTATGGCTTG TATTTTCCGT CCGCCACTGG
TACTGCCGTA ACCCTGTTTG TTCACGAAAA ATTTTTGCCG AGTCGCTTGC TCCCTTCGCC
GGTTCACACC AGCAGTCTTC ACAGGCGTTA CAAAATTTAC AACGTCAACT GGGATTAATA
GCCGGAGGTG AGGCTGGAAA ACGGGCTGCA ACGGCAGTGG GTCTCCGTTG CAGTGCAGAT
ACTCTTCTTC GCAGGGTTAT CAATACCCCG GGGACGAAAC AGTCAGGCGC GCCTCATGTC
GGTATTGATG AGTGGGCGTG GCATCGGGGC CACCGTTACG GTAAGTTAAT CGTCAATCTT
GATACTCACC GTCCCCTCGT CCTGCTTCCC GGTCGTGATC AGCGTACGCT GGCGACCTGG
TTCAGAAAAT ATCCGGAAAT ACAGGTTGTC TCGCGTGATC GCAGTGGAGT CTATGCAACA
GCAGCACGTG AAGGTGCACC TCAGGCCAGA CAGGTGGCCG ATCGATGGCA CCTGCTAAAA
AATATTGGCG ATGCGCTTGA ACGAATGATG TACAGACATA TACCTCTGAT ACGTCTTGTT
GCCAGTGAGT TGTCACTAAA GAAATCACCT GAGCCAGAAC TGTCTGTGCC TGCAGTATCG
CTCCGTCGTC CGGAACGCCT TAAACAGCAA ACCCGCAAAA AACGGCATCA GCGTTGGACA
GAGGTTATGG CCCTGCATAA CAAGGGATGT AGTTTCAGGG AAATATCCCG TATTACAGGC
CTGTCGCGTG TGACAGTCAG TCGCTGGGTG CGTTCAGGAA CATTCCCTGA AATGTCAACC
CGACCTCCAA AGCGAGGGCT TCTGGACCCA TGGAGGGAGT GGTTAAAAGA GCAACGAGAA
AGCGGTAATT ATAACGCCAG CCGGATATGG CGGGAAATGG TGGCCCGGGG GTTTACAGGC
AGTGAAACCA TCGTCAGGGA TGCTGTTGCC AAATGGCGTA AAGGCTGGAT CCCACCGGTT
ACTACTGCCG CCAGACTTCC TTCAGTGTCC CGGGTAAGCC GGTGGTTGAT GCCCTGGAGA
ATAATCAGGG GGGAAGAAAA TTATGCTTCC CGATTTATTA GTCTGATGTG TGAAAAAGAA
CCGGAGCTGA AAATAGCGCA GCAACTGGTA CTCGAGTTCT ACCGTATTCT GAAAACCCAA
AATAAATCAC AGCTTAGCAG CTGGTTCACT CGAGTCCACG AAAGCGGCTC AGCAGAACTT
CGGCGCGTGG CTGCGGGGAT GGAAGCTGAT GCTGCGGCTA TATGTGAGGC AATCAGCAGT
CGCTGGAGTA ATGGTGTTGT CGAAGGTCAT GTAAATCGCC TGAAGATGTT GAAACGCCAG
ATGTATGGTC GAGCCGGATT TGAACTGCTC AGGCAGAGGG TCATGAGTCC ACTGGCATGA
 
Protein sequence
MGNAMHSLKT LLQLPCGWRC SRQIISSDGI TLHLHGKRKT AQCPECSKRS DSVHSSRRRR 
IQHLPCSGQT LWLVFSVRHW YCRNPVCSRK IFAESLAPFA GSHQQSSQAL QNLQRQLGLI
AGGEAGKRAA TAVGLRCSAD TLLRRVINTP GTKQSGAPHV GIDEWAWHRG HRYGKLIVNL
DTHRPLVLLP GRDQRTLATW FRKYPEIQVV SRDRSGVYAT AAREGAPQAR QVADRWHLLK
NIGDALERMM YRHIPLIRLV ASELSLKKSP EPELSVPAVS LRRPERLKQQ TRKKRHQRWT
EVMALHNKGC SFREISRITG LSRVTVSRWV RSGTFPEMST RPPKRGLLDP WREWLKEQRE
SGNYNASRIW REMVARGFTG SETIVRDAVA KWRKGWIPPV TTAARLPSVS RVSRWLMPWR
IIRGEENYAS RFISLMCEKE PELKIAQQLV LEFYRILKTQ NKSQLSSWFT RVHESGSAEL
RRVAAGMEAD AAAICEAISS RWSNGVVEGH VNRLKMLKRQ MYGRAGFELL RQRVMSPLA