Gene EcSMS35_A0067 details

Gene Information

Locus tag: EcSMS35_A0067
Symbol:
ID: 6106569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 50338
End bp: 51309
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 51%
IMG OID: 641614814
Product: IS110 family transposase
Protein accession: YP_001739955
Protein GI: 170650843
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
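
The coordinates and lengths in the record above are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 50338
end_bp = 51309

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```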


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.0110214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAAC CAAATCTGCA ATGCATGGGT ATTGATGTTG CCAAACTATC GCTGGACATC 
GCCACCACCG ACACGATTGA GCCATTCACT GTGGGTAACG ATGAGGATGG TTTCGCTGTT
ATCACAGATA AACTGAAGCA CACCAAAATT AACCTGATTC TCATGGAAGC TACCGGTGGC
CTTGAAGCAG CCATTGCCTG TAAGCTTCAG TCAGAAGGAT ACGATGTGGT TGTGATCAAC
CCACGACAGG CTAGGGATTT TGCCCGTTCA ATGGGATATC TGGCTAAAAC AGATAAACTT
GACGCCGCCA TGCTAGCACA ACTGGCCCTG GTCATTGATC GCCATCCGGA CCGCAGTCGT
TATATACGGC ATCTGCCAGA TGAGGCACGA GCAGTACTTG CCGCAATGGT CGTCCGTCGT
CGTCAGTTGA ACCATATGCT GGTCGCTGAG CGTAATCGTC TCTATCCTTC TCATCCCCAA
AGCAGGAAGA GTATCGATAA CATTATTGAT GCGCTTCAAA ACGAGCTCGA CCGGATCAAT
GAGCAAATGA AACAACACAT GACAGCATTC TTCCAGGAGC AGGCCAGACT GATAGGCAGC
GTGAAAGGCG TCGGCGATAT CACCGTCGCG TCGCTGATTG CCGAACTACC GGAACAGGGG
AAACTCAATC GACGGGAGAT TAGTGCTCTA ACTGGCGTCG CTCCTCTAAA CAGAGACTCC
GGGAAAATGC GAGGGAAACG GACCACGTTT GGTGGCAGAG CCGGAGTGAG AGCAACGTTG
AACATGGCAG CTCTGGTGGC TACGCAGTTT AATCCTGCCA TAAAGCTGTT CTACCAGCGT
TTGCTTGCCG CCGGAAAACC CAAAAAACTT GCTCTGGTCG CCTGCATGCG CAAACTCATC
ACCATTCTGA ATACCATGCT CAGAAAAGGG GAAGAGTGGA ACGCCTCATT TCAATCACAG
GTAATCTCAT GA
 
Protein sequence
MSQPNLQCMG IDVAKLSLDI ATTDTIEPFT VGNDEDGFAV ITDKLKHTKI NLILMEATGG 
LEAAIACKLQ SEGYDVVVIN PRQARDFARS MGYLAKTDKL DAAMLAQLAL VIDRHPDRSR
YIRHLPDEAR AVLAAMVVRR RQLNHMLVAE RNRLYPSHPQ SRKSIDNIID ALQNELDRIN
EQMKQHMTAF FQEQARLIGS VKGVGDITVA SLIAELPEQG KLNRREISAL TGVAPLNRDS
GKMRGKRTTF GGRAGVRATL NMAALVATQF NPAIKLFYQR LLAAGKPKKL ALVACMRKLI
TILNTMLRKG EEWNASFQSQ VIS
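
The relationship between the gene sequence and the protein sequence above can be spot-checked with a minimal Python sketch. It builds the codon table by hand in NCBI's TCAG ordering; translation table 11 uses the same amino acid assignments as the standard code, and its alternative start codons are not handled here because this gene begins with ATG. The names `translate`, `first_block`, and `last_block` are hypothetical helpers, and only the first and last stretches of the gene are translated as an illustration.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
first_block = translate("ATGAGTCAAC CAAATCTGCA ATGCATGGGT")
last_block = translate("GTAATCTCAT GA")
print(first_block, last_block)
```

The two results can be compared against the first ten and last three residues of the protein sequence listed above.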