Gene EcSMS35_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1079 
Symbol 
ID6143142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1094513 
End bp1095496 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content51% 
IMG OID641615966 
ProductISL3 family transposase 
Protein accessionYP_001743158 
Protein GI170680811 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.464356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTCTCC GTTGCAGTGC TGATACTCTT CTTCGCAGGC TTATCAATAC CCCGGAGACG 
AAACAGTCAG GCGCGCCTCA TGTCGGTATT GATGAGTGGG CGTGGCATCG GGGCCACTGC
TGCGGTATGT TAATAGTCAA TCCTGATACT CACCGTCCCC TCGTCCTGCT TCCCGGCCGT
GATCAGCGTA CGCTGGCGAC CTGGTTCAGA AAATATCCGG AAATACAGGT TGTCTCGCGT
GATCGCAGTG GAGTCTATGC GACAGCAGCA CGTGAAGGTG CACCTCAGGC CAGACAGGTG
GCCGATCGAT GGCACCTGCT AAAAAATATT GGTGATGAGC CTGAACGAAT GATGTACAGA
CATATGCCTC TGATACGTCT TGTTGTCAGA GAGTTATCAC TGAAGAAATC ACCTGAGCCA
GAAATATCTG TGCCTGTAGC ATCGCTCCGT CGTCTGGAAC GCCTTAAACA GCACATCCGC
AAAAAACGGC ATCAGCGTTG GACAGAGGTT ATGGCCCTGC ATAACAAGGG ATGTAGTTTC
AGGGAAATAT CCCGTATTAC AGGCCTGTCG CGAGTGACAG TCAGTCGCTG GGTGGGTTCA
GGAACATTCC CTGAAATGTC AACCAGGCCT CCAAAGCGAG GGCTTCTGGA CCCATGGAGG
GAGTGGTTAA AAGAGCAACG AGAATGTGGT AATTATAACT CCGGCCGGAT ATGGCGGGAA
ATGGTGGCCA GGGGGGTTAC AGGCAGTGAA ACCATCGTCA GGGATGCTGT TGCCAAATGG
CATAAAGGCT GGATCCCACC GGTTACTACT GCCGCAAGAC TTCCTTCAGT GTCCCGGGTA
AGCCGCTGGT TGATGCCCTG GAGAATAATC AGGGGTGAAG AAAATTATGC TTTCCGATTT
ATTAGTCTGA TGTGTGAAAA AGAACCGGAG TTGAAAATAG CGCAGCAACT GGTACTCGAG
TTCTACCGTA TTCTGAAAAC CTAA
 
Protein sequence
MGLRCSADTL LRRLINTPET KQSGAPHVGI DEWAWHRGHC CGMLIVNPDT HRPLVLLPGR 
DQRTLATWFR KYPEIQVVSR DRSGVYATAA REGAPQARQV ADRWHLLKNI GDEPERMMYR
HMPLIRLVVR ELSLKKSPEP EISVPVASLR RLERLKQHIR KKRHQRWTEV MALHNKGCSF
REISRITGLS RVTVSRWVGS GTFPEMSTRP PKRGLLDPWR EWLKEQRECG NYNSGRIWRE
MVARGVTGSE TIVRDAVAKW HKGWIPPVTT AARLPSVSRV SRWLMPWRII RGEENYAFRF
ISLMCEKEPE LKIAQQLVLE FYRILKT