Gene EcSMS35_4002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4002 
Symbol 
ID6143699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4081597 
End bp4082760 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content44% 
IMG OID641618827 
Productphage integrase family site specific recombinase 
Protein accessionYP_001745965 
Protein GI170682095 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTCA CCGATATACA GATCAAACGT GCAAAACCAC AAGACAAGCC ATACACATTG 
AACGATGGAC AAGGTCTGTC ATTGCTTATC AATCCCGATG GCTCGAAAGG CTGGCGTTTC
CGTTTCCGGT TTGCAGGGAA AGCGCGGTTA ATGTCATTTG GCAGCTACGA TTTAGTAAGC
CTCGCAGAAG CACGTGAGAA GCGTGATATC GCCCGTAAGC AGGTTGCTAA TGGCATTGAC
CCGGTAGAGG AACGCAAAGC TTTAAGACTC GCCCAAAAGC TATCAACAGA AAATTCTTTC
GAAGCAATAT GTCGAGAATG GCATACCAAC AAAGCTGACC GCTGGACTGT GGCCTATCGA
GAAGAAATCA TTAAGACATT CGAGCAAGAT GTCTTCCCGT TCATTGGTAA ACGTCCTATC
AGTGAAATCA AACCATTAGA ACTGCTTGAA GTATTACGAC GAATCGAAAA ACGTGGAGCA
CTAGAGAAAA CACGCAAGGT GCGTCAAAGA TGCGGTGAGG TTTATCGCTA TGCAATCATA
ACTGGCCGTG CTGAGTACAA TCCTGCACCT GATTTAGCTA TCGCTCTGGC CGTTCCCAAG
CAAAAACACC ATCCATTTTT ATCCGCTGAA GAGTTGCCTC ATTTTATTCG AGATCTTGAA
GCGTATACCG GTAGCATCAT CACCAAAAAT GCTACGAAGA TAGTCATGCT GACTGGTGTA
AGAACGCAGG AGATGCGCTT TGCTACGTGG GAAGAAGTAG ACCTCGAAAA AGGTATATGG
GAGATACCAG CGGAACGTAT GAAAATGCGT AGACCTCACA TTGTTCCTTT ATCTACTCAG
GTAGTTGACC TTTTCAAACA GCTCAAACCT ATTACCGGCC ATTACCCTTA CATCTTTATT
GGCAGGAACA ACCGCAGCAA GCCAATCTCA AAAGAAAGTG TTTCACAAGT GATTGAGTTA
ATTGGCTACA AAGGCCGTGC TACAGGTCAC GGTTTTCGGC ATACCATGTC GACAATATTG
CACGAACAAG GGTTTGATAG CGCATGGATT GAAATACAAT TGGCACATGT TGATAAAAAC
AGAATCCGAG GGACTTACAA TCATGCTCAA TATCTTGAAC ATAGAAAAAA AATGATGCAA
TGGTATTCAG ATAAATTATA TTGA
 
Protein sequence
MALTDIQIKR AKPQDKPYTL NDGQGLSLLI NPDGSKGWRF RFRFAGKARL MSFGSYDLVS 
LAEAREKRDI ARKQVANGID PVEERKALRL AQKLSTENSF EAICREWHTN KADRWTVAYR
EEIIKTFEQD VFPFIGKRPI SEIKPLELLE VLRRIEKRGA LEKTRKVRQR CGEVYRYAII
TGRAEYNPAP DLAIALAVPK QKHHPFLSAE ELPHFIRDLE AYTGSIITKN ATKIVMLTGV
RTQEMRFATW EEVDLEKGIW EIPAERMKMR RPHIVPLSTQ VVDLFKQLKP ITGHYPYIFI
GRNNRSKPIS KESVSQVIEL IGYKGRATGH GFRHTMSTIL HEQGFDSAWI EIQLAHVDKN
RIRGTYNHAQ YLEHRKKMMQ WYSDKLY