Gene EcSMS35_A0120 details


Gene Information

Locus tag: EcSMS35_A0120
Symbol:
ID: 6106580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 90229
End bp: 91242
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 61%
IMG OID: 641614862
Product: integrase/recombinase
Protein accession: YP_001740003
Protein GI: 170650836
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID: [TIGR02249] integron integrase
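
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 90229
end_bp = 91242

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1014 337
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.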


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.400978
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCG CCACTGCGCC GTTACCACCG CTGCGTTCGG TCAAGGTTCT GGACCAGTTG 
CGTGAGCGCA TACGCTACTT GCATTACAGT TTACGAACCG AACAGGCTTA TGTCCACTGG
GTTCGTGCCT TCATCCGTTT CCACGGTGTG CGTCACCCGG CAACCTTGGG CAGCAGCGAA
GTCGAGGCAT TTCTGTCCTG GCTGGCGAAC GAGCGCAAGG TTTCGGTCTC CACGCATCGT
CAGGCATTGG CGGCCTTGCT GTTCTTCTAC GGCAAGGTGC TGTGCACGGA TCTGCCCTGG
CTTCAGGAGA TCGGAAGACC TCGGCCGTCG CGGCGCTTGC CGGTGGTGCT GACCCCGGAT
GAAGTGGTTC GCATCCTCGG TTTTCTGGAA GGCGAGCATC GTTTGTTCGC CCAGCTTCTG
TATGGAACGG GCATGCGGAT CAGTGAGGGT TTGCAACTGC GGGTCAAGGA TCTGGATTTC
GATCACGGCA CGATCATCGT GCGGGAGGGC AAGGGCTCCA AGGATCGGGC CTTGATGTTA
CCCGAGAGCT TGGCACCCAG CCTGCGCGAG CAGCTGTCGC GTGCACGGGC ATGGTGGCTG
AAGGACCAGG CCGAGGGCCG CAGCGGCGTT GCGCTTCCCG ACGCCCTTGA GCGGAAGTAT
CCGCGCGCCG GGCATTCCTG GCCGTGGTTC TGGGTTTTTG CGCAGCACAC GCATTCGACC
GATCCACGGA GCGGTGTCGT GCGTCGCCAT CACATGTATG ACCAGACCTT TCAGCGCGCC
TTCAAACGTG CCGTAGAACA AGCAGGCATC ACGAAGCCCG CCACACCGCA CACCCTCCGC
CACTCGTTCG CGACGGCCTT GCTCCGCAGC GGTTACGACA TTCGAACCGT GCAGGATCTG
CTCGGCCATT CCGACGTCTC TACGACGATG ATTTACACGC ATGTGCTGAA AGTTGGCGGT
GCCGGAGTGC GCTCACCGCT TGATGCGCTG CCGCCCCTCA CTAGTGAGAG GTAG
 
Protein sequence
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE 
VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW LQEIGRPRPS RRLPVVLTPD
EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML
PESLAPSLRE QLSRARAWWL KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST
DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER
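
The sequences above are printed in groups of ten characters, so they need to be joined before any length or composition check. The following is a rough sketch of one way to do that in Python. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of any tool associated with this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as a small example.
first_line = "ATGAAAACCG CCACTGCGCC GTTACCACCG CTGCGTTCGG TCAAGGTTCT GGACCAGTTG"
seq = clean_sequence(first_line)

print(len(seq))         # prints: 60
print(gc_percent(seq))  # GC percentage of this first line only
```

Applied to the full gene sequence, the joined length and GC percentage can be compared with the Gene Length and GC content fields of the record.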