Gene EcSMS35_4307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4307 
Symbol 
ID6147078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4408933 
End bp4409913 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content47% 
IMG OID641619128 
Productphage integrase family site specific recombinase 
Protein accessionYP_001746252 
Protein GI170680409 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0756066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.00146275 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAATTA AGAAGCTCGA TGATGGACGC TATGAAGTGG ACATTAGACC TCGCGGTCGC 
GACGGAAAAC GCATCCGCAG GAAATTTGAA AGAAAAGCTG AGGCTGTAGC ATTTGAGCGA
TACACAATCG CCTACGCCAG CCAGAAAGAA TGGGCAGGTC AGCGAGCAGA TCGCAGAACT
TTGAGTGAGT TGCTGAACAT CTGGTGGAAA TATCACGGGC AAAACCACGA GCATGGAACA
AAAGAGTTTA ATCATCTGCT CAAAACCATC AGCGGCATAG GTGATATACC AGTGAGCCGG
ATGAGCAAAA GAGCTTTGAT GGATTATCGT TCCATGCGAC TACGTGATGG TATCAGTGCC
GCAACGATAA ACCGTGACAT GTACCGATTA TCCGGCATGT TCACAAAATT AATTCAATTG
GATGAATTTT CCGGGCAACA CCCAATTCAC GGACTGCCGC CACTGGCGGA GGCCAACCCT
GAAATGACGT TCCTGGAAAA AGCAGAAATC GAAAAACTGT TAAATGTTTT GGATGGTGAT
GACTTACTTG TCGCACTTTT ATGTCTGAGC ACTGGAGGAA GATGGACGGA AGTTGCCACG
CTAAAACCAG CACAGATTAC AAATTGCAGG GTTACCTTCC TGAAAACCAA AAACGGTAAA
AAGCGAACCG TGCCGATTTC TGAGGAACTG GAGAAAAAAG TTAAAGAGGA GGCCAGCGCT
AAATTATTCA AAGTTGATTA TGAGAAGTTT TGCGGGATTT TACGCAGAGT GAAGCCAGAT
ATACCACCCA ATCAGGCAAC CCACATCCTG CGGCATACAT TCGCAAGCCA TTTCATGATG
AATGGGGGCA ATATAATCGC ACTGCAACAG ATTCTGGGAC ATGCGAGCAT TCAGCAGACG
ATGGCCTATG CGCACCTTGC GCCTGACTAC CTGCAAAATG CCGTCGCGCT GAATCCTCTA
AAAGGCGGAG TGACGTTATA A
 
Protein sequence
MSIKKLDDGR YEVDIRPRGR DGKRIRRKFE RKAEAVAFER YTIAYASQKE WAGQRADRRT 
LSELLNIWWK YHGQNHEHGT KEFNHLLKTI SGIGDIPVSR MSKRALMDYR SMRLRDGISA
ATINRDMYRL SGMFTKLIQL DEFSGQHPIH GLPPLAEANP EMTFLEKAEI EKLLNVLDGD
DLLVALLCLS TGGRWTEVAT LKPAQITNCR VTFLKTKNGK KRTVPISEEL EKKVKEEASA
KLFKVDYEKF CGILRRVKPD IPPNQATHIL RHTFASHFMM NGGNIIALQQ ILGHASIQQT
MAYAHLAPDY LQNAVALNPL KGGVTL