Gene EcSMS35_2257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2257 
Symbol 
ID6145242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2278306 
End bp2279298 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content55% 
IMG OID641617133 
ProductIS5 transposase 
Protein accessionYP_001744306 
Protein GI170680700 
COG category[L] Replication, recombination and repair 
COG ID[COG3039] Transposase and inactivated derivatives, IS5 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.705619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATC AACTCACCTT CGCCGATAGT GAATTCAGCA CTAAGCGCCG TCAGACCCGA 
AAAGAGATTT TCCTCTCCCG CATGGAGCAG ATTCTGCCAT GGCAGAATAT GACCGCTGTC
ATCGAGCCGT TTTATCCCAA GGCGGGCAAT GGCCGACGGC CCTATCCGCT GGAGACCATG
CTGCGTATTC ACTGCATGCA GCATTGGTAC AACCTGAGCG ACGGTGCCAT GGAAGATGCC
CTGTACGAAA TCGCCTCCAT GCGCCTGTTT GCCCGATTAT CCCTGGATAG CGCCCTGCCG
GATCGCACCA CCATCATGAA TTTCCGCCAC CTGCTCGAGC AGCATCAACT GGCCCGTCAA
TTGTTCAAGA CCATCAATCG CTGGCTGGCC GAAGCAGGCG TCATGATGAC CCAAGGCACT
TTGGTGGATG CCACCATCAT TGAGGCACCC AGCTCTACCA AGAACAAAGA GCAGCAACGC
GATCCGGAGA TGCATCAGAC CAAGAAAGGC AATCAGTGGC ACTTTGGCAT GAAGGCCCAC
ATTGGTGTCG ATGCCAAGAG TGGCCTGACC CACAGCCTAG TCACCACCGC GGCCAACGAG
CATGACCTCA ATCAGCTGGG TAATCTGCTT CATGGTGAGG AGCAATTTGT CTCAGCCGAT
GCCGGCTACC AAGGAGCGCC ACAGCGCGAG GAGCTGGCCG AGGTGGATGT GGACTGGCTG
ATCGCCGAGC GTCCCGGCAA GGTAAAAACC TTGAAGCAGC ATCCGCGCAA GAACAAAACG
GCCATCAACA TCGAATACAT GAAAGCCAGC ATCCGTGCCA GGGTGGAGCA CCCGTTTCGC
ATCATCAAGC GGCAGTTCGG CTTCGTGAAA GCCAGATACA AGGGGCTGCT GAAAAACGAT
AACCAACTGG CGATGTTATT CACCCTGGCC AACCTGTTTC GGGTGGACCA AATGATACGT
CAGGGGAGAG ATCTCAGTAA AAACCGGAAA TAA
 
Protein sequence
MSHQLTFADS EFSTKRRQTR KEIFLSRMEQ ILPWQNMTAV IEPFYPKAGN GRRPYPLETM 
LRIHCMQHWY NLSDGAMEDA LYEIASMRLF ARLSLDSALP DRTTIMNFRH LLEQHQLARQ
LFKTINRWLA EAGVMMTQGT LVDATIIEAP SSTKNKEQQR DPEMHQTKKG NQWHFGMKAH
IGVDAKSGLT HSLVTTAANE HDLNQLGNLL HGEEQFVSAD AGYQGAPQRE ELAEVDVDWL
IAERPGKVKT LKQHPRKNKT AINIEYMKAS IRARVEHPFR IIKRQFGFVK ARYKGLLKND
NQLAMLFTLA NLFRVDQMIR QGRDLSKNRK