Gene EcSMS35_0642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0642 
Symbol 
ID6143361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp656167 
End bp657159 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content55% 
IMG OID641615533 
ProductIS5 transposase 
Protein accessionYP_001742739 
Protein GI170681327 
COG category[L] Replication, recombination and repair 
COG ID[COG3039] Transposase and inactivated derivatives, IS5 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0956986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATC AACTCACCTT CGCCGATAGT GAATTCAGCA CTAAGCGCCG TCAGACCCGA 
AAAGAGATTT TCCTCTCCCG CATGGAGCAG ATTCTGCCAT GGCAGAATAT GACCGCTGTC
ATCGAGCCGT TTTATCCCAA GGCGGGCAAT GGCCGACGGC CCTATCCGCT GGAGACCATG
CTGCGTATTC ACTGCATGCA GCATTGGTAC AACCTGAGCG ACGGTGCCAT GGAAGATGCC
CTGTACGAAA TCGCCTCCAT GCGCCTGTTT GCCCGATTAT CCCTGGATAG CGCCCTGCCG
GATCGCACCA CCATCATGAA TTTCCGCCAC CTGCTCGAGC AGCATCAACT GGCCCGTCAA
TTGTTCAAGA CCATCAATCG CTGGCTGGCC GAAGCAGGCG TCATGATGAC CCAAGGCACT
TTGGTGGATG CCACCATCAT TGAGGCACCC AGCTCTACCA AGAACAAAGA GCAGCAACGC
GATCCGGAGA TGCATCAGAC CAAGAAAGGC AATCAGTGGC ACTTTGGCAT GAAGGCCCAC
ATTGGTGTCG ATGCCAAGAG TGGCCTGACC CACAGCCTAG TCACCACCGC GGCCAACGAG
CATGACCTCA ATCAGCTGGG TAATCTGCTT CATGGTGAGG AGCAATTTGT CTCAGCCGAT
GCCGGCTACC AAGGAGCGCC ACAGCGCGAG GAGCTGGCCG AGGTGGATGT GGACTGGCTG
ATCGCCGAGC GTCCCGGCAA GGTAAAAACC TTGAAGCAGC ATCCGCGCAA GAACAAAACG
GCCATCAACA TCGAATACAT GAAAGCCAGC ATCCGTGCCA GGGTGGAGCA CCCGTTTCGC
ATCATCAAGC GGCAGTTCGG CTTCGTGAAA GCCAGATACA AGGGGCTGCT GAAAAACGAT
AACCAACTGG CGATGTTATT CACCCTGGCC AACCTGTTTC GGGTGGACCA AATGATACGT
CAGGGGAGAG ATCTCAGTAA AAACCGGAAA TAA
 
Protein sequence
MSHQLTFADS EFSTKRRQTR KEIFLSRMEQ ILPWQNMTAV IEPFYPKAGN GRRPYPLETM 
LRIHCMQHWY NLSDGAMEDA LYEIASMRLF ARLSLDSALP DRTTIMNFRH LLEQHQLARQ
LFKTINRWLA EAGVMMTQGT LVDATIIEAP SSTKNKEQQR DPEMHQTKKG NQWHFGMKAH
IGVDAKSGLT HSLVTTAANE HDLNQLGNLL HGEEQFVSAD AGYQGAPQRE ELAEVDVDWL
IAERPGKVKT LKQHPRKNKT AINIEYMKAS IRARVEHPFR IIKRQFGFVK ARYKGLLKND
NQLAMLFTLA NLFRVDQMIR QGRDLSKNRK