Gene EcSMS35_3131 details


Gene Information

Locus tag: EcSMS35_3131
Symbol:
ID: 6142962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3218016
End bp: 3219047
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 52%
IMG OID: 641617995
Product: IS630 transposase
Protein accession: YP_001745145
Protein GI: 170681851
COG category: [L] Replication, recombination and repair
COG ID: [COG3335] Transposase and inactivated derivatives
TIGRFAM ID:
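
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive. A minimal Python check under that assumption, and under the further assumption that the final codon is a stop codon and is not counted as a residue (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3218016
end_bp = 3219047

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1032
print(protein_length)  # 343
```

Both results match the reported Gene Length (1032 bp) and Protein Length (343 aa).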


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATCA TAGCACCAAT TTCCCGTGAC GAACGACGCC TGATGCAGAA AGCCATCCAT 
AAAACACACG ATAAAAATTA TGCCCGCAGA CTGACTGCCA TGCTGATGCT GCACCGGGGC
GACCGTGTCA GCGACGTTGC CAGAACGCTC TGCTGCGCCC GTTCCTCTGT TGGATGTTGG
ATTAACTGGT TCACGCAGTC GGGTGTTGAG GGACTGAAAT CATTACCTGC CGGGCGAGCC
CGTCTCTGGT CGTTTGAGCA TATCTGCACA CTGTTACGTG AGCTGGTAAA ACATTCTCCC
GGCGACTTTG GCTACCAGCG TTCACGCTGG AGTACAGAAC TGCTGGCAAT AAAAATCAAT
GAGATAACCG ATTGCCAGTT AAATGCCGGA ACCGTTCGCC GCTGGTTGCC GTCTGCGGGG
ATTGTGTGGC GAAGGGCTGC GCCAACTCTG CGTATCCGTG ACCCGCATAA AGATGAAAAG
ATGGCAGCAA TCCATAAAGC ACTGGACGAA TGCAGCACAG AGCATCCGGT CTTTTATGAA
GATGAAGTGG ATATCCATCT TAATCCCAAA ATCGGTGCGG ACTGGCAACT GCGCGGACAG
CAAAAACGGG TGGTCACGCC GGGACAGAAT GAAAAATATG ATCTGGCCGG AGCGCTGCAC
AGCGGGACAG GTAAAGTCAG CTATGTGGGC GGCAACAGCA AAAGTTCGGC GCTGTTCATC
AGCCTGCTGA AGCGGCTTAA AGCGACATAC CTTCGGGTGA AAACCATCAC ACTGATCGTG
GACAACTACA TTATCCACAA AAGCCGGGAA ACACAGCGCT GGTTGAAGGA GAACCCGAAG
TTCAGGGTCA TTTATCAGCC GGTTTACTTG CCATGGGTGA ATCATGTTGA ACGGCTATGG
CAGGCACTTC ACGACACAAT AACGCGTAAT CATCAGTGCC GCTCAATGTG GCAACTGTTG
AAAAAAATTC GCCATTTTAT GGAAACCATC AGCCCGTTCC CCGGAGGCAA ACATGGGCTG
GCAAAAGTGT AG
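
The GC content field of this record can be recomputed from the gene sequence. The sketch below is one way to do that in Python; the helper names `clean` and `gc_percent` are hypothetical, and only the first line of the sequence is used to keep the example short.

```python
def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# First line of the gene sequence, used only as a short example.
# Pass the full 1032 bp sequence to compare with the reported GC content.
first_line = "ATGCCGATCA TAGCACCAAT TTCCCGTGAC GAACGACGCC TGATGCAGAA AGCCATCCAT"
print(len(clean(first_line)))
print(round(gc_percent(first_line), 1))
```

The value for a single line will generally differ from the whole-gene figure, which is reported as a rounded percentage.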
 
Protein sequence
MPIIAPISRD ERRLMQKAIH KTHDKNYARR LTAMLMLHRG DRVSDVARTL CCARSSVGCW 
INWFTQSGVE GLKSLPAGRA RLWSFEHICT LLRELVKHSP GDFGYQRSRW STELLAIKIN
EITDCQLNAG TVRRWLPSAG IVWRRAAPTL RIRDPHKDEK MAAIHKALDE CSTEHPVFYE
DEVDIHLNPK IGADWQLRGQ QKRVVTPGQN EKYDLAGALH SGTGKVSYVG GNSKSSALFI
SLLKRLKATY LRVKTITLIV DNYIIHKSRE TQRWLKENPK FRVIYQPVYL PWVNHVERLW
QALHDTITRN HQCRSMWQLL KKIRHFMETI SPFPGGKHGL AKV
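
The protein sequence corresponds to the gene sequence read in frame with translation table 11. A minimal Python sketch of that translation follows, assuming the amino acid assignments of NCBI table 11 (bacterial, archaeal and plant plastid code); `translate` is a hypothetical helper, and the sketch does not handle alternative start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acid assignments of NCBI translation table 11, with codons
# enumerated in T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    s = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCGATCA TAGCACCAAT TTCCCGTGAC"))  # MPIIAPISRD
# Last 12 bases of the gene sequence (in frame, ending in the TAG stop codon).
print(translate("GCAAAAGTGT AG"))  # AKV
```

The two outputs match the first ten and last three residues of the protein sequence.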