Gene EcSMS35_A0133 details


Gene Information

Locus tag: EcSMS35_A0133
Symbol: sat
ID: 6106506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 102959
End bp: 104038
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 49%
IMG OID: 641614873
Product: streptothricin acetyltransferase Sat-1
Protein accession: YP_001740014
Protein GI: 170650888
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
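
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 102959
END_BP = 104038
GENE_LENGTH_BP = 1080
PROTEIN_LENGTH_AA = 359


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 1080 bp span gives 360 codons, one of which is the stop codon, which matches the listed 359 aa.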


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTCAC GCAACTGGTC CAGAACCTTG ACCGAACGCA GCGGTGGTAA CGGCGCAGTG 
GCGGTTTTCA TGGCTTGTTA TGACTGTTTT TTTGTACAGT CTATGCCTCG AGCATCCAAG
CAGCAAGCGC GTTACGCCGT GGGTCGATGT TTGATGTTAT GGAGCAGCAA CGATGTTACG
CAGCAGGGCA GTCGCCCTAA AACAAAGTTG TACGTAGAAC TCGAAGGCAA TTTAAGTATG
AAAGAAAAGG TAGTTGTTGA TAAAGCGATT TCACTCTATA CCGAATCATT CGGCGACCCG
GCCCATGAAC CCATTATTCT GATCATGGGG GCAATGTCGT CTGCGGTGTG GTGGCCTGAT
GAGTTTTGTT CCCAACTTGC CAAAATGGGT CGCTATGTGA TCCGGTACGA CCACCGTGAT
ACCGGGAAAT CAACAAGCTA TGAGCCAGGT CAGGCTCCAT ATTCCGTTGA AGAATTAGCA
GATGATGTGG TTCGCGTCAT TGATGGTTAT GGTCTGGAAG CTGCTCATTT AGTCGGCATG
TCTTTGGGGG GATTTCTTTC CCAGCTTGTA GCTCTCAAGT ATCCGAAACG TGTGAAGAGC
TTGACGCTGA TTGCTTCAGA ACGGCTTGCA GATGCAGATC CGGATATGCC CGCTTTTGAT
CCTGCCATCA TTGAGTATCA CCAACGGGCG GAATCGCTGG ATTGGTCTGA TAGAGATGCT
GTCGTCGCGT ATCAGGTCGG AGCGTGGCGA ATCAACTCAG GTACTGCGCA TGCTTTTGAC
GCTGAGAAGA TACAAAACAT CGCTGAGTTA AATTTTGATC GCACTCCGAA TATCCTGACA
ACATTCAACC ACACTACTTT AGGTGGTGGC GAGAGATGGC TCGGGAGATT AAATGAGATA
GCTGTACCAA CTTTGATCAT TCACGGCACG GAGGATCCTG TACTTCCTTA TGTGCATGGG
TTGGCACTGA AAGATGCGAT TCGTGGTTCA AAAATGCTGA CACTCGAAGG CACGGGACAT
GAGTTGCATC ATGAAGACTG GCCGAGGATT ATCCAGGCGA TTAAGGGGCA AACGTCATAG
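
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before its length or GC content can be checked against the fields in the record. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample string is only the first three blocks of the sequence above.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and newlines used for 10-base blocks; uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First three blocks only; paste the full sequence to check the record.
    sample = "ATGCGCTCAC GCAACTGGTC CAGAACCTTG"
    seq = clean_sequence(sample)
    print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the full sequence gives values to compare with the Gene Length and GC content fields.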
 
Protein sequence
MRSRNWSRTL TERSGGNGAV AVFMACYDCF FVQSMPRASK QQARYAVGRC LMLWSSNDVT 
QQGSRPKTKL YVELEGNLSM KEKVVVDKAI SLYTESFGDP AHEPIILIMG AMSSAVWWPD
EFCSQLAKMG RYVIRYDHRD TGKSTSYEPG QAPYSVEELA DDVVRVIDGY GLEAAHLVGM
SLGGFLSQLV ALKYPKRVKS LTLIASERLA DADPDMPAFD PAIIEYHQRA ESLDWSDRDA
VVAYQVGAWR INSGTAHAFD AEKIQNIAEL NFDRTPNILT TFNHTTLGGG ERWLGRLNEI
AVPTLIIHGT EDPVLPYVHG LALKDAIRGS KMLTLEGTGH ELHHEDWPRI IQAIKGQTS
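
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an assumption-laden illustration, not the database's own method. It builds the codon table from the NCBI amino-acid string shared by translation tables 1 and 11, and it ignores the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG. The name `translate` is hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base block layout can be pasted as is.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence above.
    print(translate("ATGCGCTCAC GCAACTGGTC C"))
```

For the first seven codons this yields MRSRNWS, which agrees with the start of the listed protein sequence.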