Gene EcSMS35_3332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3332 
Symbol 
ID6145912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3408943 
End bp3410103 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content51% 
IMG OID641618161 
Productglutathionylspermidine synthase family protein 
Protein accessionYP_001745311 
Protein GI170681476 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0754] Glutathionylspermidine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGAG TCAGTATTAC CGAGCGCCCG GACTGGCGCG AAAAAGCCCA CGAATACGGT 
TTCAATTTTC ACACCATGTA CGGCGAGCCG TACTGGTGTG AAGATGCTTA CTACAAGTTG
ACCCTCGCTC AGGTTGAAAA GCTGGAAGAA GTCACCGCCG AACTGCACCA GATGTGCCTG
AAAGTGGTGG AAAAAGTGAT CGCCAGCGAT GAGCTGATGA CCAAATTCCG CATACCAAAA
CACACCTGGA GTTTTGTGCG CCAGTCATGG CTGACGCACC AGCCATCGCT TTATTCGCGT
CTTGATCTGG CGTGGGATGG CACTGGCGAA CCTAAGCTTC TGGAAAATAA CGCCGATACG
CCAACGTCAC TATACGAGGC GGCGTTCTTT CAGTGGATCT GGCTGGAAGA TCAGCTTAAC
GCCGGTAACT TGCCGGAGGG CAGCGACCAG TTTAACAGTT TGCAAGAAAA GCTGATCGAT
CGCTTCGTTG AGCTGCGTGA ACAGTATGGC TTCCAGTTGC TGCATCTCAC CTGCTGTCGC
GACACGGTGG AAGATCGCGG AACCATTCAG TATTTGCAGG ACTGCGCGAC GGAAGCTGAA
ATCGCTACCG AGTTCCTCTA CATCGACGAT ATCGGGTTAG GTGAAAAAGG TCAGTTCACG
GATTTACAGG ATCAGGTGAT TTCCAACCTG TTCAAGCTGT ATCCGTGGGA ATTTATGTTG
CGTGAGATGT TCTCCACCAA GCTGGAGGAT GCAGGCGTAC GCTGGCTGGA ACCGGCGTGG
AAGAGCATTA TCTCCAACAA AGCGCTTCTA CCGCTACTGT GGGAGATGTT CCCGAATCAC
CCGAACCTGC TGCCCGCTTA TTTTGCGGAA GATGATCATC CGCAAATGGA AAAATATGTG
GTTAAACCGA TCTTCTCCCG TGAAGGCGCA AACGTGTCGA TCATTGAGAA CGGCAAAACC
ATTGAAGCAG CGGAAGGTCC GTATGGCGAA GAAGGGATGA TTGTTCAGCA ATTCCACCCG
TTACCGAAAT TCGGCGACAG CTATATGCTG ATTGGTAGCT GGCTGGTGAA CGATCAACCT
GCCGGAATTG GCATTCGTGA AGACCGTGCA TTGATCACCC AGGATATGTC TCGGTTTTAT
CCGCATATTT TTGTTGAGTA A
 
Protein sequence
MERVSITERP DWREKAHEYG FNFHTMYGEP YWCEDAYYKL TLAQVEKLEE VTAELHQMCL 
KVVEKVIASD ELMTKFRIPK HTWSFVRQSW LTHQPSLYSR LDLAWDGTGE PKLLENNADT
PTSLYEAAFF QWIWLEDQLN AGNLPEGSDQ FNSLQEKLID RFVELREQYG FQLLHLTCCR
DTVEDRGTIQ YLQDCATEAE IATEFLYIDD IGLGEKGQFT DLQDQVISNL FKLYPWEFML
REMFSTKLED AGVRWLEPAW KSIISNKALL PLLWEMFPNH PNLLPAYFAE DDHPQMEKYV
VKPIFSREGA NVSIIENGKT IEAAEGPYGE EGMIVQQFHP LPKFGDSYML IGSWLVNDQP
AGIGIREDRA LITQDMSRFY PHIFVE