Gene ECH74115_4351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4351 
Symbol 
ID6968773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4029598 
End bp4030758 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content51% 
IMG OID643388078 
Productglutathionylspermidine synthase family protein 
Protein accessionYP_002272516 
Protein GI209397034 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0754] Glutathionylspermidine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00967413 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGAG TCAGTATTAC CGAGCGCCCG GACTGGCGTG AGAAAGCCCA CGAATACGGT 
TTCAATTTTC ACACCATGTA CGGCGAGCCG TACTGGTGTG AAGATGCTTA CTACAAGTTG
ACCCTCGCCC AGGTTGAAAA GCTGGAAGAA GTCACCGCCG AACTGCACCA GATGTGCCTG
AAAGTGGTGG AAAAAGTGAT CGCCAGCGAT GAGCTGATGA CCAAATTCCG CATTCCAAAA
CACACCTGGA GTTTTGTGCG CCAGTCATGG CTGACGCACC AGCCATCGCT TTATTCGCGT
CTTGATCTGG CGTGGGATGG CACTGGCGAA CCTAAACTTC TGGAAAATAA CGCCGATACG
CCAACGTCAC TATACGAGGC GGCGTTCTTT CAGTGGATCT GGCTGGAAGA TCAGCTTAAC
GCCGGTAACT TGCCGGAGGG CAGCGACCAG TTTAACAGTC TGCAAGAAAA ACTGATCGAT
CGCTTCGTTG AGCTGCGTGA ACAGTATGGC TTCCAGTTGC TGCATCTCAC CTGCTGTCGC
GACACGGTGG AAGATCGCGG TACCATTCAG TATTTGCAGG ACTGCGCGAC GGAAGCTGAA
ATCGCTACCG AGTTCCTCTA CATTGACGAT ATCGGGTTAG GTGAAAAAGG TCAGTTCACG
GATTTACAGG ATCAGGTAAT TTCCAACCTG TTCAAGCTGT ATCCGTGGGA ATTTATGTTG
CGTGAGATGT TTTCCACCAA GCTGGAGGAT GCAGGCGTAC GCTGGCTGGA ACCGGCGTGG
AAGAGCATTA TCTCCAACAA GGCACTTCTA CCGCTACTGT GGGAGATGTT CCCGAATCAC
CCGAACCTGC TGCCCGCTTA TTTTGCGGAA GATGATCATC CGCAAATGGA AAAATATGTG
GTTAAACCGA TCTTCTCCCG TGAAGGCGCA AACGTGTCGA TCATTGAGAA CGGCAAAACC
ATTGAAGCAG CGGAAGGTCC GTATGGCGAA GAAGGGATGA TTGTTCAGCA ATTCCACCCG
TTACCGAAAT TCGGCGACAG CTATATGCTG ATTGGTAGCT GGCTGGTGAA CGATCAACCC
GCCGGAATTG GCATTCGTGA AGACCGTGCA TTGATCACCC AGGATATGTC TCGGTTTTAT
CCACATATTT TTGTTGAATA A
 
Protein sequence
MERVSITERP DWREKAHEYG FNFHTMYGEP YWCEDAYYKL TLAQVEKLEE VTAELHQMCL 
KVVEKVIASD ELMTKFRIPK HTWSFVRQSW LTHQPSLYSR LDLAWDGTGE PKLLENNADT
PTSLYEAAFF QWIWLEDQLN AGNLPEGSDQ FNSLQEKLID RFVELREQYG FQLLHLTCCR
DTVEDRGTIQ YLQDCATEAE IATEFLYIDD IGLGEKGQFT DLQDQVISNL FKLYPWEFML
REMFSTKLED AGVRWLEPAW KSIISNKALL PLLWEMFPNH PNLLPAYFAE DDHPQMEKYV
VKPIFSREGA NVSIIENGKT IEAAEGPYGE EGMIVQQFHP LPKFGDSYML IGSWLVNDQP
AGIGIREDRA LITQDMSRFY PHIFVE