Gene ECH74115_4300 details


Gene Information

Locus tag: ECH74115_4300
Symbol: gsp
ID: 6971018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3980099
End bp: 3981958
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 52%
IMG OID: 643388029
Product: bifunctional glutathionylspermidine amidase/glutathionylspermidine synthetase
Protein accession: YP_002272467
Protein GI: 209400603
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0754] Glutathionylspermidine synthase
TIGRFAM ID:
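
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the page does not state its convention) and an unspliced CDS whose single terminal stop codon is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3980099, 3981958)
print(length_bp)                  # 1860
print(protein_length(length_bp))  # 619
```

Under these assumptions the computed values agree with the listed Gene Length (1860 bp) and Protein Length (619 aa).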


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAAG GAACGACCAG CCAGGATGCC CCGTTCGGGA CATTATTGGG CTACGCCCCA 
GGTGGGGTAG CAATCTACTC TTCAGATTAC AGTTCTCTCG ATCCGCAGGA ATACGAAGAT
GACGCCGTAT TCCGTAGCTA TATCGACGAC GAATATATGG GCCACAAATG GCAATGCGTT
GAATTTGCTC GCCGTTTTCT CTTTCTGAAC TACGGTGTGG TCTTTACTGA CGTGGGTATG
GCGTGGGAGA TTTTCTCGCT GCGCTTCCTG CGTGAAGTGG TTAATGACAA CATCCTGCCA
TTGCAGGCAT TTCCTAACGG CTCGCCGCGT GCGCCGGTCG CGGGTGCGCT TCTTATCTGG
GATAAAGGCG GTGAATTTAA AGACACTGGC CATGTCGCCA TCATTACCCA ATTACATGGC
AACAAAGTCC GTATTGCGGA ACAGAACGTG ATTCATTCCC CGTTGCCGCA AGGGCAACAG
TGGACGCGCG AACTGGAGAT GGTGGTCGAA AACGGCTGCT ATACCCTTAA AGACACTTTT
GATGACACCA CCATTCTGGG CTGGATGATC CAGACGGAAG ATACTGAATA CAGCTTACCG
CAGCCGGAAA TTGCAGGCGA GCTGCTGAAA ATCAGCGGAG CGCGTCTGGA AAACAAAGGC
CAGTTTGACG GTAAATGGCT GGATGAAAAA GATCCGCTGC AAAACGCCTA TGTGCAGGCC
AACGGTCAGG TGATCAATCA GGATCCGTAT CATTACTACA CCATTACCGA GAGCGCCGAG
CAGGAGCTGA TTAAAGCCAC CAACGAGCTG CACCTGATGT ATCTTCACGC AACCGACAAG
GTGCTGAAAG ATGACAACCT GCTGGCGCTG TTCGACATTC CGAAAATCCT CTGGCCACGT
TTGCGTCTCT CCTGGCAGCG TCGCCGTCAC CATATGATCA CCGGTCGTAT GGATTTCTGC
ATGGATGAGC GTGGTCTGAA GGTTTACGAG TACAACGCCG ACTCCGCCTC CTGTCATACC
GAAGCGGGCT TGATCCTCGA ACGTTGGGCG GAGCAGGGCT ATAAAGGCAA CGGCTTCAAT
CCGGCGGAAG GGCTGATTAA CGAACTGGCT GGTGCCTGGA AACACAGTCG TGCACGTCCG
TTTGTCCATA TCATGCAGGA CAAAGATATC GAGGAAAACT ATCACGCGCA GTTTATGGAG
CAGGCGCTGC ACCAGGCGGG CTTTGAAACG CGTATCTTGC GCGGGCTGGA TGAACTGGGC
TGGGATGCTG CCGGGCAACT GATTGATGGG GAAGGGCGAC TGGTTAACTG CGTGTGGAAA
ACCTGGGCGT GGGAAACCGC GTTTGATCAG ATTCGTGAAG TTAGCGACCG TGAGTTTGCT
GCGGTGCCGA TCCGTACCGG TCATCCGCAA AATGAAGTGC GTCTTATCGA CGTATTGCTG
CGCCCGGAAG TGCTGGTCTT TGAGCCGCTG TGGACGGTGA TCCCCGGCAA CAAAGCGATT
CTGCCGATCC TCTGGTCGCT GTTCCCGCAC CATCGTTACC TGCTGGATAC CGATTTCACT
GTTAATGATG AATTGGTGAA AACAGGTTAC GCAGTGAAAC CGATCGCCGG TCGCTGTGGC
AGCAACATCG ACCTCGTCAG CCATCATGAA GAGGTGCTGG ACAAAACCAG TGGTAAATTT
GCCGAGCAGA AAAACATCTA TCAGCAACTG TGGTGTTTGC CGAAAGTGGA CGGTAAATAC
ATTCAGGTAT GTACCTTCAC CGTTGGCGGC AACTACGGTG GGACGTGTTT GCGCGGTGAT
GAATCACTGG TTATTAAAAA AGAGAGTGAT ATTGAACCGT TAATTGTGGT GAAAGAGTAA
 
Protein sequence
MSKGTTSQDA PFGTLLGYAP GGVAIYSSDY SSLDPQEYED DAVFRSYIDD EYMGHKWQCV 
EFARRFLFLN YGVVFTDVGM AWEIFSLRFL REVVNDNILP LQAFPNGSPR APVAGALLIW
DKGGEFKDTG HVAIITQLHG NKVRIAEQNV IHSPLPQGQQ WTRELEMVVE NGCYTLKDTF
DDTTILGWMI QTEDTEYSLP QPEIAGELLK ISGARLENKG QFDGKWLDEK DPLQNAYVQA
NGQVINQDPY HYYTITESAE QELIKATNEL HLMYLHATDK VLKDDNLLAL FDIPKILWPR
LRLSWQRRRH HMITGRMDFC MDERGLKVYE YNADSASCHT EAGLILERWA EQGYKGNGFN
PAEGLINELA GAWKHSRARP FVHIMQDKDI EENYHAQFME QALHQAGFET RILRGLDELG
WDAAGQLIDG EGRLVNCVWK TWAWETAFDQ IREVSDREFA AVPIRTGHPQ NEVRLIDVLL
RPEVLVFEPL WTVIPGNKAI LPILWSLFPH HRYLLDTDFT VNDELVKTGY AVKPIAGRCG
SNIDLVSHHE EVLDKTSGKF AEQKNIYQQL WCLPKVDGKY IQVCTFTVGG NYGGTCLRGD
ESLVIKKESD IEPLIVVKE
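
The sequences above are laid out in blocks of ten characters separated by spaces and line breaks. Before use in most tools, that layout has to be stripped. The following is a minimal sketch of doing so and of computing a GC fraction for a nucleotide string. The helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence, as displayed on the page.
sample = clean_sequence("ATGAGCAAAG GAACGACCAG")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.5
```

Applying the same two helpers to the full gene sequence gives a value that can be compared with the GC content field in the record.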