Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4300 |
Symbol | gsp |
ID | 6971018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3980099 |
End bp | 3981958 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388029 |
Product | bifunctional glutathionylspermidine amidase/glutathionylspermidine synthetase |
Protein accession | YP_002272467 |
Protein GI | 209400603 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0754] Glutathionylspermidine synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAG GAACGACCAG CCAGGATGCC CCGTTCGGGA CATTATTGGG CTACGCCCCA GGTGGGGTAG CAATCTACTC TTCAGATTAC AGTTCTCTCG ATCCGCAGGA ATACGAAGAT GACGCCGTAT TCCGTAGCTA TATCGACGAC GAATATATGG GCCACAAATG GCAATGCGTT GAATTTGCTC GCCGTTTTCT CTTTCTGAAC TACGGTGTGG TCTTTACTGA CGTGGGTATG GCGTGGGAGA TTTTCTCGCT GCGCTTCCTG CGTGAAGTGG TTAATGACAA CATCCTGCCA TTGCAGGCAT TTCCTAACGG CTCGCCGCGT GCGCCGGTCG CGGGTGCGCT TCTTATCTGG GATAAAGGCG GTGAATTTAA AGACACTGGC CATGTCGCCA TCATTACCCA ATTACATGGC AACAAAGTCC GTATTGCGGA ACAGAACGTG ATTCATTCCC CGTTGCCGCA AGGGCAACAG TGGACGCGCG AACTGGAGAT GGTGGTCGAA AACGGCTGCT ATACCCTTAA AGACACTTTT GATGACACCA CCATTCTGGG CTGGATGATC CAGACGGAAG ATACTGAATA CAGCTTACCG CAGCCGGAAA TTGCAGGCGA GCTGCTGAAA ATCAGCGGAG CGCGTCTGGA AAACAAAGGC CAGTTTGACG GTAAATGGCT GGATGAAAAA GATCCGCTGC AAAACGCCTA TGTGCAGGCC AACGGTCAGG TGATCAATCA GGATCCGTAT CATTACTACA CCATTACCGA GAGCGCCGAG CAGGAGCTGA TTAAAGCCAC CAACGAGCTG CACCTGATGT ATCTTCACGC AACCGACAAG GTGCTGAAAG ATGACAACCT GCTGGCGCTG TTCGACATTC CGAAAATCCT CTGGCCACGT TTGCGTCTCT CCTGGCAGCG TCGCCGTCAC CATATGATCA CCGGTCGTAT GGATTTCTGC ATGGATGAGC GTGGTCTGAA GGTTTACGAG TACAACGCCG ACTCCGCCTC CTGTCATACC GAAGCGGGCT TGATCCTCGA ACGTTGGGCG GAGCAGGGCT ATAAAGGCAA CGGCTTCAAT CCGGCGGAAG GGCTGATTAA CGAACTGGCT GGTGCCTGGA AACACAGTCG TGCACGTCCG TTTGTCCATA TCATGCAGGA CAAAGATATC GAGGAAAACT ATCACGCGCA GTTTATGGAG CAGGCGCTGC ACCAGGCGGG CTTTGAAACG CGTATCTTGC GCGGGCTGGA TGAACTGGGC TGGGATGCTG CCGGGCAACT GATTGATGGG GAAGGGCGAC TGGTTAACTG CGTGTGGAAA ACCTGGGCGT GGGAAACCGC GTTTGATCAG ATTCGTGAAG TTAGCGACCG TGAGTTTGCT GCGGTGCCGA TCCGTACCGG TCATCCGCAA AATGAAGTGC GTCTTATCGA CGTATTGCTG CGCCCGGAAG TGCTGGTCTT TGAGCCGCTG TGGACGGTGA TCCCCGGCAA CAAAGCGATT CTGCCGATCC TCTGGTCGCT GTTCCCGCAC CATCGTTACC TGCTGGATAC CGATTTCACT GTTAATGATG AATTGGTGAA AACAGGTTAC GCAGTGAAAC CGATCGCCGG TCGCTGTGGC AGCAACATCG ACCTCGTCAG CCATCATGAA GAGGTGCTGG ACAAAACCAG TGGTAAATTT GCCGAGCAGA AAAACATCTA TCAGCAACTG TGGTGTTTGC CGAAAGTGGA CGGTAAATAC ATTCAGGTAT GTACCTTCAC CGTTGGCGGC AACTACGGTG GGACGTGTTT GCGCGGTGAT GAATCACTGG TTATTAAAAA AGAGAGTGAT ATTGAACCGT TAATTGTGGT GAAAGAGTAA
|
Protein sequence | MSKGTTSQDA PFGTLLGYAP GGVAIYSSDY SSLDPQEYED DAVFRSYIDD EYMGHKWQCV EFARRFLFLN YGVVFTDVGM AWEIFSLRFL REVVNDNILP LQAFPNGSPR APVAGALLIW DKGGEFKDTG HVAIITQLHG NKVRIAEQNV IHSPLPQGQQ WTRELEMVVE NGCYTLKDTF DDTTILGWMI QTEDTEYSLP QPEIAGELLK ISGARLENKG QFDGKWLDEK DPLQNAYVQA NGQVINQDPY HYYTITESAE QELIKATNEL HLMYLHATDK VLKDDNLLAL FDIPKILWPR LRLSWQRRRH HMITGRMDFC MDERGLKVYE YNADSASCHT EAGLILERWA EQGYKGNGFN PAEGLINELA GAWKHSRARP FVHIMQDKDI EENYHAQFME QALHQAGFET RILRGLDELG WDAAGQLIDG EGRLVNCVWK TWAWETAFDQ IREVSDREFA AVPIRTGHPQ NEVRLIDVLL RPEVLVFEPL WTVIPGNKAI LPILWSLFPH HRYLLDTDFT VNDELVKTGY AVKPIAGRCG SNIDLVSHHE EVLDKTSGKF AEQKNIYQQL WCLPKVDGKY IQVCTFTVGG NYGGTCLRGD ESLVIKKESD IEPLIVVKE
|
| |