Gene ECH74115_4241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4241 
SymbolspeA 
ID6968274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3929818 
End bp3931716 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content54% 
IMG OID643387979 
Productarginine decarboxylase 
Protein accessionYP_002272418 
Protein GI209398776 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1166] Arginine decarboxylase (spermidine biosynthesis) 
TIGRFAM ID[TIGR01273] arginine decarboxylase, biosynthetic 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCCC AGGAAGCCAG CAAGATGCTG CGTACTTACA ATATTGCCTG GTGGGGCAAT 
AACTACTATG ACGTTAACGA GCTGGGCCAC ATCAGCGTGT GCCCGGACCC GGACGTCCCG
GAAGCTCGCG TCGATCTCGC GCAGTTAGTG AAAACTCGTG AAGCACAGGG TCAGCGTCTG
CCTGCACTGT TCTGTTTCCC ACAGATCCTG CAGCACCGTT TGCGTTCCAT TAACGCCGCG
TTCAAACGTG CGCGGGAATC CTACGGCTAT AACGGCGATT ACTTCCTTGT TTATCCGATC
AAAGTTAACC AGCACCGTCG CGTGATTGAG TCCCTGATTC ATTCGGGCGA ACCGCTGGGT
CTGGAAGCCG GTTCCAAAGC CGAGTTGATG GCAGTGCTGG CACATGCTGG CATGACCCGT
AGCGTCATCG TCTGCAATGG TTATAAAGAC CGCGAATATA TCCGCCTGGC ATTAATTGGC
GAGAAGATGG GGCACAAGGT CTATCTGGTC ATTGAGAAGA TGTCAGAAAT CGCCATTGTG
CTGGATGAAG CAGAACGTCT GAATGTCGTT CCTCGTCTGG GCGTGCGTGC ACGTCTGGCT
TCGCAGGGCT CCGGTAAATG GCAGTCCTCC GGCGGGGAAA AATCGAAGTT CGGCCTGGCG
GCGACTCAGG TACTGCAACT GGTTGAAACC CTACGTGAAG CCGGGCGTCT CGACAGCCTG
CAACTACTGC ACTTCCACCT CGGTTCGCAG ATGGCGAATA TTCGCGATAT CGCGACAGGC
GTTCGTGAAT CCGCGCGTTT CTATGTTGAG CTGCACAAGC TGGGCGTCAA TATTCAGTGC
TTCGACGTCG GCGGCGGTCT GGGCGTGGAT TATGAAGGTA CTCGTTCGCA GTCCGACTGT
TCGGTGAACT ACGGCCTCAA TGAATATGCC AACAACATCA TCTGGGCGAT TGGTGATGCG
TGTGAAGAAA ACGGTCTGCC GCATCCGACG GTAATCACCG AATCGGGTCG TGCAGTGACT
GCGCATCACA CCGTGCTGGT GTCTAATATC ATCGGCGTGG AACGTAACGA ATACACGGTG
CCGACCGCGC CTGCAGAAGA TGCGCCGCGC GCGCTGCAAA GCATGTGGGA AACCTGGCAG
GAGATGCACG AACCGGGAAC TCGCCGTTCT CTGCGTGAAT GGTTACACGA CAGTCAGATG
GATCTGCACG ACATTCATAT CGGCTACTCT TCCGGCACCT TTAGCCTGCA AGAACGTGCA
TGGGCTGAGC AGCTTTATTT GAGCATGTGC CATGAAGTGC AAAAGCAGCT GGATCCGCAA
AACCGTGCTC ATCGTCCGAT TATCGACGAG CTGCAGGAAC GTATGGCGGA CAAAATGTAC
GTCAACTTCT CGCTGTTCCA GTCGATGCCG GATGCGTGGG GGATCGACCA GTTGTTCCCG
GTTCTGCCGC TGGAAGGGCT GGATCAAGTG CCGGAACGCC GCGCTGTGCT GCTGGATATT
ACCTGTGACT CTGACGGTGC TATCGACCAC TATATCGATG GTGATGGTAT TGCCACGACA
ATGCCTATGC CGGAGTACGA TCCAGAGAAT CCGCCGATGC TCGGTTTCTT TATGGTCGGC
GCATATCAGG AGATCCTCGG CAACATGCAC AACCTGTTCG GTGATACCGA AGCGGTTGAC
GTGTTCGTCT TCCCTGACGG TAGTGTAGAA GTGGAACTGT CTGACGAAGG CGATACCGTG
GCGGACATGC TGCAATATGT ACAGCTCGAT CCGAAAACGC TGTTAACTCA GTTCCGTGAT
CAAGTGAAGA AAACCGATCT TGATGCTGAA CTGCAACAAC AGTTCCTTGA AGAGTTCGAG
GCAGGTTTGT ACGGTTATAC CTATCTTGAA GATGAATAA
 
Protein sequence
MSSQEASKML RTYNIAWWGN NYYDVNELGH ISVCPDPDVP EARVDLAQLV KTREAQGQRL 
PALFCFPQIL QHRLRSINAA FKRARESYGY NGDYFLVYPI KVNQHRRVIE SLIHSGEPLG
LEAGSKAELM AVLAHAGMTR SVIVCNGYKD REYIRLALIG EKMGHKVYLV IEKMSEIAIV
LDEAERLNVV PRLGVRARLA SQGSGKWQSS GGEKSKFGLA ATQVLQLVET LREAGRLDSL
QLLHFHLGSQ MANIRDIATG VRESARFYVE LHKLGVNIQC FDVGGGLGVD YEGTRSQSDC
SVNYGLNEYA NNIIWAIGDA CEENGLPHPT VITESGRAVT AHHTVLVSNI IGVERNEYTV
PTAPAEDAPR ALQSMWETWQ EMHEPGTRRS LREWLHDSQM DLHDIHIGYS SGTFSLQERA
WAEQLYLSMC HEVQKQLDPQ NRAHRPIIDE LQERMADKMY VNFSLFQSMP DAWGIDQLFP
VLPLEGLDQV PERRAVLLDI TCDSDGAIDH YIDGDGIATT MPMPEYDPEN PPMLGFFMVG
AYQEILGNMH NLFGDTEAVD VFVFPDGSVE VELSDEGDTV ADMLQYVQLD PKTLLTQFRD
QVKKTDLDAE LQQQFLEEFE AGLYGYTYLE DE