Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3081 |
Symbol | speA |
ID | 6142594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3170821 |
End bp | 3172719 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617949 |
Product | arginine decarboxylase |
Protein accession | YP_001745100 |
Protein GI | 170679666 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1166] Arginine decarboxylase (spermidine biosynthesis) |
TIGRFAM ID | [TIGR01273] arginine decarboxylase, biosynthetic |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.602145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00168239 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCTCCC AGGAAGCCAG CAAGATGCTG CGTACTTACA ATATTGCCTG GTGGGGCAAT AACTACTATG ACGTTAACGA GCTGGGCCAC ATTAGCGTGT GCCCGGACCC GGACGTCCCG GAAGCTCGCG TCGATCTCGC GCAGTTAGTG AAAACTCGTG AAGCACAGGG CCAGCGTCTG CCTGCACTGT TCTGTTTCCC ACAGATCCTG CAGCACCGTT TGCGTTCTAT TAACGCCGCG TTCAAACGTG CGCGGGAATC CTACGGCTAT AACGGCGATT ACTTCCTCGT TTATCCGATC AAAGTTAACC AGCACCGCCG CGTGATCGAG TCCCTGATTC ATTCGGGCGA ACCGCTGGGT CTGGAAGCCG GTTCCAAAGC CGAGTTGATG GCAGTGCTGG CACATGCTGG CATGACTCGT AGCGTCATCG TCTGCAACGG TTATAAGGAT CGCGAATATA TCCGCCTGGC ATTAATTGGC GAGAAGATGG GCCACAAGGT CTATCTGGTC ATTGAGAAGA TGTCAGAAAT CGCCATTGTG CTGGATGAAG CAGAACGTCT GAATGTCGTT CCTCGTCTGG GCGTGCGTGC ACGTCTGGCC TCGCAGGGCT CGGGGAAATG GCAGTCCTCC GGCGGGGAAA AATCGAAGTT CGGCCTGGCT GCGACTCAGG TACTGCAACT GGTTGAAACG CTGCGTGAAG CCGGGCGTCT CGACAGTCTG CAACTGCTGC ACTTCCACCT CGGTTCGCAG ATGGCGAATA TTCGCGATAT CGCGACAGGC GTTCGTGAAT CCGCGCGTTT CTATGTTGAA CTGCACAAGC TGGGCGTCAA TATTCAGTGC TTCGACGTCG GCGGCGGTCT GGGCGTGGAC TATGAAGGCA CCCGTTCGCA GTCTGACTGT TCGGTGAACT ACGGCCTCAA TGAATACGCC AACAACATCA TCTGGGCGAT TGGTGATGCG TGTGAAGAAA ATGGTCTGCC ACATCCGACG GTAATCACCG AGTCGGGTCG TGCGGTGACT GCGCATCACA CCGTGCTGGT GTCGAATATC ATCGGCGTGG AACGTAACGA ATACACGGTG CCGACCGCGC CTGCAGAAGA CGCACCGCGC GCCCTGCAAA GCATGTGGGA AACCTGGCAG GAGATGCACG AACCGGGAAC TCGCCGTTCT CTGCGTGAAT GGTTACACGA CAGCCAGATG GATCTGCACG ATATTCATAT CGGCTACTCT TCCGGCACCT TTAGCCTGCA AGAACGTGCA TGGGCAGAAC AACTCTATCT GAGCATGTGC CATGAAGTGC AGAAGCAACT GGATCCGCAA AACCGTGCGC ATCGTCCGAT TATCGACGAA CTGCAGGAAC GTATGGCGGA CAAAATGTAC GTCAACTTCT CGCTGTTCCA GTCGATGCCG GACGCATGGG GGATCGACCA GTTGTTCCCG GTTCTGCCGC TGGAAGGGTT GGATCAAGTG CCGGAGCGCC GCGCTGTGCT GCTGGATATT ACCTGTGACT CTGACGGTGC TATTGATCAC TACATCGATG GTGATGGTAT TGCCACGACA ATGCCCATGC CGGAGTACGA TCCAGAGAAT CCGCCGATGC TCGGTTTCTT CATGGTCGGC GCATATCAGG AGATCCTCGG CAACATGCAC AACCTGTTCG GTGATACCGA AGCGGTTGAC GTGTTCGTCT TCCCTGACGG CAGCGTAGAA GTAGAACTGT CTGACGAAGG CGATACCGTG GCGGACATGC TGCAATATGT ACAGCTCGAT CCGAAAACGT TGTTAACCCA GTTCCGTGAT CAAGTGAAGA AAACCGATCT TGATGCTGAA CTGCAACAAC AGTTCCTTGA AGAGTTCGAG GCAGGTTTGT ACGGTTATAC CTATCTTGAA GATGAATAA
|
Protein sequence | MSSQEASKML RTYNIAWWGN NYYDVNELGH ISVCPDPDVP EARVDLAQLV KTREAQGQRL PALFCFPQIL QHRLRSINAA FKRARESYGY NGDYFLVYPI KVNQHRRVIE SLIHSGEPLG LEAGSKAELM AVLAHAGMTR SVIVCNGYKD REYIRLALIG EKMGHKVYLV IEKMSEIAIV LDEAERLNVV PRLGVRARLA SQGSGKWQSS GGEKSKFGLA ATQVLQLVET LREAGRLDSL QLLHFHLGSQ MANIRDIATG VRESARFYVE LHKLGVNIQC FDVGGGLGVD YEGTRSQSDC SVNYGLNEYA NNIIWAIGDA CEENGLPHPT VITESGRAVT AHHTVLVSNI IGVERNEYTV PTAPAEDAPR ALQSMWETWQ EMHEPGTRRS LREWLHDSQM DLHDIHIGYS SGTFSLQERA WAEQLYLSMC HEVQKQLDPQ NRAHRPIIDE LQERMADKMY VNFSLFQSMP DAWGIDQLFP VLPLEGLDQV PERRAVLLDI TCDSDGAIDH YIDGDGIATT MPMPEYDPEN PPMLGFFMVG AYQEILGNMH NLFGDTEAVD VFVFPDGSVE VELSDEGDTV ADMLQYVQLD PKTLLTQFRD QVKKTDLDAE LQQQFLEEFE AGLYGYTYLE DE
|
| |