Gene EcSMS35_3081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3081 
SymbolspeA 
ID6142594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3170821 
End bp3172719 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content55% 
IMG OID641617949 
Productarginine decarboxylase 
Protein accessionYP_001745100 
Protein GI170679666 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1166] Arginine decarboxylase (spermidine biosynthesis) 
TIGRFAM ID[TIGR01273] arginine decarboxylase, biosynthetic 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.602145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00168239 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCTCCC AGGAAGCCAG CAAGATGCTG CGTACTTACA ATATTGCCTG GTGGGGCAAT 
AACTACTATG ACGTTAACGA GCTGGGCCAC ATTAGCGTGT GCCCGGACCC GGACGTCCCG
GAAGCTCGCG TCGATCTCGC GCAGTTAGTG AAAACTCGTG AAGCACAGGG CCAGCGTCTG
CCTGCACTGT TCTGTTTCCC ACAGATCCTG CAGCACCGTT TGCGTTCTAT TAACGCCGCG
TTCAAACGTG CGCGGGAATC CTACGGCTAT AACGGCGATT ACTTCCTCGT TTATCCGATC
AAAGTTAACC AGCACCGCCG CGTGATCGAG TCCCTGATTC ATTCGGGCGA ACCGCTGGGT
CTGGAAGCCG GTTCCAAAGC CGAGTTGATG GCAGTGCTGG CACATGCTGG CATGACTCGT
AGCGTCATCG TCTGCAACGG TTATAAGGAT CGCGAATATA TCCGCCTGGC ATTAATTGGC
GAGAAGATGG GCCACAAGGT CTATCTGGTC ATTGAGAAGA TGTCAGAAAT CGCCATTGTG
CTGGATGAAG CAGAACGTCT GAATGTCGTT CCTCGTCTGG GCGTGCGTGC ACGTCTGGCC
TCGCAGGGCT CGGGGAAATG GCAGTCCTCC GGCGGGGAAA AATCGAAGTT CGGCCTGGCT
GCGACTCAGG TACTGCAACT GGTTGAAACG CTGCGTGAAG CCGGGCGTCT CGACAGTCTG
CAACTGCTGC ACTTCCACCT CGGTTCGCAG ATGGCGAATA TTCGCGATAT CGCGACAGGC
GTTCGTGAAT CCGCGCGTTT CTATGTTGAA CTGCACAAGC TGGGCGTCAA TATTCAGTGC
TTCGACGTCG GCGGCGGTCT GGGCGTGGAC TATGAAGGCA CCCGTTCGCA GTCTGACTGT
TCGGTGAACT ACGGCCTCAA TGAATACGCC AACAACATCA TCTGGGCGAT TGGTGATGCG
TGTGAAGAAA ATGGTCTGCC ACATCCGACG GTAATCACCG AGTCGGGTCG TGCGGTGACT
GCGCATCACA CCGTGCTGGT GTCGAATATC ATCGGCGTGG AACGTAACGA ATACACGGTG
CCGACCGCGC CTGCAGAAGA CGCACCGCGC GCCCTGCAAA GCATGTGGGA AACCTGGCAG
GAGATGCACG AACCGGGAAC TCGCCGTTCT CTGCGTGAAT GGTTACACGA CAGCCAGATG
GATCTGCACG ATATTCATAT CGGCTACTCT TCCGGCACCT TTAGCCTGCA AGAACGTGCA
TGGGCAGAAC AACTCTATCT GAGCATGTGC CATGAAGTGC AGAAGCAACT GGATCCGCAA
AACCGTGCGC ATCGTCCGAT TATCGACGAA CTGCAGGAAC GTATGGCGGA CAAAATGTAC
GTCAACTTCT CGCTGTTCCA GTCGATGCCG GACGCATGGG GGATCGACCA GTTGTTCCCG
GTTCTGCCGC TGGAAGGGTT GGATCAAGTG CCGGAGCGCC GCGCTGTGCT GCTGGATATT
ACCTGTGACT CTGACGGTGC TATTGATCAC TACATCGATG GTGATGGTAT TGCCACGACA
ATGCCCATGC CGGAGTACGA TCCAGAGAAT CCGCCGATGC TCGGTTTCTT CATGGTCGGC
GCATATCAGG AGATCCTCGG CAACATGCAC AACCTGTTCG GTGATACCGA AGCGGTTGAC
GTGTTCGTCT TCCCTGACGG CAGCGTAGAA GTAGAACTGT CTGACGAAGG CGATACCGTG
GCGGACATGC TGCAATATGT ACAGCTCGAT CCGAAAACGT TGTTAACCCA GTTCCGTGAT
CAAGTGAAGA AAACCGATCT TGATGCTGAA CTGCAACAAC AGTTCCTTGA AGAGTTCGAG
GCAGGTTTGT ACGGTTATAC CTATCTTGAA GATGAATAA
 
Protein sequence
MSSQEASKML RTYNIAWWGN NYYDVNELGH ISVCPDPDVP EARVDLAQLV KTREAQGQRL 
PALFCFPQIL QHRLRSINAA FKRARESYGY NGDYFLVYPI KVNQHRRVIE SLIHSGEPLG
LEAGSKAELM AVLAHAGMTR SVIVCNGYKD REYIRLALIG EKMGHKVYLV IEKMSEIAIV
LDEAERLNVV PRLGVRARLA SQGSGKWQSS GGEKSKFGLA ATQVLQLVET LREAGRLDSL
QLLHFHLGSQ MANIRDIATG VRESARFYVE LHKLGVNIQC FDVGGGLGVD YEGTRSQSDC
SVNYGLNEYA NNIIWAIGDA CEENGLPHPT VITESGRAVT AHHTVLVSNI IGVERNEYTV
PTAPAEDAPR ALQSMWETWQ EMHEPGTRRS LREWLHDSQM DLHDIHIGYS SGTFSLQERA
WAEQLYLSMC HEVQKQLDPQ NRAHRPIIDE LQERMADKMY VNFSLFQSMP DAWGIDQLFP
VLPLEGLDQV PERRAVLLDI TCDSDGAIDH YIDGDGIATT MPMPEYDPEN PPMLGFFMVG
AYQEILGNMH NLFGDTEAVD VFVFPDGSVE VELSDEGDTV ADMLQYVQLD PKTLLTQFRD
QVKKTDLDAE LQQQFLEEFE AGLYGYTYLE DE