Gene EcHS_A3096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3096 
SymbolspeA 
ID5595215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3111562 
End bp3113538 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content54% 
IMG OID640922215 
Productarginine decarboxylase 
Protein accessionYP_001459715 
Protein GI157162397 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1166] Arginine decarboxylase (spermidine biosynthesis) 
TIGRFAM ID[TIGR01273] arginine decarboxylase, biosynthetic 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGACG ACATGTCTAT GGGTTTGCCT TCGTCAGCGG GCGAACACGG TGTACTACGC 
TCCATGCAGG AGGTTGCAAT GAGCTCCCAG GAAGCCAGCA AGATGCTGCG TACTTACAAT
ATTGCCTGGT GGGGCAATAA CTACTATGAC GTTAACGAGC TGGGCCACAT CAGCGTGTGC
CCGGACCCGG ACGTCCCGGA AGCTCGCGTC GATCTCGCGC AGTTAGTGAA AACTCGTGAA
GCACAGGGTC AGCGTCTGCC TGCACTGTTC TGTTTCCCAC AGATCCTGCA GCACCGTTTG
CGTTCCATTA ACGCCGCGTT CAAACGTGCG CGGGAATCCT ACGGCTATAA CGGCGATTAC
TTCCTTGTTT ATCCGATCAA AGTTAACCAG CACCGCCGCG TGATTGAGTC CCTGATTCAT
TCGGGCGAAC CGCTGGGACT GGAAGCAGGC TCGAAAGCAG AGTTGATGGC GGTGCTGGCA
CATGCTGGCA TGACCCGTAG CGTCATCGTC TGCAACGGTT ATAAAGACCG CGAATATATC
CGCCTGGCAT TAATTGGCGA GAAGATGGGC CACAAGGTCT ACCTGGTCAT TGAGAAGATG
TCAGAAATCG CCATTGTGCT GGATGAAGCA GAACGTCTGA ATGTCGTTCC TCGTCTGGGC
GTGCGTGCAC GTCTGGCTTC GCAGGGCTCG GGTAAATGGC AGTCCTCCGG CGGGGAAAAA
TCGAAGTTCG GCCTGGCTGC GACTCAGGTA CTGCAACTGG TTGAAACCCT GCGTGAAGCC
GGGCGTCTCG ACAGCCTGCA ACTACTGCAC TTCCACCTCG GTTCGCAGAT GGCGAATATT
CGCGATATCG CCACAGGCGT TCGTGAATCG GCGCGTTTCT ATGTGGAACT GCACAAGCTG
GGCGTCAATA TTCAGTGCTT CGACGTCGGC GGCGGTCTGG GCGTGGACTA TGAAGGCACC
CGTTCGCAGT CTGACTGCTC CGTAAACTAC GGCCTCAATG AATATGCCAA CAACATCATC
TGGGCGATTG GTGATGCGTG TGAAGAAAAC GGTCTGCCGC ATCCGACGGT AATCACCGAA
TCGGGTCGTG CGGTGACTGC GCATCACACC GTGCTGGTGT CTAATATCAT CGGCGTGGAA
CGTAACGAAT ACACGGTGCC GACCGCGCCT GCAGAAGATG CGCCGCGCGC GCTGCAAAGC
ATGTGGGAAA CCTGGCAGGA GATGCACGAA CCGGGAACTC GCCGTTCTCT GCGTGAATGG
TTACACGACA GTCAGATGGA TCTGCACGAC ATTCATATCG GCTACTCTTC CGGCACCTTT
AGCCTGCAAG AACGTGCATG GGCAGAACAA CTCTATCTGA GCATGTGCCA TGAAGTGCAG
AAGCAACTGG ATCCGCAAAA CCGTGCGCAT CGTCCGATAA TCGACGAACT GCAGGAACGT
ATGGCGGACA AAATGTACGT CAATTTCTCG CTGTTCCAGT CAATGCCGGA CGCATGGGGG
ATCGACCAGT TGTTCCCGGT TCTGCCGCTG GAAGGGCTGG ATCAAGTGCC GGAACGTCGC
GCTGTGCTGC TGGATATTAC CTGTGACTCT GACGGTGCTA TCGACCACTA TATTGATGGT
GATGGTATTG CCACGACAAT GCCAATGCCG GAGTACGATC CAGAGAATCC GCCGATGCTC
GGTTTCTTTA TGGTCGGCGC ATATCAGGAG ATCCTTGGCA ATATGCACAA CCTGTTCGGT
GATACCGAAG CGGTTGACGT GTTCGTGTTC CCTGACGGTA GCGTAGAAGT AGAACTGTCT
GACGAAGGCG ATACCGTGGC GGACATGCTG CAATATGTAC AGCTCGATCC GAAAACGCTG
TTAACCCAGT TCCGCGATCA AGTGAAGAAA ACCGATCTTG ATGCTGAACT GCAACAACAG
TTCCTTGAAG AGTTCGAGGC AGGTTTGTAC GGTTATACTT ATCTTGAAGA TGAGTAA
 
Protein sequence
MSDDMSMGLP SSAGEHGVLR SMQEVAMSSQ EASKMLRTYN IAWWGNNYYD VNELGHISVC 
PDPDVPEARV DLAQLVKTRE AQGQRLPALF CFPQILQHRL RSINAAFKRA RESYGYNGDY
FLVYPIKVNQ HRRVIESLIH SGEPLGLEAG SKAELMAVLA HAGMTRSVIV CNGYKDREYI
RLALIGEKMG HKVYLVIEKM SEIAIVLDEA ERLNVVPRLG VRARLASQGS GKWQSSGGEK
SKFGLAATQV LQLVETLREA GRLDSLQLLH FHLGSQMANI RDIATGVRES ARFYVELHKL
GVNIQCFDVG GGLGVDYEGT RSQSDCSVNY GLNEYANNII WAIGDACEEN GLPHPTVITE
SGRAVTAHHT VLVSNIIGVE RNEYTVPTAP AEDAPRALQS MWETWQEMHE PGTRRSLREW
LHDSQMDLHD IHIGYSSGTF SLQERAWAEQ LYLSMCHEVQ KQLDPQNRAH RPIIDELQER
MADKMYVNFS LFQSMPDAWG IDQLFPVLPL EGLDQVPERR AVLLDITCDS DGAIDHYIDG
DGIATTMPMP EYDPENPPML GFFMVGAYQE ILGNMHNLFG DTEAVDVFVF PDGSVEVELS
DEGDTVADML QYVQLDPKTL LTQFRDQVKK TDLDAELQQQ FLEEFEAGLY GYTYLEDE