Gene EcSMS35_3109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3109 
SymbolspeC 
ID6143820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3193823 
End bp3195958 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content53% 
IMG OID641617977 
Productornithine decarboxylase 
Protein accessionYP_001745128 
Protein GI170681617 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1982] Arginine/lysine/ornithine decarboxylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.015732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAA TGAATATTGC CGCCAGTAGT GAACTGGTAT CCCGACTTTC TTCTCATCGT 
CGCGTGGTGG CGTTGGGAGA TACTGATTTT ACGGACGTCG CGGCAGTCGT CATTACCGCT
GCGGATAGTC GCAGTGGCAT TCTTGCGTTG CTTAAGCGCA CCGGTTTTCA TCTACCGGTG
TTTTTGTATT CCGAACATGC TGTTGAATTA CCTGCGGGCG TTACGGCGGT AATCAACGGC
AACGAGCAGC AGTGGCTGGA GCTGGAATCC GCAGCCTGTC AGTATGAAGA GAATTTGCTG
CCACCGTTTT ATGACACGCT GACGCAGTAC GTTGAGATGG GCAACAGCAC CTTTGCTTGC
CCTGGACATC AACATGGTGC GTTTTTTAAA AAGCATCCTG CCGGACGCCA TTTTTACGAT
TTCTTTGGTG AGAACGTCTT TCGCGCCGAT ATGTGTAACG CTGACGTAAA ATTGGGCGAT
CTGCTTATTC ATGAAGGATC GGCGAAGGAT GCGCAGAAAT TCGCAGCCAA AGTCTTTCAT
GCCGATAAAA CCTATTTTGT GCTGAACGGC ACATCGGCAG CGAATAAAGT GGTGACGAAT
GCGCTGTTAA CGCGTGGCGA TCTGGTGCTC TTCGACCGTA ACAACCATAA GTCGAATCAT
CACGGCGCGC TGATTCAGGC GGGGGCGACG CCGGTCTATC TGGAAGTTTC ACGTAACCCG
TTTGGTTTCA TTGGCGGTAT TGATGCGCAC TGTTTTAATG AAGAGTATCT GCGCCAGCAA
ATTCGCGACG TTGCGCCAGA AAAAGCTGAG TTGCCGCGCC CGTTTCGCCT GGCGATTATT
CAGCTGGGAA CCTATGACGG CACTGTCTAT AACGCCCGTC AGGTGATCGA TACCGTTGGG
CATCTGTGTG ATTACATTCT GTTTGATTCC GCGTGGGTCG GTTACGAACA GTTTATCCCG
ATGATGGCGG ATAGCTCGCC GCTGCTGTTA GAACTTAACG AAAACGATCC GGGGATCTTT
GTGACCCAGT CGGTGCACAA ACAGCAGGCG GGATTCTCAC AGACGTCGCA GATCCATAAA
AAAGATAACC ATATCCGTGG ACAGGCGCGT TTTTGCCCGC ATAAGCGGTT GAATAATGCC
TTTATGCTCC ATGCTTCTAC CAGCCCGTTC TATCCGCTGT TCGCCGCGCT GGACGTTAAC
GCCAAAATTC ATGAAGGGGA GAGTGGGCGT CGGCTGTGGG CGGAGTGCGT TGCGTTGGGG
ATTGAAGCCC GGAAGGCGAT TCTTGCGCGC TGTAAGCTGT TCCGCCCGTT TATCCCGCCC
GTTGTTGATG GCAAATTGTG GCAGGATTAT CCGACGTCAG TGTTAGCCAG CGACCGCCGT
TTTTTCAGTT TTGAGCCGGG GGCGAAGTGG CACGGCTTTG AAGGATATGC CGCGGATCAG
TATTTTGTTG ATCCGTGCAA GCTGTTACTC ACCACGCCGG GTATCGATGC TGAAACCGGC
GAATATAGCG ACTTTGGCGT TCCGGCGACG ATTCTGGCGC ACTATCTGCG TGAGAACGGC
ATTGTGCCGG AGAAGTGCGA TCTCAATTCC ATTCTGTTCT TATTAACTCC GGCGGAAAGC
CACGAGAAGC TGGCGCAACT GGTGGCGATG CTGGCGCAAT TTGAACAGCA TATTGAGGAT
GACTCGCCGC TGGCTGAGGT GTTGCCGAGC ATTTATAACA AATATCCGGT GCGCTATCGC
GACTACACCC TGCGCCAGTT GTGTCAGGAG ATGCACGATT TGTATGTCAG TTTCGACGTC
AAAGACCTAC AAAAAGCGAT GTTCCGCCAG CAGAGTTTCC CGTCAGTGGT GATGAATCCC
CAGGATGCGC ATAGCGCTTA TATTCGCGGT GAAGTGGAGT TGGTGCGGAT TCGTGATGCC
GAAGGGCGAA TTGCGGCAGA AGGGGCGTTG CCTTATCCCC CTGGCGTGCT TTGCGTGGTG
CCCGGGGAAG TCTGGGGCGG GGCGGTCCAA CGTTATTTCC TTGCGCTGGA AGAAGGGGTG
AATTTGCTGC CAGGTTTTTC ACCGGAGCTG CAAGGTGTCT ATAGCGAAAC CGATGCGGAT
GGCATGAAAC GGTTGTACGG TTATGTGTTG AAGTAA
 
Protein sequence
MKSMNIAASS ELVSRLSSHR RVVALGDTDF TDVAAVVITA ADSRSGILAL LKRTGFHLPV 
FLYSEHAVEL PAGVTAVING NEQQWLELES AACQYEENLL PPFYDTLTQY VEMGNSTFAC
PGHQHGAFFK KHPAGRHFYD FFGENVFRAD MCNADVKLGD LLIHEGSAKD AQKFAAKVFH
ADKTYFVLNG TSAANKVVTN ALLTRGDLVL FDRNNHKSNH HGALIQAGAT PVYLEVSRNP
FGFIGGIDAH CFNEEYLRQQ IRDVAPEKAE LPRPFRLAII QLGTYDGTVY NARQVIDTVG
HLCDYILFDS AWVGYEQFIP MMADSSPLLL ELNENDPGIF VTQSVHKQQA GFSQTSQIHK
KDNHIRGQAR FCPHKRLNNA FMLHASTSPF YPLFAALDVN AKIHEGESGR RLWAECVALG
IEARKAILAR CKLFRPFIPP VVDGKLWQDY PTSVLASDRR FFSFEPGAKW HGFEGYAADQ
YFVDPCKLLL TTPGIDAETG EYSDFGVPAT ILAHYLRENG IVPEKCDLNS ILFLLTPAES
HEKLAQLVAM LAQFEQHIED DSPLAEVLPS IYNKYPVRYR DYTLRQLCQE MHDLYVSFDV
KDLQKAMFRQ QSFPSVVMNP QDAHSAYIRG EVELVRIRDA EGRIAAEGAL PYPPGVLCVV
PGEVWGGAVQ RYFLALEEGV NLLPGFSPEL QGVYSETDAD GMKRLYGYVL K