Gene EcSMS35_3919 details

Gene Information

Locus tag: EcSMS35_3919
Symbol: selA
ID: 6145567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3991534
End bp: 3992925
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 56%
IMG OID: 641618745
Product: selenocysteine synthase
Protein accession: YP_001745884
Protein GI: 170680855
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1921] Selenocysteine synthase [seryl-tRNASer selenium transferase]
TIGRFAM ID: [TIGR00474] seryl-tRNA(sec) selenium transferase
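
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not count toward the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3991534, 3992925)
print(length)                     # 1392
print(protein_length_aa(length))  # 463
```

Both results match the Gene Length and Protein Length fields of the record.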


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.419812
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACCG AAACGCGTTC CCTCTATAGT CAACTTCCGG CTATTGATCG CTTATTGCGC 
GATAGCTCCT TCCTTTCTTT GCGTGATACT TATGGTCACA CCCGCGTGGT GGAATTGTTG
CGTCAGATGC TCGACGAAGC GCGAGAAGTG ATTCGTGGCA GCCAGACGCT GCCCGCATGG
TGTGAAAACT GGGCGCAAGA AGTTGATGCC CGGTTGACGA AAGAGGCGCA AAGCGCGCTG
CGTCCGGTGA TCAACCTGAC GGGAACCGTG CTGCATACCA ACCTTGGGCG AGCTTTACAG
GCGGAAGCCG CGGTGGAAGC CGTTACGAAG GCTATGCGTT CGCCAGTGAC CCTCGAGTAC
GATCTGGACG ACGCCGGACG CGGACATCGC GATCGGGCGC TGGCGCAGCT GCTGTGCCGT
ATTACGGGGG CGGAAGATGC CTGTATCGTT AATAACAATG CGGCGGCGGT GTTATTGATG
TTGGCGGCCA CTGCCAGCGG GAAAGAGGTG GTGGTTTCTC GCGGCGAACT GGTGGAGATT
GGCGGCGCGT TTCGTATTCC TGATGTCATG CGTCAGGCAG GCTGCACCCT ACACGAAGTG
GGGACCACCA ACCGCACGCA CGCTAATGAT TATCGCCAGG CGGTGAATGA AAATACCGCG
CTGTTGATGA AAGTACATAC CAGTAACTAC AGCATTCAGG GATTCACCAA AGCGATAGAT
GAAGCGGAAC TGGTGGCGCT CGGCAAAGAG CTGGATGTCC CCGTAGTGAC TGATTTAGGC
AGTGGCTCGC TGGTCGATCT TAGCCAGTAC GGTTTGCCGA AAGAGCCAAT GCCGCAGGAG
TTGATTGCGG CGGGCGTCAG TCTGGTGAGC TTCTCCGGCG ACAAGTTGTT AGGCGGGCCG
CAGGCAGGAA TTATTGTTGG TAAAAAAGAG ATGATCGCCC GACTGCAAAG CCACCCGCTG
AAGCGTGCAT TACGCGCGGA TAAAATGACC CTTGCGGCGC TGGAAGCCAC GTTGCGTCTT
TATTTACACC CTGAAGCTCT GAGTAAAAAA TTACCGACCC TGCGCCTGCT TACCCGCAGC
GCAGAGGTCA TTCAAATCCA GGCACAACGT TTACAGGCTC CCCTTGCCGC ACATTACGGC
GCGGAGTTTG CGGTACAGGT TATGCCATGT CTTTCGCAGA TTGGCAGTGG TTCGCTGCCG
GTTGATCGCC TGCCGAGCGC GGCATTAACG TTTACACCCC ATGATGGACG CGGTAGCCAC
CTTGAGTCAT TAGCCGCCCG CTGGCGTGAA TTGCCAGTGC CGGTGATTGG TCGTATTTAT
GACGGACGAT TGTGGCTGGA TTTACGCTGC CTTGAAGATG AGCAACGGTT TTTGGAGATG
TTGTTGAAAT GA
 
Protein sequence
MTTETRSLYS QLPAIDRLLR DSSFLSLRDT YGHTRVVELL RQMLDEAREV IRGSQTLPAW 
CENWAQEVDA RLTKEAQSAL RPVINLTGTV LHTNLGRALQ AEAAVEAVTK AMRSPVTLEY
DLDDAGRGHR DRALAQLLCR ITGAEDACIV NNNAAAVLLM LAATASGKEV VVSRGELVEI
GGAFRIPDVM RQAGCTLHEV GTTNRTHAND YRQAVNENTA LLMKVHTSNY SIQGFTKAID
EAELVALGKE LDVPVVTDLG SGSLVDLSQY GLPKEPMPQE LIAAGVSLVS FSGDKLLGGP
QAGIIVGKKE MIARLQSHPL KRALRADKMT LAALEATLRL YLHPEALSKK LPTLRLLTRS
AEVIQIQAQR LQAPLAAHYG AEFAVQVMPC LSQIGSGSLP VDRLPSAALT FTPHDGRGSH
LESLAARWRE LPVPVIGRIY DGRLWLDLRC LEDEQRFLEM LLK
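
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of that step, together with a GC percentage and a simple translation. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as in the standard genetic code, and it ignores table 11's alternative start codons because this gene starts with ATG. All helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACAACCG AAACGCGTTC CCTCTATAGT"
print(translate(first_blocks))  # MTTETRSLYS
```

The ten residues printed match the first block of the protein sequence. Passing the complete gene sequence to `gc_percent` and `translate` allows the same comparison against the GC content field and the full protein listing.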