Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3919 |
Symbol | selA |
ID | 6145567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3991534 |
End bp | 3992925 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618745 |
Product | selenocysteine synthase |
Protein accession | YP_001745884 |
Protein GI | 170680855 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1921] Selenocysteine synthase [seryl-tRNASer selenium transferase] |
TIGRFAM ID | [TIGR00474] seryl-tRNA(sec) selenium transferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.419812 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACCG AAACGCGTTC CCTCTATAGT CAACTTCCGG CTATTGATCG CTTATTGCGC GATAGCTCCT TCCTTTCTTT GCGTGATACT TATGGTCACA CCCGCGTGGT GGAATTGTTG CGTCAGATGC TCGACGAAGC GCGAGAAGTG ATTCGTGGCA GCCAGACGCT GCCCGCATGG TGTGAAAACT GGGCGCAAGA AGTTGATGCC CGGTTGACGA AAGAGGCGCA AAGCGCGCTG CGTCCGGTGA TCAACCTGAC GGGAACCGTG CTGCATACCA ACCTTGGGCG AGCTTTACAG GCGGAAGCCG CGGTGGAAGC CGTTACGAAG GCTATGCGTT CGCCAGTGAC CCTCGAGTAC GATCTGGACG ACGCCGGACG CGGACATCGC GATCGGGCGC TGGCGCAGCT GCTGTGCCGT ATTACGGGGG CGGAAGATGC CTGTATCGTT AATAACAATG CGGCGGCGGT GTTATTGATG TTGGCGGCCA CTGCCAGCGG GAAAGAGGTG GTGGTTTCTC GCGGCGAACT GGTGGAGATT GGCGGCGCGT TTCGTATTCC TGATGTCATG CGTCAGGCAG GCTGCACCCT ACACGAAGTG GGGACCACCA ACCGCACGCA CGCTAATGAT TATCGCCAGG CGGTGAATGA AAATACCGCG CTGTTGATGA AAGTACATAC CAGTAACTAC AGCATTCAGG GATTCACCAA AGCGATAGAT GAAGCGGAAC TGGTGGCGCT CGGCAAAGAG CTGGATGTCC CCGTAGTGAC TGATTTAGGC AGTGGCTCGC TGGTCGATCT TAGCCAGTAC GGTTTGCCGA AAGAGCCAAT GCCGCAGGAG TTGATTGCGG CGGGCGTCAG TCTGGTGAGC TTCTCCGGCG ACAAGTTGTT AGGCGGGCCG CAGGCAGGAA TTATTGTTGG TAAAAAAGAG ATGATCGCCC GACTGCAAAG CCACCCGCTG AAGCGTGCAT TACGCGCGGA TAAAATGACC CTTGCGGCGC TGGAAGCCAC GTTGCGTCTT TATTTACACC CTGAAGCTCT GAGTAAAAAA TTACCGACCC TGCGCCTGCT TACCCGCAGC GCAGAGGTCA TTCAAATCCA GGCACAACGT TTACAGGCTC CCCTTGCCGC ACATTACGGC GCGGAGTTTG CGGTACAGGT TATGCCATGT CTTTCGCAGA TTGGCAGTGG TTCGCTGCCG GTTGATCGCC TGCCGAGCGC GGCATTAACG TTTACACCCC ATGATGGACG CGGTAGCCAC CTTGAGTCAT TAGCCGCCCG CTGGCGTGAA TTGCCAGTGC CGGTGATTGG TCGTATTTAT GACGGACGAT TGTGGCTGGA TTTACGCTGC CTTGAAGATG AGCAACGGTT TTTGGAGATG TTGTTGAAAT GA
|
Protein sequence | MTTETRSLYS QLPAIDRLLR DSSFLSLRDT YGHTRVVELL RQMLDEAREV IRGSQTLPAW CENWAQEVDA RLTKEAQSAL RPVINLTGTV LHTNLGRALQ AEAAVEAVTK AMRSPVTLEY DLDDAGRGHR DRALAQLLCR ITGAEDACIV NNNAAAVLLM LAATASGKEV VVSRGELVEI GGAFRIPDVM RQAGCTLHEV GTTNRTHAND YRQAVNENTA LLMKVHTSNY SIQGFTKAID EAELVALGKE LDVPVVTDLG SGSLVDLSQY GLPKEPMPQE LIAAGVSLVS FSGDKLLGGP QAGIIVGKKE MIARLQSHPL KRALRADKMT LAALEATLRL YLHPEALSKK LPTLRLLTRS AEVIQIQAQR LQAPLAAHYG AEFAVQVMPC LSQIGSGSLP VDRLPSAALT FTPHDGRGSH LESLAARWRE LPVPVIGRIY DGRLWLDLRC LEDEQRFLEM LLK
|
| |