Gene EcSMS35_1427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1427 
SymbolselD 
ID6146419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1412305 
End bp1413348 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content55% 
IMG OID641616305 
Productselenophosphate synthetase 
Protein accessionYP_001743485 
Protein GI170681972 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0709] Selenophosphate synthase 
TIGRFAM ID[TIGR00476] selenium donor protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000127521 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.773319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA ACTCGATTCG TTTGACCCAA TACAGCCACG GCGCTGGTTG CGGCTGTAAA 
ATTTCCCCAA AAGTGTTGGA AACCATCCTG CACAGTGAGC AGGCGAAGTT TGTTGATCCG
AATTTGCTTG TGGGTAATGA AACCCGCGAC GATGCGGCGG TGTACGATCT GGGCAATGGC
ACCAGCGTTA TCAGTACCAC CGACTTCTTT ATGCCGATCG TTGATAATCC TTTCGATTTT
GGCCGCATTG CGGCGACTAA CGCCATCAGC GATATCTTCG CAATGGGCGG CAAACCGATT
ATGGCGATTG CGATCCTCGG CTGGCCGATT AACAAACTTT CCCCGGAAAT TGCCCGCGAA
GTGACCGAAG GTGGACGCTA TGCATGCCGT CAGGCGGGTA TTGCGCTGGC TGGCGGTCAC
TCCATCGATG CGCCGGAGCC GATTTTTGGT CTGGCGGTAA CGGGGATCGT ACCGACCGAG
CGGGTGAAGA AAAACAGTAC CGCACAAGCC GGATGCAAAC TGTTCCTGAC GAAACCGCTG
GGGATTGGCG TTCTTACCAC GGCTGAGAAA AAATCACTGC TTAAACCAGA ACATCAGGGA
TTGGCGACGG AAGTGATGTG CCGGATGAAC ATCGCAGGCG CGTCCTTTGC CAACATCGAA
GGCGTAAAAG CGATGACAGA CGTTACGGGT TTTGGTTTGC TGGGGCATTT AAGCGAAATG
TGTCAGGGCG CAGGTGTGCA GGCGCGCGTC GACTATGACG CGATCCCGAA ATTGCCTGGG
GTTGAAGAGT ACATTAAGTT GGGCGCAGTG CCTGGCGGCA CCGAACGTAA CTTTGCCAGC
TACGGTCATC TGATGGGTGA AATGCCGCGT GAAGTGCGCG ATCTGCTGTG CGATCCGCAA
ACTTCTGGCG GTTTGCTGCT GGCGGTCATT CCGGAAGCAG AAAATGAGGT CAAAGCTACA
GCCGCCGAGT TTGGCATTGA ACTGACGGCG ATTGGCGAAC TGGTGCCAGC GCGCGGCGGT
CGTGCCATGG TTGAGATCCG TTAA
 
Protein sequence
MSENSIRLTQ YSHGAGCGCK ISPKVLETIL HSEQAKFVDP NLLVGNETRD DAAVYDLGNG 
TSVISTTDFF MPIVDNPFDF GRIAATNAIS DIFAMGGKPI MAIAILGWPI NKLSPEIARE
VTEGGRYACR QAGIALAGGH SIDAPEPIFG LAVTGIVPTE RVKKNSTAQA GCKLFLTKPL
GIGVLTTAEK KSLLKPEHQG LATEVMCRMN IAGASFANIE GVKAMTDVTG FGLLGHLSEM
CQGAGVQARV DYDAIPKLPG VEEYIKLGAV PGGTERNFAS YGHLMGEMPR EVRDLLCDPQ
TSGGLLLAVI PEAENEVKAT AAEFGIELTA IGELVPARGG RAMVEIR