Gene SeD_A2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2049 
SymbolselD 
ID6874553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1980975 
End bp1982018 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content58% 
IMG OID642785163 
Productselenophosphate synthetase 
Protein accessionYP_002215829 
Protein GI198243320 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0709] Selenophosphate synthase 
TIGRFAM ID[TIGR00476] selenium donor protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.25484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.757262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC AAGCCATTCG TTTAACGCAA TACAGCCACG GCGCTGGTTG CGGTTGTAAA 
ATTTCCCCTA AAGTGCTGGA GACTATCCTG CATAGCGAGC AGGCGAAGTT CGTCGACCCG
AACCTGCTGG TGGGTAATGA AACCCGCGAT GATGCGGCGG TTTACGATCT GGGTAATGGC
ACCAGTATCA TCAGCACCAC CGACTTCTTT ATGCCGATAG TCGACAACCC GTTTGATTTT
GGCCGCATTG CGGCAACAAA CGCCATCAGC GATATTTTTG CGATGGGCGG CAAACCGATT
ATGGCGATCG CGATCCTTGG CTGGCCGATT AACACCCTGT CGCCCGATAT TGCGCGTGAA
GTGACCGAGG GGGGGCGCTT TGCCTGCCGT CAGGCCGGTA TCGCGCTGGC GGGCGGACAC
TCTATTGACG CCCCGGAGCC GATCTTTGGT CTCGCGGTCA CAGGCGTAGT GCCGACCGAA
CGGGTGAAGA AAAACAGTAC CGCGCAGGCG GGATGCAAAC TCTTTCTGAC CAAACCGTTG
GGGATTGGCG TGTTGACCAC CGCCGAGAAA AAATCGCTGC TTAAACCTGA ACATCAGGGG
CTGGCGACGG AAGTCATGTG TCGGATGAAC GTTGCTGGCG CGGCGTTTGC CAATATCGAC
GGCGTAAAAG CTATGACTGA CGTTACCGGT TTTGGCCTGC TGGGGCACCT GAGCGAGATG
TGCCAGGGCG CAGGCGTGCA GGCGCTGCTT TGCTATCAGG ACATCCCTAA ACTGCCGGGC
GTGGAAGAGT ATATTGCTCT GGGCGCCGTA CCGGGCGGCA CAGAGCGCAA CTTCGCCAGC
TATGGTCATC TGATGGGCGA CATGTCGCGT GAAGTTCGTA GCCTGCTGTG CGATCCGCAA
ACGTCAGGCG GTCTGTTGTT GGCGGTAACG CCGGACGCCG AAGACGATGT TAAAGCCACC
GCGGCGGAAT TTGGTATCGA TCTGACCGCG ATTGGCGAAC TGGTCGAGGC CCGCGGCGGT
CGCGCTATGG TTGAGATTCG TTAA
 
Protein sequence
MSEQAIRLTQ YSHGAGCGCK ISPKVLETIL HSEQAKFVDP NLLVGNETRD DAAVYDLGNG 
TSIISTTDFF MPIVDNPFDF GRIAATNAIS DIFAMGGKPI MAIAILGWPI NTLSPDIARE
VTEGGRFACR QAGIALAGGH SIDAPEPIFG LAVTGVVPTE RVKKNSTAQA GCKLFLTKPL
GIGVLTTAEK KSLLKPEHQG LATEVMCRMN VAGAAFANID GVKAMTDVTG FGLLGHLSEM
CQGAGVQALL CYQDIPKLPG VEEYIALGAV PGGTERNFAS YGHLMGDMSR EVRSLLCDPQ
TSGGLLLAVT PDAEDDVKAT AAEFGIDLTA IGELVEARGG RAMVEIR