Gene SeD_A3737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3737 
Symbol 
ID6873535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3588921 
End bp3589925 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content51% 
IMG OID642786712 
Productputative sulfite oxidase subunit YedY 
Protein accessionYP_002217346 
Protein GI198245414 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.256044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.0204641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TACGTCCATT AACAGAAGCC GATGTGACTG CGGAATCGGC TTTTTTTATG 
CAGCGCCGAC AGGTGCTAAA AGCATTAGGC ATCAGCGCGG CCGCCTTATC CTTACCCTCA
ACGGCGCAGG CCGATCTCTT CAGTTGGTTT AAAGGCAACG ATCGTCCGAA AGCGCCTGCC
GGTAAACCGC TTGAGTTTAG TCAGCCTGCC GCCTGGCGAA GCGATTTAGC GTTAACGCCG
GAAGATAAGG TGACGGGCTA CAACAATTTC TATGAGTTTG GCCTTGATAA AGCCGACCCG
GCGGCCAATG CCGGAAGTCT GAAAACCGAA CCGTGGACGT TGAAAATCAG CGGGGAAGTC
GCGAAGCCAT TTACGCTGGA TTATGACGAT TTAACACATC GTTTCCCATT AGAAGAGCGT
ATCTATCGAA TGCGCTGCGT CGAAGCGTGG TCCATGGTCG TGCCGTGGAT TGGTTTCCCT
TTATATAAGC TACTCGCGCA GGCACAGCCC ACCAGCCACG CTAAATATGT GGCATTCGAA
ACGCTATACG CGCCGGATGA TATGCCAGGA CAGAAAGATC GTTTTATTGG CGGCGGACTG
AAATACCCTT ATGTCGAAGG GCTACGTCTG GACGAAGCCA TGCATCCGCT GACTCTGATG
ACCGTTGGCG TCTATGGTAA GGCGTTACCC CCGCAAAACG GCGCGCCCAT TCGACTCATC
GTTCCATGGA AGTATGGTTT TAAAGGTATT AAATCTATTG TCAGCATTAA ACTCACCCGC
GAACGTCCGC TAACCACCTG GAATTTGTCG GCTCCCAACG AATATGGTTT TTACGCCAAT
GTGAACCCGC ATGTGGATCA TCCACGCTGG TCTCAGGCTA CCGAACGCTT TATTGGTTCA
GGCGGTATCC TCGATGTGCA AAGGCAGCCG ACGCTGCTGT TTAACGGCTA TGCCAATGAA
GTCGCTTCGC TGTATCGCGG TCTCAATTTG CGGGAGAATT TTTAA
 
Protein sequence
MKKIRPLTEA DVTAESAFFM QRRQVLKALG ISAAALSLPS TAQADLFSWF KGNDRPKAPA 
GKPLEFSQPA AWRSDLALTP EDKVTGYNNF YEFGLDKADP AANAGSLKTE PWTLKISGEV
AKPFTLDYDD LTHRFPLEER IYRMRCVEAW SMVVPWIGFP LYKLLAQAQP TSHAKYVAFE
TLYAPDDMPG QKDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPIRLI
VPWKYGFKGI KSIVSIKLTR ERPLTTWNLS APNEYGFYAN VNPHVDHPRW SQATERFIGS
GGILDVQRQP TLLFNGYANE VASLYRGLNL RENF