Gene SeHA_C3675 details

Gene Information

Locus tag: SeHA_C3675
Symbol:
ID: 6490986
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3562520
End bp: 3563524
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 51%
IMG OID: 642743793
Product: putative sulfite oxidase subunit YedY
Protein accession: YP_002047405
Protein GI: 194450363
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0647126
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value: 0.489877
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA TACGTCCATT AACAGAAGCC GATGTGACTG CGGAATCGGC TTTTTTTATG 
CAGCGCCGAC AGGTGCTAAA AGCATTAGGC ATCAGCGCGG CCGCCTTATC CTTACCCTCA
ACGGCGCAGG CCGATCTCTT CAGTTGGTTT AAAGGCAACG ATCGTCCGAA AGCGCCTGCC
GGTAAACCGC TTGAGTTTAG TCAGCCTGCC GCCTGGCGAA GCGATTTAGC GTTAACGCCG
GAAGATAAAG TGACGGGCTA CAACAATTTC TATGAGTTTG GCCTTGATAA AGCCGACCCG
GCGGCCAATG CCGGAAGTCT GAAAACCGAA CCGTGGACGT TGAAAATCAG CGGGGAAGTC
GCGAAGCCAT TTACGCTGGA TTACGACGAT TTAACACATC GTTTCCCATT AGAAGAGCGT
ATCTATCGAA TGCGCTGCGT CGAAGCGTGG TCCATGGTCG TGCCGTGGAT TGGTTTCCCT
TTATATAAGC TACTCGCGCA GGCACAGCCC ACCAGCCACG CTAAATATGT GGCATTCGAA
ACGCTATACG CGCCGGATGA TATGCCAGGA CAGAAAGATC GCTTTATTGG CGGCGGACTG
AAATACCCTT ATGTCGAAGG GCTACGTCTG GACGAAGCCA TGCATCCGCT GACTCTGATG
ACCGTTGGCG TCTATGGTAA GGCGTTACCC CCGCAAAACG GCGCGCCTAT TCGACTCATC
GTTCCATGGA AGTATGGTTT TAAAGGTATT AAATCTATTG TCAGCATTAA ACTCACCCGC
GAACGTCCGC CAACCACCTG GAATTTGTCG GCTCCCAACG AATATGGTTT TTATGCCAAT
GTGAACCCGC ATGTGGATCA TCCACGCTGG TCTCAGGCTA CCGAACGCTT TATTGGTTCA
GGCGGTATCC TCGATGTACA AAGACAGCCG ACGCTGCTGT TTAACGGCTA CGCCAATGAA
GTCGCTTCGC TGTATCGCGG TCTCAATTTG CGGGAGAATT TTTAA
 
Protein sequence
MKKIRPLTEA DVTAESAFFM QRRQVLKALG ISAAALSLPS TAQADLFSWF KGNDRPKAPA 
GKPLEFSQPA AWRSDLALTP EDKVTGYNNF YEFGLDKADP AANAGSLKTE PWTLKISGEV
AKPFTLDYDD LTHRFPLEER IYRMRCVEAW SMVVPWIGFP LYKLLAQAQP TSHAKYVAFE
TLYAPDDMPG QKDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPIRLI
VPWKYGFKGI KSIVSIKLTR ERPPTTWNLS APNEYGFYAN VNPHVDHPRW SQATERFIGS
GGILDVQRQP TLLFNGYANE VASLYRGLNL RENF
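
The listed lengths are consistent with each other: 1005 bp is 335 codons, and dropping the terminal stop codon (TAA in the gene sequence above) leaves 334 aa. The following is a minimal sketch of how the gene length, protein length, and GC content fields could be re-derived from a sequence pasted in the ten-residue block layout used on this page. The function names are hypothetical and are not part of any database tool. The demo input is a toy sequence, not this gene.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(dna: str) -> int:
    """Codon count minus one, assuming the CDS ends with a single stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return len(dna) // 3 - 1


# Toy CDS (start codon, one sense codon, stop codon), for illustration only.
toy = "ATG GCC TAA"
print(len(clean(toy)), expected_protein_length(toy), round(gc_percent(toy), 1))
```

Passing the full gene sequence above to these functions should give values comparable to the Gene Length, Protein Length, and GC content fields, provided the stop-codon assumption holds.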