Gene Bphyt_3608 details


Gene Information

Locus tag: Bphyt_3608
Symbol:
ID: 6284163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010681
Strand:
Start bp: 4050358
End bp: 4051353
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 60%
IMG OID: 642623188
Product: putative sulfite oxidase subunit YedY
Protein accession: YP_001897222
Protein GI: 187925580
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
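
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 4050358
end_bp = 4051353

# Inclusive coordinates: both endpoints belong to the gene.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be a stop codon
# and therefore contributes no residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 996 bp
print(protein_length, "aa")   # 331 aa
```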


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0264425
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000362399
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTGGATCA AACGTAGCGA CAGAATTCTA CTCAGCGGCG ACGACATTGC GCGCAGCGAA 
ATCACGCCGC AACATGTTTT TCAGAACCGG CGGCGCGTGT TGCAGGCGGC CGGCGCGGCG
GCGCTCGGCA GTCTGATCGG GGTGAATGGC GAGGCGCTGG CGGCTTACAC GTCGCCGGAT
CCGAAGGCGC AGAAGCTGGC GGCGAAGACC AACGCCAAGT TCGTCGCGCT CGACAAAATC
ACGCCTTACA AGGACATCAC CACGTACAAC AACTTCTACG AGTTCGGCAC CGACAAAGCC
GATCCCGCGC ATAACGCCGG GACGCTGCGG CCGCATCCGT GGAAGGTGAG CGTCGAGGGT
GAGATCAAGA ATCCCAAGGT CTACGATATC GACGAATTGC TCAAGCTCGC GCCGCTCGAA
GAGCGCGTGT ACAGAATGCG CTGCGTCGAA GGCTGGTCGA TGGTGATTCC GTGGATCGGC
GTGCCGCTCG CGGAATTGAT CAAGCGCGTG CAACCGACGG GCAACGCAAA GTACGTACAG
TTCATCACGC TGGCCGATCC GTCGCAAATG CCCGGACTGT CGACGCCCGT ACTCGATTGG
CCATACTCCG AAGGGCTGCG CATGGACGAA GCGATGAATC CGCTGACGTT GCTGACGATG
GGCCTCTACG GCCAGGTGTT GCCTAATCAG AACGGCGCGC CGGTGCGCGT CGTGGTGCCG
TGGAAATACG GCTTCAAGAG CGCGAAGTCG CTGGTGAAGA TCCGCTTCCT CGACAAGCAG
CCGCCGACCA GTTGGAATAC GTATGCATCG AACGAATACG GGTTTTACTC GAACGTGAAT
CCGAACGTCG ATCATCCGCG CTGGAGTCAG GCGACGGAGC GTCGCATCGG CGAAGATGGT
TTCTTCACGC CCAAGCGCAA GACGTTGATG TTCAACGGCT ACGGCGAACA GGTCGCATCG
CTCTATCAGG GCATGGACCT GAAGAAGAAT TTCTGA
 
Protein sequence
MWIKRSDRIL LSGDDIARSE ITPQHVFQNR RRVLQAAGAA ALGSLIGVNG EALAAYTSPD 
PKAQKLAAKT NAKFVALDKI TPYKDITTYN NFYEFGTDKA DPAHNAGTLR PHPWKVSVEG
EIKNPKVYDI DELLKLAPLE ERVYRMRCVE GWSMVIPWIG VPLAELIKRV QPTGNAKYVQ
FITLADPSQM PGLSTPVLDW PYSEGLRMDE AMNPLTLLTM GLYGQVLPNQ NGAPVRVVVP
WKYGFKSAKS LVKIRFLDKQ PPTSWNTYAS NEYGFYSNVN PNVDHPRWSQ ATERRIGEDG
FFTPKRKTLM FNGYGEQVAS LYQGMDLKKN F
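
The protein sequence and the GC content in this record can be checked against the gene sequence. The sketch below is one way to do that in plain Python, under these assumptions: the function names `clean`, `gc_percent`, and `translate` are hypothetical helpers, the sequence is pasted in with its spaces and line breaks intact, and the amino-acid assignments of translation table 11 match the standard code (the tables differ in which codons may act as starts, which this sketch does not handle).

```python
# Amino-acid assignments in TCAG codon order. Assumed identical for NCBI
# translation tables 1 and 11; alternative start codons are not handled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGTGGATCA AACGTAGCGA CAGAATTCTA"
print(translate(first_blocks))  # MWIKRSDRIL
```

Passing the full gene sequence to `translate` and `gc_percent` should allow a comparison with the protein sequence and the GC content field listed in the record.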