Gene Spro_4414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4414 
Symbol 
ID5605893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4888615 
End bp4889616 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content57% 
IMG OID640939976 
Productputative sulfite oxidase subunit YedY 
Protein accessionYP_001480636 
Protein GI157372647 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0244066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAC AACGAAAACT CACTGAGGCC GATGTCACGC CAGAGAGCGT ATTTTATCAG 
CGCCGTAAAG TGTTGCAGGC GTTGGGCATT ACCGCCGCAT CACTGGCCCT GCCGCATAAT
GCGCAGGCCG ATTTGCTGTC ATGGTTTAAG GGTAACGATC GGCCCAAGGC ACCGCCGGGT
AAACCGCTGG AGTTCAGCAA ACCTGCCGCC TGGCAGGCCC AGTTGGATTT GACGCCCGAA
GATAAAGTCA CCGGCTATAA CAACTTCTAC GAATTCGGTC TGGACAAGGC CGATCCGGCA
GCCAATGCCG GCGGCTTGAA AACCGAAGGC TGGCAGGTAC GCATCGACGG TGAAGTCGCC
AAACCCATCA CGCTGGACAT AGATGATTTA ATCAAACGCT TCCCGCTGGA ACAGCGCATC
TATCGCATGC GCTGCGTTGA AGCCTGGTCA ATGGTGGTGC CGTGGATTGG CTTTGAATTG
GGTAAACTGA TCAAATTCGC GGAACCCAAC AGCAACGCAC GCTACGTCGC TTTCCAGACG
TTGTACGACC CGGAACAGAT GCCCGGCCAG AAAGACCGCT TTATCGGCGG CGGGTTGAAG
TATCCCTATG TCGAAGGGCT GCGTCTCGAC GAGGCGATGA ACCCGCTGGC ACTGCTGACC
GTCGGCGTGT ACGGCAAAAC GCTGCCGCCG CAAAATGGCG CGCCGCTGCG CTTGATCACC
CCATGGAAAT ACGGTTTTAA GGGGATAAAG TCGATCGTCC ATATCCGCCT GGTGCGCGAT
CAGCCGCCGA CCACCTGGAA TCAGTCGGCG CCGAATGAAT ACGGCTTCTA CGCCAACGTG
AATCCGCACG TCGATCATCC CCGTTGGTCG CAGGCCACCG AGCGTTTTAT CGGTTCCGGC
GGCATTCTGG ACGTTAAACG CCAACCCACC CTGCTGTTTA ATGGCTATGC GGAACAGGTC
GCATCGCTGT ACCGTGGCCT GGATCTACGG GAGAATTTCT AA
 
Protein sequence
MSKQRKLTEA DVTPESVFYQ RRKVLQALGI TAASLALPHN AQADLLSWFK GNDRPKAPPG 
KPLEFSKPAA WQAQLDLTPE DKVTGYNNFY EFGLDKADPA ANAGGLKTEG WQVRIDGEVA
KPITLDIDDL IKRFPLEQRI YRMRCVEAWS MVVPWIGFEL GKLIKFAEPN SNARYVAFQT
LYDPEQMPGQ KDRFIGGGLK YPYVEGLRLD EAMNPLALLT VGVYGKTLPP QNGAPLRLIT
PWKYGFKGIK SIVHIRLVRD QPPTTWNQSA PNEYGFYANV NPHVDHPRWS QATERFIGSG
GILDVKRQPT LLFNGYAEQV ASLYRGLDLR ENF