Gene Rsph17025_0760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0760 
Symbol 
ID5083524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp771065 
End bp772873 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content67% 
IMG OID640482318 
Productsulfoacetaldehyde acetyltransferase 
Protein accessionYP_001166971 
Protein GI146276812 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR03457] sulfoacetaldehyde acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGAC CGCGCGGCCG ACACATCCAT CAGCGGACTT CAACAATGGA AATGACGACG 
GAAGAGGCCT TCGTGAAGGT CTTGCAGCGA CACGGGATCA GACAAGCTTT CGGCATCATC
GGCTCTGCCA TGATGCCGAT CTCGGACCTG TTCCCGAAGG CGGGGATCAC CTTTTGGGAC
TGCGCCCATG AAGGCAATGC CGGGATCATG GCCGACGGTT TCACCCGCGC CACCGGGCGC
ATGGCGATGC TGGTGGCGCA GAACGGCCCC GGCGTCACCA ACTTCGTGAC GGCGGTCAAG
ACCGCCTACT GGAACCACAC GCCGCTTCTG CTGGTGACGC CGCAGGCCGC AAACCGGACC
ATCGGGCAGG GCGGCTTTCA GGAGGTCGAA CAGATGGCCG CCTTCCGCGA CATGGTGGCC
TGGCAGGAAG AAGTCCGCGA CCCCGCCCGC ATCGCCGAGG TGCTGAATCG TGTGATCCAG
AAGGCGCGCC GCGCCTCGGC ACCCGCGCAG ATCAACGTGC CGCGCGATCT GTGGACCCGG
CAGATTGACA TCGAGCTGCC CGAACCACTC GACATCGAGC CCTCTCCGGG CGGGGCGGCC
AGCATCGTCC GCGCGGCCGA TCTGCTGACC CGTGCGCGGT TTCCGGTGAT CCTGAACGGG
GCGGGCGTCG TTCTGTCGGG TGCGATCCCG GCCACGGTGC GGCTGGCCGA GCGGCTGGGC
GCGCCGGTCG CCTGCGGCTA TCAGCACAAT GACGCCTTCC CGCAGGCGCA TCCGCTGGCG
CTGGGGCCCC TGGGCTACAA TGGATCGAAG GCGGCAATGG AGGTGATTGC CCAGGCCGAT
GTGGTGCTGG CGCTGGGGAC GCGGCTCAAT CCGTTCTCGA CATTGCCGGC CTATGGGATC
GACTACTGGC CAAGGGATGC GGCGGTGATC CAGGTGGATC TGAACCCGGA CCGGATCGGG
TTGACGAAGC CCGTCGCGCT GGGGATCGCG GGGGATGCGG GGCAGGTGGC CGAAGCGATC
CTTTCTGCGC TCGGGCCCAT GGCGGGCGAC GAGGGGCGGG CGGATCGGCA GGCGCTGGTG
GCGCAGCGCC GCTCGGCCTG GGCGCAGGAA CTGGCTGGCC TTGATCACGA AGAGGATGAT
CCGGGCACTG ACTGGAATGC CCGCGCCCGG GCGCGCGAGC CCGCGAAGAT CAGCCCCCGG
ATGGCCTGGC GCGCGATCAC GGCGGCGCTG CCGCGCGAGG CGATCATCTC GACCGACATC
GGCAACAACT GCGCCATCGG CAATGCCTAT CCCGGCTTCG ATGAGGGCCG TCGCTACCTT
GCCCCCGGGC TTTTCGGCCC TTGCGGCTAT GGGCTGCCCG CGATCATCGG CGCGCGGATC
GGCCGGCCCG ACCTGCCGGC CGTGGGATTT GCGGGCGATG GCGCCTTCGG GATCTCGATG
AACGAGATGG TCTCGCTTGG CCGCCCCGGA TGGCCCGGGA TTACCATGGT GATCTTCCGC
AACTACCAGT GGGGTGCCGA GAAGCGGAAC ACGACGCTCT GGTATGCTGA CAATTTCGTC
GGCACCGAGT TGAACGACAA GGTAAGCTAT GCAGGGATCG CCCGCGCCTG CGGTCTCGAG
GGCGTGCAGG TCTGGACGAT GGAGGAGTTG ACCGAAACCC TTCGCCTTGC CGTCGCTGCG
CAGAAGGACG GGGTCACGAC ATTCCTCGAG GTGATGCTGA ACCAGGAACT CGGCGAGCCC
TTCCGCCGTG ACGCGATGAC GAAGCCGGTG GTGCTGGCCG GGATCGACAG GGCGGATCTG
CGGGCGTGA
 
Protein sequence
MGGPRGRHIH QRTSTMEMTT EEAFVKVLQR HGIRQAFGII GSAMMPISDL FPKAGITFWD 
CAHEGNAGIM ADGFTRATGR MAMLVAQNGP GVTNFVTAVK TAYWNHTPLL LVTPQAANRT
IGQGGFQEVE QMAAFRDMVA WQEEVRDPAR IAEVLNRVIQ KARRASAPAQ INVPRDLWTR
QIDIELPEPL DIEPSPGGAA SIVRAADLLT RARFPVILNG AGVVLSGAIP ATVRLAERLG
APVACGYQHN DAFPQAHPLA LGPLGYNGSK AAMEVIAQAD VVLALGTRLN PFSTLPAYGI
DYWPRDAAVI QVDLNPDRIG LTKPVALGIA GDAGQVAEAI LSALGPMAGD EGRADRQALV
AQRRSAWAQE LAGLDHEEDD PGTDWNARAR AREPAKISPR MAWRAITAAL PREAIISTDI
GNNCAIGNAY PGFDEGRRYL APGLFGPCGY GLPAIIGARI GRPDLPAVGF AGDGAFGISM
NEMVSLGRPG WPGITMVIFR NYQWGAEKRN TTLWYADNFV GTELNDKVSY AGIARACGLE
GVQVWTMEEL TETLRLAVAA QKDGVTTFLE VMLNQELGEP FRRDAMTKPV VLAGIDRADL
RA