Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_0760 |
Symbol | |
ID | 5083524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 771065 |
End bp | 772873 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640482318 |
Product | sulfoacetaldehyde acetyltransferase |
Protein accession | YP_001166971 |
Protein GI | 146276812 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR03457] sulfoacetaldehyde acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGAC CGCGCGGCCG ACACATCCAT CAGCGGACTT CAACAATGGA AATGACGACG GAAGAGGCCT TCGTGAAGGT CTTGCAGCGA CACGGGATCA GACAAGCTTT CGGCATCATC GGCTCTGCCA TGATGCCGAT CTCGGACCTG TTCCCGAAGG CGGGGATCAC CTTTTGGGAC TGCGCCCATG AAGGCAATGC CGGGATCATG GCCGACGGTT TCACCCGCGC CACCGGGCGC ATGGCGATGC TGGTGGCGCA GAACGGCCCC GGCGTCACCA ACTTCGTGAC GGCGGTCAAG ACCGCCTACT GGAACCACAC GCCGCTTCTG CTGGTGACGC CGCAGGCCGC AAACCGGACC ATCGGGCAGG GCGGCTTTCA GGAGGTCGAA CAGATGGCCG CCTTCCGCGA CATGGTGGCC TGGCAGGAAG AAGTCCGCGA CCCCGCCCGC ATCGCCGAGG TGCTGAATCG TGTGATCCAG AAGGCGCGCC GCGCCTCGGC ACCCGCGCAG ATCAACGTGC CGCGCGATCT GTGGACCCGG CAGATTGACA TCGAGCTGCC CGAACCACTC GACATCGAGC CCTCTCCGGG CGGGGCGGCC AGCATCGTCC GCGCGGCCGA TCTGCTGACC CGTGCGCGGT TTCCGGTGAT CCTGAACGGG GCGGGCGTCG TTCTGTCGGG TGCGATCCCG GCCACGGTGC GGCTGGCCGA GCGGCTGGGC GCGCCGGTCG CCTGCGGCTA TCAGCACAAT GACGCCTTCC CGCAGGCGCA TCCGCTGGCG CTGGGGCCCC TGGGCTACAA TGGATCGAAG GCGGCAATGG AGGTGATTGC CCAGGCCGAT GTGGTGCTGG CGCTGGGGAC GCGGCTCAAT CCGTTCTCGA CATTGCCGGC CTATGGGATC GACTACTGGC CAAGGGATGC GGCGGTGATC CAGGTGGATC TGAACCCGGA CCGGATCGGG TTGACGAAGC CCGTCGCGCT GGGGATCGCG GGGGATGCGG GGCAGGTGGC CGAAGCGATC CTTTCTGCGC TCGGGCCCAT GGCGGGCGAC GAGGGGCGGG CGGATCGGCA GGCGCTGGTG GCGCAGCGCC GCTCGGCCTG GGCGCAGGAA CTGGCTGGCC TTGATCACGA AGAGGATGAT CCGGGCACTG ACTGGAATGC CCGCGCCCGG GCGCGCGAGC CCGCGAAGAT CAGCCCCCGG ATGGCCTGGC GCGCGATCAC GGCGGCGCTG CCGCGCGAGG CGATCATCTC GACCGACATC GGCAACAACT GCGCCATCGG CAATGCCTAT CCCGGCTTCG ATGAGGGCCG TCGCTACCTT GCCCCCGGGC TTTTCGGCCC TTGCGGCTAT GGGCTGCCCG CGATCATCGG CGCGCGGATC GGCCGGCCCG ACCTGCCGGC CGTGGGATTT GCGGGCGATG GCGCCTTCGG GATCTCGATG AACGAGATGG TCTCGCTTGG CCGCCCCGGA TGGCCCGGGA TTACCATGGT GATCTTCCGC AACTACCAGT GGGGTGCCGA GAAGCGGAAC ACGACGCTCT GGTATGCTGA CAATTTCGTC GGCACCGAGT TGAACGACAA GGTAAGCTAT GCAGGGATCG CCCGCGCCTG CGGTCTCGAG GGCGTGCAGG TCTGGACGAT GGAGGAGTTG ACCGAAACCC TTCGCCTTGC CGTCGCTGCG CAGAAGGACG GGGTCACGAC ATTCCTCGAG GTGATGCTGA ACCAGGAACT CGGCGAGCCC TTCCGCCGTG ACGCGATGAC GAAGCCGGTG GTGCTGGCCG GGATCGACAG GGCGGATCTG CGGGCGTGA
|
Protein sequence | MGGPRGRHIH QRTSTMEMTT EEAFVKVLQR HGIRQAFGII GSAMMPISDL FPKAGITFWD CAHEGNAGIM ADGFTRATGR MAMLVAQNGP GVTNFVTAVK TAYWNHTPLL LVTPQAANRT IGQGGFQEVE QMAAFRDMVA WQEEVRDPAR IAEVLNRVIQ KARRASAPAQ INVPRDLWTR QIDIELPEPL DIEPSPGGAA SIVRAADLLT RARFPVILNG AGVVLSGAIP ATVRLAERLG APVACGYQHN DAFPQAHPLA LGPLGYNGSK AAMEVIAQAD VVLALGTRLN PFSTLPAYGI DYWPRDAAVI QVDLNPDRIG LTKPVALGIA GDAGQVAEAI LSALGPMAGD EGRADRQALV AQRRSAWAQE LAGLDHEEDD PGTDWNARAR AREPAKISPR MAWRAITAAL PREAIISTDI GNNCAIGNAY PGFDEGRRYL APGLFGPCGY GLPAIIGARI GRPDLPAVGF AGDGAFGISM NEMVSLGRPG WPGITMVIFR NYQWGAEKRN TTLWYADNFV GTELNDKVSY AGIARACGLE GVQVWTMEEL TETLRLAVAA QKDGVTTFLE VMLNQELGEP FRRDAMTKPV VLAGIDRADL RA
|
| |