Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_4214 |
Symbol | |
ID | 7115341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 4441041 |
End bp | 4442816 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643526929 |
Product | sulfoacetaldehyde acetyltransferase |
Protein accession | YP_002422935 |
Protein GI | 218532119 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR03457] sulfoacetaldehyde acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.169445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATGA CGACCGAGGA GGCCTTCATC AAAGTCCTTC AGATGCACGG CATCGAGCAC GCCTTCGGCA TCATCGGCTC GGCGATGATG CCGGTCTCCG ACCTGTTTCC CAAGGCCGGC ATCACCTTCT GGGACTGCGC GCACGAGAGC AACGCGGCGC TGATCGCCGA CGGCTATTCG CGGGTGACCG GCAAGATGGC GATGGCGGTG GCCCAGAACG GGCCGGGCGT GACCGGCTTC GTCACGGCGA TCAAGACCGC CTACTGGAAC CACACGCCGA TGCTGCTGGT GACGCCGCAG GCGGCCAACA AGACGATCGG CCAGGGCGGC TTCCAAGAGG TCGAGCAGAT GGCCCTGTTC GAGGACATGG TCTGCTATCA GGAGGAGGTG CGCGACCCGA GCCGCGTCGC CGAGGTGCTC AACCGGGTGA TCGAGAAGGC GTGGCGGGGC TGCGCACCGG CGCAGATCAA CGTCCCGCGC GATTTCTGGA CGCATCAAGT CGACACTGCG CTCCCGCAGA TCGTCCGGTT GGAGCGGCCG ACAGGCGGGC GCCAAGCCAT CGCCGAGGCC GCGACGCTGC TGTCGAAGGC GCAATTCCCG GTCATCCTGA ACGGCGCGGG CGTCGTGATC GGCGATGCGA TCCAGGAATC CGTCGCGCTG GCTGAGCGGC TCGATGCGCC GGTGGCCTGC GGCTACCAGC ACAACGATTC CTTCCCCGGC TCGCATCCGT TGGCCGTCGG CCCGCTCGGC TACAACGGCT CGAAGGCGGC GATGGAGCTG ATCGCCAAGG CCGACGTGGT GCTGGCGCTC GGCACTCGCC TCAACCCGTT CTCGACCCTG CCGGGCTACG GCATCGATTA CTGGCCGAAA GACGCCAAGA TCATTCAGGT CGACATCAAC CCCGACCGGA TCGGCCTGAC AAAGCCGATC TCCGTGGGCA TCTGCGGCGA CGCCAAGCAC GTCGCGCAGG AATTGCTGCA ACAGCTGACG CCGGATGCCG GCAATGCCGG CCGCGAGGAG CGCCAGGCGC TCATCCACCG CACCAAATCC GCCTGGGCCC AGGCGCTGAC CTCGCTCGAC CACGAGGACG ACGACGAGGG CACGCGCTGG AACGTCGAGG CGCGGGAGCG CGAAAGCGAG AAGATGTCGC CGCGCCAAGC GTGGCGAGCG ATCCAGGCGG GCCTGCCCGC CGACGCGATC CTCTCGACCG ACATCGGCAA CAACTGCGCC ATCGGCAACG CCTACCCGAC CTTCGACCAG GGCCGGCGAT ACCTCGCGCC GGGCATGTTT GGCCCCTGCG GCTACGGCTT CCCGGCGATC ATCGGCGCCA AGATCGGCTG CCCGGACGTG CCGGTCTTCG GTTTCGCGGG CGACGGCGCC TTCGGCATCT CGATGAACGA GATGACCTCG ATCGGCCGCG ACGAGTGGCC GGCGATCACC ATGGTGATCT TCCGCAATTT CCAGTGGGGC GCCGAGAAGC GCAACTCAAT CCTCTGGTAC GACAACAACT TCGTCGGCAC CGAGCTCGAC AGCAAGCTGA GCTACGCCAA GATCGCTGAA GGGTGCGGCT TCCGCGGCGT GCGTGTCGAG ACCCAGCGGG CGTTGACGGA AGCGATCCAG CAGGCGGGCG AGGCGCAAGG CGAGGGCGTC ACGACCTTCA TCGAGGTGAT GCTCAACCAG GAACTCGGCG AGCCCTTCCG TCGCGACGCG ATGAAGAAGC CGGTCGTCGT CGCCGGCATC GACCGCGCCG ACATGCGGCC GCAGCGCGTG GCCTGA
|
Protein sequence | MRMTTEEAFI KVLQMHGIEH AFGIIGSAMM PVSDLFPKAG ITFWDCAHES NAALIADGYS RVTGKMAMAV AQNGPGVTGF VTAIKTAYWN HTPMLLVTPQ AANKTIGQGG FQEVEQMALF EDMVCYQEEV RDPSRVAEVL NRVIEKAWRG CAPAQINVPR DFWTHQVDTA LPQIVRLERP TGGRQAIAEA ATLLSKAQFP VILNGAGVVI GDAIQESVAL AERLDAPVAC GYQHNDSFPG SHPLAVGPLG YNGSKAAMEL IAKADVVLAL GTRLNPFSTL PGYGIDYWPK DAKIIQVDIN PDRIGLTKPI SVGICGDAKH VAQELLQQLT PDAGNAGREE RQALIHRTKS AWAQALTSLD HEDDDEGTRW NVEARERESE KMSPRQAWRA IQAGLPADAI LSTDIGNNCA IGNAYPTFDQ GRRYLAPGMF GPCGYGFPAI IGAKIGCPDV PVFGFAGDGA FGISMNEMTS IGRDEWPAIT MVIFRNFQWG AEKRNSILWY DNNFVGTELD SKLSYAKIAE GCGFRGVRVE TQRALTEAIQ QAGEAQGEGV TTFIEVMLNQ ELGEPFRRDA MKKPVVVAGI DRADMRPQRV A
|
| |