Gene Information |
Locus tag | Sare_2067 |
Symbol | |
ID | 5703278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2366876 |
End bp | 2368582 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641271553 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001536924 |
Protein GI | 159037671 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
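The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed values, not something the record states), and that the CDS is complete with a single terminal stop codon. The function names are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above.
START_BP = 2366876
END_BP = 2368582


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1707, matching "Gene Length"
print(protein_length(length_bp))  # 568, matching "Protein Length"
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate reading.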
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.246741 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGGCC AGGTACGCGT TGTGGATCGC ATCGCTGCGA CACTGGCCCG GTTGGGTGTC CGCCACGTCT TCGGCGTCAG CGGCGCCAAC ATCGAGGACC TGTACGACGC GCTGCGCGGC ACCGACGGTG CGACCTGCGG TGTAGTGGCC AAACACGAGT TCTCTGCGGC CACCATGGCA GACGGCTCAG CCCGCGTCAC CGGTCGCTTC GGCGTGGTGT CGACGACCTC CGGCGGCGCC GCGATGAATC TGGTGCCTGG GCTGGCGGAG GCGTATGCCT CGCGGGTGCC GATGTTGGCC CTGGTCGGCC AGCCACCCAC GGCCCAGGAG GGACGTGGGG CGTTCCAGGA AACCAGCGGT CTGGCGGGCT CGTTCGACGC GATGGCGGTA CTCGACCCTG TCTCCCGGTT CTGTGCCCGG GTGGAGGATC CGGCTCGTAT CGACGCTGCC CTCACCGCAG CGATCTCCGC GGCACACCAG GACCCGAAGG GGCCGGCGGT GCTGCTGCTA CCCAAGGATG TGCAGCAGGC ACTGGTCGAC GACTCCCCGA GTCGTGTCCT CACCGTCGCG GCGCCGGCCA GCCCGGCACC GACGCCGGCC CTGGACCAGG CGGCAGCTCT GCTCCGCGAG GCACGGCAGG CTCTGGTCAT CGCCGGTGCG GGTGTGGCCT CGTCGGGTGG CCGACGGGAA CTGGATCGTC TGGTGGGGCG CCTCGGCGCA TGGGTGGCGG CCACCCCAGA CGCCAGGGAT GCCTTCGACA ACCGTCATCC GGCATTCGCT GGTGTCGCCG GGGTGATGGG ACACGACGCC GTCGGCGAGC TACTGCAACG AGCCGACCTG TGCCTGCTGG CGGGCACCCG GCTGCCCGCC CTGGCGCGCA ACGGACTGGA AGAGGCGCTG GCGGGGATGC CGGTGATCTG CGTCGACCCC GAGCCACCGC ACATCCCGGG CCTCGCACTG ATGGGTTGCC CGCAAGCCAC ACTGCGCGCC TTGTCGCTGC GTTTGGGCGC TCACCGACGG TCATGTCCGC CTCACCCTGG GCCTGTCCTG CTCTCGGGCG GTACGTCGTC CGGGGAGACG CTGCGGGCAT ACGCCGATGC CCTCCACGTG ATCTCCACCG TCCTGGCGCC CGACGTTCAC GTCTTCGTCG ATGCCGGCAA TGCGGCTGCG GCCGCGATCC ATGCGCTGTC GCCTGCCCCT CGAGGACGTT TCGTCGTGGC GCTGGGAATG GGTGGTATGG GCTACACCTT CGGAGCGGGG ATCGGCGCGG CGCTGGCCAC TGGACGGCGT ACGTACGTCC TGGCAGGAGA TGGTGCGTTC TATGCACACG GCACCGAGGT GCACACCGCA CTGGAAGCCG CCGCCCCCGT CACCTTCGTG ATCTTCAACA ACAACGCGCA CGCCATGTGC GTCACCCGCG AGGACCTGTT CCAAGGCGGC GCCAGCGGCG TCAACGCCTT CCGGCCGTCG GACATCGCCG CCGGTGTATC CGCGATGTTT CCGGGTCTTC GGGCAACCCG CGCCAGCACC GCGCCCCAGT TGCGTGCGGC CCTGCTGGCG GGTCAGGCCG GTGGCGGTCC GGCTCTCGTG GCCATGGACT TCGATCCCGC TGAACTACCG CCGTTTCGTC CGTTCCTGGC CGCGGGCCAG GCACCCGTCA ACAACCAGGA GGGTGATCAT GACGACCGCG CCGTCCACGT TGGCTGA
|
Protein sequence | MTGQVRVVDR IAATLARLGV RHVFGVSGAN IEDLYDALRG TDGATCGVVA KHEFSAATMA DGSARVTGRF GVVSTTSGGA AMNLVPGLAE AYASRVPMLA LVGQPPTAQE GRGAFQETSG LAGSFDAMAV LDPVSRFCAR VEDPARIDAA LTAAISAAHQ DPKGPAVLLL PKDVQQALVD DSPSRVLTVA APASPAPTPA LDQAAALLRE ARQALVIAGA GVASSGGRRE LDRLVGRLGA WVAATPDARD AFDNRHPAFA GVAGVMGHDA VGELLQRADL CLLAGTRLPA LARNGLEEAL AGMPVICVDP EPPHIPGLAL MGCPQATLRA LSLRLGAHRR SCPPHPGPVL LSGGTSSGET LRAYADALHV ISTVLAPDVH VFVDAGNAAA AAIHALSPAP RGRFVVALGM GGMGYTFGAG IGAALATGRR TYVLAGDGAF YAHGTEVHTA LEAAAPVTFV IFNNNAHAMC VTREDLFQGG ASGVNAFRPS DIAAGVSAMF PGLRATRAST APQLRAALLA GQAGGGPALV AMDFDPAELP PFRPFLAAGQ APVNNQEGDH DDRAVHVG
|
| |
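The gene sequence begins with GTG while the protein sequence begins with M. That is expected under translation table 11, where GTG can serve as an initiation codon and is read as methionine at the first position. The sketch below illustrates this on the first and last codons of the sequence. It is a simplified stand-in for a full table 11 implementation: the amino acid assignments are those of the standard code (which table 11 shares), and only ATG, GTG, and TTG are treated as start codons, a subset of the initiators table 11 allows. The function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a simplified subset of the table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGACGGGCC AGGTACGCGT TGTGGATCGC"))  # MTGQVRVVDR
# Last six codons of the gene sequence, ending in the TGA stop.
print(translate_cds("GCCGTCCACGTTGGCTGA"))  # AVHVG
```

The two outputs match the first ten and last five residues of the protein sequence listed above.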