Gene Information |
Locus tag | Sare_2955 |
Symbol | |
ID | 5707809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3351680 |
End bp | 3353377 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641272404 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001537772 |
Protein GI | 159038519 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
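The coordinate and length fields in the table above can be cross-checked against one another. The sketch below is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 3351680
END_BP = 3353377
GENE_LENGTH_BP = 1698
PROTEIN_LENGTH_AA = 565


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 3353377 - 3351680 + 1 = 1698 bp, and 1698 / 3 - 1 = 565 aa.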
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.836437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0528961 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGGCA TCCCGGGTCA CCCGCCCGGC CAACCGCGCA CGGCGGCCAC CACCCTCGTG GCCGCCCTGC TCGGACACGA CGTCGACCGG GTGTTCTGCG TGGCCGGCGA GAGCTACCTG GCGGTCCTCG ACGCCCTGTA CGACACCCCG ACCGTCGAGG TGGTGACCTG CCGGCACGAG GCGTCGGCGG CGTTCGCCGC CGTCGCCGAC GCCAAGCTGA CTGGTCGGGC GGGTGTCTGC CTGGTCAGTC GCGGCCCCGG GGCCACCAAC GCAGGCATCG CCGTGCACTC GGCCGCCCAG GACGCCACCC CGCTGGTCCT GCTCGTCGGC CACGTGCCGC GGTCCGAGAT CGGCACCGAC GCGTTTCAGG AGATCGACCC GCGCGCCTTC TCCGGTCTGG CCAAACAGGT ACTGGTGCTG CTGGATCCGG CACGCACCGG CGAGTTCGTC GCCCGGGCCT TCCGGGTCGC CGAGGCCGGT ACCCGCGGGC CGGTGGTGCT GGTTCTTCCC GAGGATGTCC TGGCCATGTC AGATCCGGTC ACGCCAGTGC CCGCCCGCTG GGCGGCAGCC GCGCCGGTGG CCGCCGCCGA GGATCTACAG GCGGTGCGGG CGCTGCTGGC ACGGTCACGA CGACCGTTGC TCGTGGCGGG CGGCGACCTC TCCGGCGACC GGGGCCGGTG TCTGCTGCGC GAGGTGGCGC ACCGACACCG GTTTCCGGTG GTGACCAGCA ACAAACGGCA GGACCTCCTC GACAACCGGG ACTCCTGCTA CGCCGGCCAT CTGCACAACA ACACCCAGGA GAGGCAGATC GCGGCACTGG ATCGGGCGGA TCTCGTCCTG GCGGTCGGAA CCCGGCTGGA CGACGTGACC ACGTGTGGCC GGCGGCTGCC GCGCCCCGGT CGGCCCGATC AGCCGTTGGT GCATGTGCAC GCCGATCCGC AGCGGCTCGG GCGGACCCAC CCGCCGGCCG TCGGGTTGGC CTGCGATCCG GTCGCCTTCC TCGGCCAACT GGCACTGGAG CCCGCGTACC CGGACGCCGG CCGGGAGACC TGGATCGACG AGTTGCACGC GATCGAGGTC GAGAAGGCCG TCTGGTCCGA GCATCCGAGC GACGACGGCG TCGCGTTCGG TGCCGTGGTC GCCGGCCTCG ACGAGCTCAC CGACGGCGAC GTGGTCGTTG CCGTCGACTC CGGCACCTTC ACCAGTTGGC TGTACCGCTA CCTGCGGCTG AGCGGCGAGG GGCGGATGCT CGGAGTCGGA TCCAGCGCGA TGGGTTTCGG CGTCCCGGCC GGCGTGGCCG CTGCACTGCG GACACGCCGC CCGGTCGTGG TGGTCGTCGG CGACGGCGGG TTCCTGATGA CGGGCAGCGA ACTGGCCACA GCGGTGAGCC ACCGGCTGCC CCTGGTCGTT CTCGTCGCCA ACAACGGCAG CTACGGCACG ATCCGCCTGC ACCAGGAACG GGAGTTTCCC GGGCGGGTCA TCGCCACCGA TCTGAGTAAC CCCGACTTCG TCCAGCTCGC CCGCGCGTTC GGCGCGCTGG GCCTGATCGT GCAGGCCGAG GAGGACGTCG AGCCCTGCCT GGCCCGGGCA CTCGCCCACG GGGGGCCGGT CGTGGTCGAC GTACGGACCA GCCTGAGCTG GATCACCGCC TACCGACGGA TGCGAACGCG GGTGGCCGCG GATGTGGGGT CGGCATGA
Protein sequence | MNGIPGHPPG QPRTAATTLV AALLGHDVDR VFCVAGESYL AVLDALYDTP TVEVVTCRHE ASAAFAAVAD AKLTGRAGVC LVSRGPGATN AGIAVHSAAQ DATPLVLLVG HVPRSEIGTD AFQEIDPRAF SGLAKQVLVL LDPARTGEFV ARAFRVAEAG TRGPVVLVLP EDVLAMSDPV TPVPARWAAA APVAAAEDLQ AVRALLARSR RPLLVAGGDL SGDRGRCLLR EVAHRHRFPV VTSNKRQDLL DNRDSCYAGH LHNNTQERQI AALDRADLVL AVGTRLDDVT TCGRRLPRPG RPDQPLVHVH ADPQRLGRTH PPAVGLACDP VAFLGQLALE PAYPDAGRET WIDELHAIEV EKAVWSEHPS DDGVAFGAVV AGLDELTDGD VVVAVDSGTF TSWLYRYLRL SGEGRMLGVG SSAMGFGVPA GVAAALRTRR PVVVVVGDGG FLMTGSELAT AVSHRLPLVV LVANNGSYGT IRLHQEREFP GRVIATDLSN PDFVQLARAF GALGLIVQAE EDVEPCLARA LAHGGPVVVD VRTSLSWITA YRRMRTRVAA DVGSA
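The relationship between the two sequences above can be sketched in a few lines of Python. This is an illustrative sketch, not the tool that produced the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it makes one simplifying assumption: the first codon is treated as an initiator and always read as M (here it is GTG), without checking that it is a valid start codon. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is read as M (assumption)."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues:
        residues[0] = "M"  # initiator codon, not validated here
    protein = "".join(residues)
    # Drop a single terminal stop codon if one is present.
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    prefix = "GTGAACGGCA TCCCGGGTCA CCCGCCCGGC"
    print(translate_cds(prefix))  # MNGIPGHPPG
```

The printed residues match the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.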