Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3238 |
Symbol | |
ID | 5705389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3729081 |
End bp | 3730925 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641272666 |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_001538033 |
Protein GI | 159038780 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.864303 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACAGCG ACGACGGATG GGTACCGACC GCGCCGTGGT CGCTGCCCCG CAACGCGACG GTCGCCGACC ACATCACGCA GCGGATGGCC TGCTGGGGTG TCCGCCGCTA CTTCGGCTTT CCCGGCGACG CCATCAACGG CATGACCTCT GCCGTGCAGC GCACCAACGA GCTGGCGCAG TTCATCCAGG TCCGGCACGA GGAAACCGCC GGTTTCGCCG CGTCGGCGCA CGTCAAGTAC GGCGGCGGCC CACTCGGCTG CGTCCTGGTG ACCAGCGGCC CCGGGGCCAT CCACGTGCTC AACGGCCTGT ACGACGCGAA ACTCGACCAC CAGCCGGTCG TCGCACTGGT CGGACACACC GCGCTCACCG CCGAAGGCGG CGGCTACTAC CAGGAGGTCG ACCTGCTCGC CCTCTACAAG GACGTCGCCG CCGCGTTCCT GGCCAAACTC GACCACCCCG CCCAGGTACG TCACCTGGTC GACCGGGCCT GCCGGACAGC TCTCGCCCGG CGTACCGTCA CCGCCCTGAT CCTCCCCCTC GACGTGCAGG ACAAGCCGGC GGTGCCCGAC CCACCGCACG CCCACGGCTA TTACCAGACC AGCAGCGTGC CGGGAAGTGG CCCGACCATC CCGCCGGAGA CCGACGTTCG CCGCGCCGCC GAGGTGCTAC GCGGCGGCGA GCGGGTGGCG ATGCTGGTCG GTCAGGGTGC GCTCGGCGCC GAGACCGAGG TACGCGAGGT CGCCCGGCGG CTCGGCGCGG GTGTGGCGAC CGCGCTGCTG GGCTTCACCG TGGTCGACCA CCGCGAGCCC TGGGTGACCG GTGCGATCGG CCTGCTCGGC ACCCGGCCCA GCTGGCAGCT CATGCAGGAG TGCGACCGGC TGTTGATCGT CGGGAGCAAC CTGCCGTACT CCGAGTTCTA CCCACCACCG GGACGGGCGC GGGCGGTGCA GATCGACCGG GACGGCACCC TGCTGGGACT GCGGTACCCG ACCGAGGTGA ACCTGACCGG CGACGCCGCG CCGACCCTTC GGGCCCTGCT GCACGAGTTG GGCCCCGGCC CCGGCCCGAC CCGCTGGCGG GAGACGCTCA GCCGGAAGGT GGCATCGTGG CGGCAGTCGC AGCGTGAGCT GGCGGAGCAG CCCGCCGACC CGATCAACCC TCAGATGATC TTCACTCACC TCAACGAACG GCTCCCCGAC GACGTGCTGC TCGCCGTCGA CTGCGGTACC ACCACCGCCT GGTACGCCCG GCACGTTCAG GTCCGCCCGG GAATGCTGGC CAGCATGTCG GGCACCCTGC TGTCCATGGG CGGCGCCATG CCGTACGGCA TCGCCGCCAA GTTCGCCCAT CCCGACCGGC CACTGGTGGC CCTCATCGGC GACGGGGCGA TGCAGATGAA CGGTGTCAAC GAGCTGATCA CAGTGGCGAA GTACTGGCGT GGCTGGACCG ACCCGCGCTT CGTGGTGCTG GTCCTCAATA ACCGGGACCT GGCGTTCGTG AGCTGGGAGC AGCGGTCCAG CGAAGGCACG CCGATGTTCC CCGACAGCCA GCAACTGCCC GACATCGCCT ACCACCGGTG GGCCGAGGTG CTGGGGCTAC GCGGCGAGCT GGTGGACTCC CCCGACCAGG TGCCGGACGC GTGGGAACGG GCCTTGGGCG CCGACCGTCC CACCGTCATC AACGCTCTGG TCGACCCGGC CGAGATGATG CTGCCGCCGC ACTTCACCCC CGAACAGGTC CGCAACACCG CCATGGCGGT GCTACGTGGC GACACCGACT GGGCCGGCAT CGTCCGTCGC GGACTCCCCG CCACGCTCAG CACCTACCGG CCACGCCGAG GCTGA
|
Protein sequence | MDSDDGWVPT APWSLPRNAT VADHITQRMA CWGVRRYFGF PGDAINGMTS AVQRTNELAQ FIQVRHEETA GFAASAHVKY GGGPLGCVLV TSGPGAIHVL NGLYDAKLDH QPVVALVGHT ALTAEGGGYY QEVDLLALYK DVAAAFLAKL DHPAQVRHLV DRACRTALAR RTVTALILPL DVQDKPAVPD PPHAHGYYQT SSVPGSGPTI PPETDVRRAA EVLRGGERVA MLVGQGALGA ETEVREVARR LGAGVATALL GFTVVDHREP WVTGAIGLLG TRPSWQLMQE CDRLLIVGSN LPYSEFYPPP GRARAVQIDR DGTLLGLRYP TEVNLTGDAA PTLRALLHEL GPGPGPTRWR ETLSRKVASW RQSQRELAEQ PADPINPQMI FTHLNERLPD DVLLAVDCGT TTAWYARHVQ VRPGMLASMS GTLLSMGGAM PYGIAAKFAH PDRPLVALIG DGAMQMNGVN ELITVAKYWR GWTDPRFVVL VLNNRDLAFV SWEQRSSEGT PMFPDSQQLP DIAYHRWAEV LGLRGELVDS PDQVPDAWER ALGADRPTVI NALVDPAEMM LPPHFTPEQV RNTAMAVLRG DTDWAGIVRR GLPATLSTYR PRRG
|
| |