Gene Information |
Locus tag | Sare_0838 |
Symbol | |
ID | 5707274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 939399 |
End bp | 941129 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641270356 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001535747 |
Protein GI | 159036494 |
COG category | [G] Carbohydrate transport and metabolism; [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes |
TIGRFAM ID | |
| |
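The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table for Sare_0838.
START_BP = 939399
END_BP = 941129
GENE_LENGTH_BP = 1731
PROTEIN_LENGTH_AA = 576


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the values reported in the table.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 941129 - 939399 + 1 gives 1731 bp, and 1731 / 3 - 1 gives 576 aa, which matches the reported Gene Length and Protein Length.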
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00172744 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.208315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCGGCA GCGGGGGGTC GGTCACAGTC GGCGAACTGT TGCTTGGTCG CCTCCACGAC CTCGGCGTGC GTCATGTTTT TGGAGTGCCC GGCGACTATG CAATGGACTT CATAGATCAG ATCATGACGT TCGATGGCAT CGACTGGATC GGTAGCTCCA GCGAGTTCAA CGCCGGCTGC AGTGCGGACG GCTACGCCCG AGTTGCTGGC ATAGGTGCCA TTGTTACCCA ATTTGGTGTG GGCGAACTGT CGACCATGAA TGCATTGGCT GGCGCAATGG CTGAGTCGGT GCCTATCGTC TCGGTCGTGG GTGGCCCGAT GCTGGAAGTC ATGCGGCAGC GCACGTCGAT TCACCACTCA CTCGCGGATG GCGATTCCGA GCGTTGGATT CGGATGGCCC GCGAGGTGAC GGTTGCCCAA GCCTCGTTGA CGCCCGAATG TGCACTGCAG GAGATCGACC GGGTGTTGGC CGAGTGCTGG TCCCAGCAGC GTCCCGTATA CATTCGAATT CCCGGTGATG TGGCCATAGC TCCCGTCTCC CGACCGTCGC GACGCTTCAC CCGCCCAAAT CCGGTCGTGT TGCCCGCACA ACTGGACGCG TTCGCCGCCG CTGCTCAGCG CCTGCTCGCT GGTGCCGAAC GGCCAGCCTT GCTGGTGGGG AATCTACCGA TACGCCTCGG TCTTGGTGCG GCTGTCGCCG CGCTCGCCAA CGAGCGAAAC TGGCCGATCG CCACTCAGAT GCTCGGCCGA GGGCTGGTTG ACGAGACAGA TCCCCACTAC ATCGGCATCT ACAACGGGGC CGAAAGTTCG GCTCCGGTCC GCGAGGTGGT CGAAGGCGCC GACGTCCTGG TCTGTCTGGG AACCACCTTT TTCGACTGGA ATGGTCTGTT CACCGCTGAA CTGGATCCTG CCCGGATCAT AAACCTGAGG CGGGACGGCG CTGTGGTCGG CGGAACCTGT TTCGCCCCGG TATCCATGGC CGCGGCGCTG GATCGGTTGC ACGAGATGGC TGCCAGTCGT TCGGTTGGCT GGCCGAGCGC TGCCCTGTTG CACGACCCGC CCGAAATCGA CAGGGCAAGC ACCGACCCAA TCCGTCAGGA ACGACTCTGG TCGGCGGTTC AGGACGTACT GCGTCCGGGG GATATTTTGG TCTCGGAGGT CGGCACCGCG TTCTTCGGTG CGGCAACAAT GCGCCTACCG GCTGGGACGA CTGTATTGGC GGCGCCAATC TGGAGCTTGG CCGGCTATAC CACGCCTGCT GCTTTCGGCG CCGGGATCGC CGCGCCTGAC CGGCGGGTGG TGTCGATCAC CGGCGACGGG GGAATACAAA TTTCCCCACA GGAGATAAGC CGAATGTTCG TCTTTGATCA ACATCCGATC ATATTCGTGG TGAATAACGG TGGCTACAGT AGCGAGCGAG CGCTTGAAAA AGCGGTCGGG GAAGAAACTC AGGCGTACAC CCAGATCCCT GACTGGCGAT ATAGTGAGAT ACCGGCGGTG TTTGCGCCGG AGGGAACCTT CGTTGCGCAT GTCGCCCGCA CCGAGGCCGA GTTGGCTGGG ATCTTGGCCG GTGTCGACGG GCGTACGGAC CGGCTGACTC TCATTGAGGT GATCGTCGAC CCGACGGATC TGCCTCCAGG GTTGCCGCAG TGGAGCCAGG AGGCATCCGC CTTTATCTAC CACGCGCAGT TCCCTGCCCC GGCCAGTCTC CCCTGGGGCG GGCTCGGCTA G
|
Protein sequence | MGGSGGSVTV GELLLGRLHD LGVRHVFGVP GDYAMDFIDQ IMTFDGIDWI GSSSEFNAGC SADGYARVAG IGAIVTQFGV GELSTMNALA GAMAESVPIV SVVGGPMLEV MRQRTSIHHS LADGDSERWI RMAREVTVAQ ASLTPECALQ EIDRVLAECW SQQRPVYIRI PGDVAIAPVS RPSRRFTRPN PVVLPAQLDA FAAAAQRLLA GAERPALLVG NLPIRLGLGA AVAALANERN WPIATQMLGR GLVDETDPHY IGIYNGAESS APVREVVEGA DVLVCLGTTF FDWNGLFTAE LDPARIINLR RDGAVVGGTC FAPVSMAAAL DRLHEMAASR SVGWPSAALL HDPPEIDRAS TDPIRQERLW SAVQDVLRPG DILVSEVGTA FFGAATMRLP AGTTVLAAPI WSLAGYTTPA AFGAGIAAPD RRVVSITGDG GIQISPQEIS RMFVFDQHPI IFVVNNGGYS SERALEKAVG EETQAYTQIP DWRYSEIPAV FAPEGTFVAH VARTEAELAG ILAGVDGRTD RLTLIEVIVD PTDLPPGLPQ WSQEASAFIY HAQFPAPASL PWGGLG
|
| |
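The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG is one of the accepted alternative start codons. The sketch below shows one way to reproduce the protein from the gene sequence and to recompute the GC fraction. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, the sequence is assumed to contain only A, C, G and T, and the first codon is translated as M whenever it is a table 11 start codon.

```python
# Codon table built from the standard NCBI ordering (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code;
# it differs in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M if it is a start codon.

    Translation stops at the first stop codon. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if index == 0 and codon in START_CODONS_TABLE_11:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "GTGGGCGGCA GCGGGGGGTC GGTCACAGTC"
print(translate_cds(prefix))  # MGGSGGSVTV
```

Passing the full Gene sequence field to `translate_cds` should yield the Protein sequence field, and `gc_fraction` on the same input can be compared with the reported GC content.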