Gene Information |
Locus tag | Haur_0487 |
Symbol | |
ID | 5732386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 566338 |
End bp | 568341 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277613 |
Product | transketolase |
Protein accession | YP_001543266 |
Protein GI | 159897019 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
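The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table for Haur_0487.
start_bp = 566338
end_bp = 568341
gene_length_bp = 2004
protein_length_aa = 667


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Compare the computed values with the values reported in the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold: 568341 - 566338 + 1 = 2004 bp, and 2004 / 3 - 1 = 667 aa.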
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.33279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTCAAG CACATACACT CGATGAACGG GCCATCAATA CAATCCGCAT GTTGTCGGTT GATGGTGTGC AAGCGGCTAA TTCGGGCCAC CCAGGTTTAC CAATGGGGGC AGCGGCCATG GCTTATGTGC TCTGGACGCG CCACCTCAAA CATAATCCGG CTAATCCAGA TTGGGCTGAT CGCGACCGCT TTGTGCTGTC GGCGGGCCAT GGCTCAATGC TGTTGTATAG TTTGTTGCAT CTCACGGGCT ATGATCTTTC GCTCGATGAT TTGAAGAATT TCCGCCAATG GCATAGTAAA ACCGCTGGTC ACCCAGAATA TGGTTATGCC GCCGGCATCG AAACCACAAC TGGGCCACTT GGCCAAGGAT TTGCCACGGG CGTGGGTATG GCGATTGCTG CCCGTCACTT AGCAGGCACG TTCAATCAGC CTGAACTCGA AATCGTCAAG CATCATATTT ATGCGATTGT CTCCGATGGC GATTTGGAAG AGGGGATTAG CGCCGAAGCT GCTTCGTTGG CAGGTCACCT CAAATTGGGC GAACTTATCT ATCTGTATGA TGATAACGAA ATTTCGATTG AAGGCGATAC CTCAATTGCC TTTACCGAAG ATGTGCCAGC CCGTTTCCGC GCCTATGGCT GGCATGTCCA AGAGATCGAC GGGCTTGATC CTGAGCAAGT TGATGCAGCC TTGCATGCTG CCAAAGCGGT GACCGATCAG CCATCGTTGA TTGTGGCTCA CACCGTGATT GGCTTTGGCT CGCCCAATCG GGCTGGCACG GCCAAAGCTC ACGGCTCGCC GCTCGGCCCC GATGAAGTTA AATTGACCAA GGAAGCGCTT GGTTGGCCGC TCGAACCAAC CTTCTACATT CCTGAAGAAG TCTTGGCCCA CTTCCGCCAA GCCTTGGATC ATGGTGCGGC GGCAGAGCAA GCCTGGAACG AATTGCTCGA ACGCTACACG GCGGCGCATC CTGAAAAAGC TGCTGATTTC AAGCAACGCA TGTCGGGTGA ATTGCCAGCA GGTTGGGATA GTACCTTGCC AGTTTGGCCA GCTGATGCCA AAGGCGTGGC AACCCGCAAA TCATCAGAAA CTGCCTTGAA TGCCTTGGCC GAGCAAATTC CAGCGCTCAT TGGCGGCTCA GCTGATTTGG CCGAATCGAC CTTTACCTTG ATCGAGCATG CTCAATCGTT CCAAGCCGAT ACGCCGCAAG GCCGCAATAT GCACTGGGGG ATTCGTGAGC ATGCTATGGT TGCTGCCGTC AACGGGATGG CGTTGCATGG CGGCACGATT CCTTACGGCG CAACCTTCTT GGTTTTCAGC GATTATTGCC GTGCCTCGAT TCGCTTGGCA GCCTTGATGG GCATTCGCAC GATTCAAGTC TTTACTCACG ATAGCATTGG GGTTGGCGAA GACGGCCCAA CCCACCAACC AATCGAACAC ATTCCATCGT TGCGGATTAT CCCCAATTTG AATGTGATGC GGCCTGGTGA TGCCAACGAA ACCAGCCAAG CTTGGCGCGT AGCAGTCAGC CATAAAGGCC CAACCTTGCT GGCCTTGACC CGCCAAAACT TACCAACGCT TGATCGCACC CGCTATGCCT CGGCTGAGGG TGTGGCTCAA GGTGGCTATG TCTTGGCTGA TAGCGCTGGT CAACCAGAAT TAATCATCAT TGCAACTGGC TCGGAATTGC AACATGCCGT GGCGGCCTAC GAGCAATTGA GTGGCGAAGG AGTCAAGGTA CGGGTGGTCA GTATGCCATC AACCTTGCTG TTCGACGCTC AGTCAGTGGA ATATCGCGAG 
AGCGTACTGC CCAAGGCTGT GACCAAACGG ATTGCGATTG AAGCTGCGCA TCCCGTGACC TGGTATAAAT ATGTTGGGAC TGAGGGCGAT ATTATTGGGA TTGATCACTT CGGTGCTTCA GCGCCAATTA ATATTTTGAT GAAGGAATTT GGCTTTACCG CCGAAAACCT GATTGCTCGT GCCAAGGCCT TGTTGGCCAA ATAA
|
Protein sequence | MTQAHTLDER AINTIRMLSV DGVQAANSGH PGLPMGAAAM AYVLWTRHLK HNPANPDWAD RDRFVLSAGH GSMLLYSLLH LTGYDLSLDD LKNFRQWHSK TAGHPEYGYA AGIETTTGPL GQGFATGVGM AIAARHLAGT FNQPELEIVK HHIYAIVSDG DLEEGISAEA ASLAGHLKLG ELIYLYDDNE ISIEGDTSIA FTEDVPARFR AYGWHVQEID GLDPEQVDAA LHAAKAVTDQ PSLIVAHTVI GFGSPNRAGT AKAHGSPLGP DEVKLTKEAL GWPLEPTFYI PEEVLAHFRQ ALDHGAAAEQ AWNELLERYT AAHPEKAADF KQRMSGELPA GWDSTLPVWP ADAKGVATRK SSETALNALA EQIPALIGGS ADLAESTFTL IEHAQSFQAD TPQGRNMHWG IREHAMVAAV NGMALHGGTI PYGATFLVFS DYCRASIRLA ALMGIRTIQV FTHDSIGVGE DGPTHQPIEH IPSLRIIPNL NVMRPGDANE TSQAWRVAVS HKGPTLLALT RQNLPTLDRT RYASAEGVAQ GGYVLADSAG QPELIIIATG SELQHAVAAY EQLSGEGVKV RVVSMPSTLL FDAQSVEYRE SVLPKAVTKR IAIEAAHPVT WYKYVGTEGD IIGIDHFGAS APINILMKEF GFTAENLIAR AKALLAK
|
| |
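The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any length or composition check. The following is a minimal sketch, assuming plain-text input copied from this record; `normalize_sequence`, `gc_percent`, and `codons` are hypothetical helper names, and the sample string is only the first three blocks of the gene sequence, not the full 2004 bp.

```python
def normalize_sequence(text):
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a normalized DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def codons(dna):
    """Split a normalized DNA string into consecutive 3-base codons."""
    return [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]


# First three 10-base blocks of the gene sequence, as displayed in the record.
sample = "GTGACTCAAG CACATACACT CGATGAACGG"
dna = normalize_sequence(sample)

print(len(dna))
print(gc_percent(dna))
print(codons(dna)[0])
```

Applied to the full gene sequence, the same helpers could be used to compare the result with the reported gene length (2004 bp) and GC content (53%). The first codon here is GTG, which translation table 11 accepts as an alternative start codon and which is read as methionine at initiation, consistent with the protein sequence beginning with M.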