Gene Haur_0487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0487 
Symbol 
ID5732386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp566338 
End bp568341 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content53% 
IMG OID641277613 
Producttransketolase 
Protein accessionYP_001543266 
Protein GI159897019 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.33279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTCAAG CACATACACT CGATGAACGG GCCATCAATA CAATCCGCAT GTTGTCGGTT 
GATGGTGTGC AAGCGGCTAA TTCGGGCCAC CCAGGTTTAC CAATGGGGGC AGCGGCCATG
GCTTATGTGC TCTGGACGCG CCACCTCAAA CATAATCCGG CTAATCCAGA TTGGGCTGAT
CGCGACCGCT TTGTGCTGTC GGCGGGCCAT GGCTCAATGC TGTTGTATAG TTTGTTGCAT
CTCACGGGCT ATGATCTTTC GCTCGATGAT TTGAAGAATT TCCGCCAATG GCATAGTAAA
ACCGCTGGTC ACCCAGAATA TGGTTATGCC GCCGGCATCG AAACCACAAC TGGGCCACTT
GGCCAAGGAT TTGCCACGGG CGTGGGTATG GCGATTGCTG CCCGTCACTT AGCAGGCACG
TTCAATCAGC CTGAACTCGA AATCGTCAAG CATCATATTT ATGCGATTGT CTCCGATGGC
GATTTGGAAG AGGGGATTAG CGCCGAAGCT GCTTCGTTGG CAGGTCACCT CAAATTGGGC
GAACTTATCT ATCTGTATGA TGATAACGAA ATTTCGATTG AAGGCGATAC CTCAATTGCC
TTTACCGAAG ATGTGCCAGC CCGTTTCCGC GCCTATGGCT GGCATGTCCA AGAGATCGAC
GGGCTTGATC CTGAGCAAGT TGATGCAGCC TTGCATGCTG CCAAAGCGGT GACCGATCAG
CCATCGTTGA TTGTGGCTCA CACCGTGATT GGCTTTGGCT CGCCCAATCG GGCTGGCACG
GCCAAAGCTC ACGGCTCGCC GCTCGGCCCC GATGAAGTTA AATTGACCAA GGAAGCGCTT
GGTTGGCCGC TCGAACCAAC CTTCTACATT CCTGAAGAAG TCTTGGCCCA CTTCCGCCAA
GCCTTGGATC ATGGTGCGGC GGCAGAGCAA GCCTGGAACG AATTGCTCGA ACGCTACACG
GCGGCGCATC CTGAAAAAGC TGCTGATTTC AAGCAACGCA TGTCGGGTGA ATTGCCAGCA
GGTTGGGATA GTACCTTGCC AGTTTGGCCA GCTGATGCCA AAGGCGTGGC AACCCGCAAA
TCATCAGAAA CTGCCTTGAA TGCCTTGGCC GAGCAAATTC CAGCGCTCAT TGGCGGCTCA
GCTGATTTGG CCGAATCGAC CTTTACCTTG ATCGAGCATG CTCAATCGTT CCAAGCCGAT
ACGCCGCAAG GCCGCAATAT GCACTGGGGG ATTCGTGAGC ATGCTATGGT TGCTGCCGTC
AACGGGATGG CGTTGCATGG CGGCACGATT CCTTACGGCG CAACCTTCTT GGTTTTCAGC
GATTATTGCC GTGCCTCGAT TCGCTTGGCA GCCTTGATGG GCATTCGCAC GATTCAAGTC
TTTACTCACG ATAGCATTGG GGTTGGCGAA GACGGCCCAA CCCACCAACC AATCGAACAC
ATTCCATCGT TGCGGATTAT CCCCAATTTG AATGTGATGC GGCCTGGTGA TGCCAACGAA
ACCAGCCAAG CTTGGCGCGT AGCAGTCAGC CATAAAGGCC CAACCTTGCT GGCCTTGACC
CGCCAAAACT TACCAACGCT TGATCGCACC CGCTATGCCT CGGCTGAGGG TGTGGCTCAA
GGTGGCTATG TCTTGGCTGA TAGCGCTGGT CAACCAGAAT TAATCATCAT TGCAACTGGC
TCGGAATTGC AACATGCCGT GGCGGCCTAC GAGCAATTGA GTGGCGAAGG AGTCAAGGTA
CGGGTGGTCA GTATGCCATC AACCTTGCTG TTCGACGCTC AGTCAGTGGA ATATCGCGAG
AGCGTACTGC CCAAGGCTGT GACCAAACGG ATTGCGATTG AAGCTGCGCA TCCCGTGACC
TGGTATAAAT ATGTTGGGAC TGAGGGCGAT ATTATTGGGA TTGATCACTT CGGTGCTTCA
GCGCCAATTA ATATTTTGAT GAAGGAATTT GGCTTTACCG CCGAAAACCT GATTGCTCGT
GCCAAGGCCT TGTTGGCCAA ATAA
 
Protein sequence
MTQAHTLDER AINTIRMLSV DGVQAANSGH PGLPMGAAAM AYVLWTRHLK HNPANPDWAD 
RDRFVLSAGH GSMLLYSLLH LTGYDLSLDD LKNFRQWHSK TAGHPEYGYA AGIETTTGPL
GQGFATGVGM AIAARHLAGT FNQPELEIVK HHIYAIVSDG DLEEGISAEA ASLAGHLKLG
ELIYLYDDNE ISIEGDTSIA FTEDVPARFR AYGWHVQEID GLDPEQVDAA LHAAKAVTDQ
PSLIVAHTVI GFGSPNRAGT AKAHGSPLGP DEVKLTKEAL GWPLEPTFYI PEEVLAHFRQ
ALDHGAAAEQ AWNELLERYT AAHPEKAADF KQRMSGELPA GWDSTLPVWP ADAKGVATRK
SSETALNALA EQIPALIGGS ADLAESTFTL IEHAQSFQAD TPQGRNMHWG IREHAMVAAV
NGMALHGGTI PYGATFLVFS DYCRASIRLA ALMGIRTIQV FTHDSIGVGE DGPTHQPIEH
IPSLRIIPNL NVMRPGDANE TSQAWRVAVS HKGPTLLALT RQNLPTLDRT RYASAEGVAQ
GGYVLADSAG QPELIIIATG SELQHAVAAY EQLSGEGVKV RVVSMPSTLL FDAQSVEYRE
SVLPKAVTKR IAIEAAHPVT WYKYVGTEGD IIGIDHFGAS APINILMKEF GFTAENLIAR
AKALLAK