Gene Hoch_3879 details

Gene Information

Locus tag: Hoch_3879
Symbol:
ID: 8546275
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5337700
End bp: 5339727
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 72%
IMG OID: 646388551
Product: transketolase
Protein accession: YP_003268271
Protein GI: 262197062
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID: [TIGR00232] transketolase, bacterial and yeast
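
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 5337700
END_BP = 5339727


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # compare with "Gene Length"
print(protein_length(length_bp), "aa")  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed Gene Length (2028 bp) and Protein Length (675 aa).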


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.657918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGC TGAGCAAGCG CTGCATCGAC ACCATCCGCA CCCTGTCCAT CGACGCCATC 
GAGACGTCGA ACTCGGGGCA CCCGGGACTG CCCATGGGCG CCGCTCCCAT GGCCTTCGTG
CTCTGGGACC GCCACCTGCG CCACAACCCC CGCGACCCGG CCTGGCCCAA CCGCGACCGC
TTCGTGCTCT CGGCCGGCCA CGGCAGCATG TTGCTCTACA GCCTGCTGCA CCTCACCGGC
TACGGGCTGT CGATCGATGA GCTGAAATCC TTCCGGCAGT GGGGCAGCAA GACGCCCGGT
CACCCCGAGA GCTTCATCAC CGACGGGGTC GAGGCCACCA CCGGGCCGCT CGGCCAGGGC
GCGGCCAACG CCGTGGGTAT GGCCATGGCC GAGCGCCATC TGGCCAGCCG CTTCAACCGC
CCGGGCCACG AGATCGTCGA GCACTACACC TACGCGCTGG TCTCCGACGG CGACATCATG
GAGGGCGTGG CCGCCGAGGC CGCGTCCTTG GCCGGACACC TGGGTCTGGG ACGGCTGATC
TACCTCTACG ACTTCAACAA CATCACGCTC GACGGGCCGG CCTCGCTGGC CTTCTCCGAG
GACGTGTGCA AGCGCTACGA GGCCTACGGC TGGCACGTGC AGCGCGTCGA TGAGGGCGAC
ACCGATCTCG ACGCCATCGA CGCCGCTATC GCCGCGGCCA AGGCCGAGCG CGAGCGCCCC
AGCCTGATTC TGGTCAACAC CACCATCGGC TACGGCTCGC CCAAGAAGGC CGGCACCTCG
TCCGCTCACG GCGCGCCGCT GGGCGCCGAC GAGACCAAGG CCACCAAGGC GGCGCTGGGC
TGGCCCACCG ACGCCGCGTA CCTGGTGCCG GACGAGGTCC GCGCGCACAT GGCCGCGGCC
GGCGAGCGCG GCGCCACCGC GCAGACCGCG TGGCAGGAGC AGATGCACGG CTACGCCAAG
GCGCACCCGG AGCTGGCCGA GGCCTGGCGG CAGAGCCTGG CCTGCGAGCT GCCCGCGGGC
TGGGACAGCG AATCCATCGC CTGGGACGAG GGCGCGCAGG TGGCCACGCG CTCGGCCGGC
GCCAAGGTGA TCCAGGCGCT GACGGCCAAG GTGCCGTGGC TGCTGGGCGG CGACGCCGAC
CTGGGCTGCT CGACCAAGAC CCTGCTGCCC GGCGGCGGCG ACTTCGACGG CGCCAGCGGC
GCCGGCCGCA ATATCCACTT CGGCGTGCGC GAGCACGCCA TGGGCTCCAT CTGCAACGGC
ATGGAGTACC ACGGCGGCGT GCGCTCCTAC GCGGCGACCT TCTTCGTGTT TTCGGACTAC
ATGCGCCCGG CCGTGCGCCT GGCCGCGCTC AACCGCCTGC CGGTCATCTA CATCTGGACC
CACGACTCGA TCGGCGTCGG CGAGGACGGT CCCACGCACC AGCCGGTCGA GCAGCTCATG
TCGCTGCGCG CGATGCCCAA CCTGCACGTG GTGCGCCCGG CCGACGCGCG CGAGACCGAA
GAGGCCTGGC GCCACGCGCT TGTTCGGACA GATGGACCAA CCGCCCTGGT GTTCTCGCGC
CAGAACCTGC CCGTGCTGGC GCGTCCGGCC GCGATCGGGG AGGGGCCGCA CTTCCTCGCC
CGCGGCGCCT ACGTGCTGGT CGAAGCCGAC AAGCCCGAGG CCATCGACGT CATCCTCATG
GCCACCGGCT CGGAGGTGAG CCTGGCGGTG GCGGCGCGCG CGCTGCTGGC GGCCGAGGGC
CTGAGCGTGC GCGTGGTGTC GATGCCGTGC ATGGAGCTGT TTCGCGCCCA GAGCGAGGAG
TATCGCGAGT CGGTGCTGCC CGCCGCGGTG CGCGCGCGGG TCTCGGTCGA GGCCGGCTCC
ACCTTCGGCT GGGCCGGCTG GGTCGGCCTC GACGGCGAGT CCGTGGGTCT CGATCGTTTC
GGCGCGTCGG CGCCCGGCGA GGTGCTGATG GAGAAGTTCG GCTTCACGGC CGACAACATC
GCCGCGGCCG CGCGCCGCAC AGTCGAGCGC AACCGCTCCC GGGCCTGA
 
Protein sequence
MDELSKRCID TIRTLSIDAI ETSNSGHPGL PMGAAPMAFV LWDRHLRHNP RDPAWPNRDR 
FVLSAGHGSM LLYSLLHLTG YGLSIDELKS FRQWGSKTPG HPESFITDGV EATTGPLGQG
AANAVGMAMA ERHLASRFNR PGHEIVEHYT YALVSDGDIM EGVAAEAASL AGHLGLGRLI
YLYDFNNITL DGPASLAFSE DVCKRYEAYG WHVQRVDEGD TDLDAIDAAI AAAKAERERP
SLILVNTTIG YGSPKKAGTS SAHGAPLGAD ETKATKAALG WPTDAAYLVP DEVRAHMAAA
GERGATAQTA WQEQMHGYAK AHPELAEAWR QSLACELPAG WDSESIAWDE GAQVATRSAG
AKVIQALTAK VPWLLGGDAD LGCSTKTLLP GGGDFDGASG AGRNIHFGVR EHAMGSICNG
MEYHGGVRSY AATFFVFSDY MRPAVRLAAL NRLPVIYIWT HDSIGVGEDG PTHQPVEQLM
SLRAMPNLHV VRPADARETE EAWRHALVRT DGPTALVFSR QNLPVLARPA AIGEGPHFLA
RGAYVLVEAD KPEAIDVILM ATGSEVSLAV AARALLAAEG LSVRVVSMPC MELFRAQSEE
YRESVLPAAV RARVSVEAGS TFGWAGWVGL DGESVGLDRF GASAPGEVLM EKFGFTADNI
AAAARRTVER NRSRA
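
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how that could be done, with a simple GC-content helper. The helper names are hypothetical, and only the first and last rows of the gene sequence are embedded here for brevity, so the printed GC value describes that one row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First and last rows of the gene sequence, copied from the listing.
first_row = "ATGGACGAGC TGAGCAAGCG CTGCATCGAC ACCATCCGCA CCCTGTCCAT CGACGCCATC"
last_row = "GCCGCGGCCG CGCGCCGCAC AGTCGAGCGC AACCGCTCCC GGGCCTGA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

print("first row length:", len(first))
print("start codon:", first[:3])
print("final codon:", last[-3:])
print("GC% of first row:", round(gc_percent(first), 1))
```

To reproduce the record's overall GC content, pass the full cleaned gene sequence to `gc_percent` instead of a single row.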