Gene Hoch_5290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5290 
Symbol 
ID8547702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7274505 
End bp7276148 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content65% 
IMG OID646389964 
Productthiamine pyrophosphate protein domain protein TPP-binding protein 
Protein accessionYP_003269668 
Protein GI262198459 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAT CCGACTTATT CGTTAAAGCG CTCGAAGCCG AAGGTGTGGA ATACGTCTTC 
GGCGTCCCCG GTGAGGAAAA TCTCGATTTC CTCGCCTCCC TCCAGAACTC GTCTATCAAG
CTGGTGCTCA CCCGGCACGA GCAGGGCGCC GGCTTCATGG CCGCCACCTA CGGACGCCTC
ACCGGCAAAC CCGGCGTGTG CTTGTCCACG CTCGGCCCGG GCGCGACCAA CCTGGTCACG
GCCGCCGCCT ACGCCCAGCT CGGCGGCCTG CCGATGTTCA TGCTCACCGG CCAGAAGCCG
ATCAAGACCA GCAAGCAGGC GCAGTTTCAG ATCGTCGACG TGGTCGACAT GATGCGCCCG
CTGACCAAGT ACACGCGCCA GATCGTGAGC GCCGACTCCA TCCCCTCGCG GGTGCGCGAG
GCCTTTCGCC TGGCCCAGGA GGAGCGGCCC GGCGCCGTGC ACCTCGAGCT GCCCGAGGAC
ATCGCGGCCG AAAACAGCGA GGCCGGCGTC ATCCAGGCCA GCCAGGTGCG GCGGCCGGTG
GCCGAGGACA AGGCCATCAA GTCGGCGGTC GAGCTCATCG AGAAAGCCTC GCATCCGCTG
CTGCTCATCG GCGCGGGCGC CAACCGCAAG CTGACCTCGC GCATGCTGCG GCAGTTCATC
GACAAGACCG GGATTCCCTT CTTCAGCACG CAGATGGGCA AAGGCGTCAT CGACGAGCGC
GACCCACTGT ACCTGGGCAA CGCGGCGCTG TCCGACAACG ACTTCCTGCA CCGGGCGATC
GAGCACGCCG ACTTGATCAT CAATGTCGGA CACGACGTCG TCGAAAAACC GCCCTTCTTC
ATGCACCGCG AGTCCAAGCT CAAGGTCATC CACGTCAACT TCTCGAGCGC CAACGTCGAC
CCCGTGTACT TTCCGCAAGT GGAAGTGGTC GGCGACATCG CCAACAGCAT CTGGCAGATC
AAAGAGCGCA TGCTCAAGCA GAGCACCTGG GACTTCTCCT ACTTCCTGAA GGTCAAGGAG
CGCCTCGAGA TGCACCTGCG CGAGGGCGTG GACGACTGCG CCTTCCCGGT GCAGCCGCAG
CGCCTGGTCG CCGACGTGCG CCGGGCCATG CCCGAGAGCG GCATCATCGC GCTCGACAAC
GGCGTGTATA AGATCTGGTT TGCGCGCAAC TACTGGGCCT ACGGACCCAA CACCGTGCTG
CTCGACAACG CGCTCGCGAC CATGGGCGCC GGCCTGCCCT CGGCCATGGC GGCCAAGCTG
GTGTATCCCG ATCGCAAGGT CATGGCCATC TGCGGCGACG GCGGCTTCCT GATGAACTCG
CAGGAGCTCG AGACCGCGGT GCGCCTCAAG CTCGACGTGG TGGTCATGAT CCTGCGCGAC
GACGCCTACG GCATGATCAA GTGGAAGCAG ACCGCCATGG GCCTGGGCGA CTACGGCCTG
GACTTCCAGA ACCCCGACTT CGTCAAGTAC GCCGAGAGCT ACGGCGCCCA CGGCCACCGG
CTCGAGCGCA CCGAAGACCT GCAGTCGCTG GTCGAGCGCT GCCTGAGCAC CCCGGGCGTG
CACGTCATCG ACGTGCCCGT GGACTACTCG AGCAACGACC GCATCCTCAA CCGCGAGATC
AAAGAGAAGA GCAAACAGCT CTGA
 
Protein sequence
MKASDLFVKA LEAEGVEYVF GVPGEENLDF LASLQNSSIK LVLTRHEQGA GFMAATYGRL 
TGKPGVCLST LGPGATNLVT AAAYAQLGGL PMFMLTGQKP IKTSKQAQFQ IVDVVDMMRP
LTKYTRQIVS ADSIPSRVRE AFRLAQEERP GAVHLELPED IAAENSEAGV IQASQVRRPV
AEDKAIKSAV ELIEKASHPL LLIGAGANRK LTSRMLRQFI DKTGIPFFST QMGKGVIDER
DPLYLGNAAL SDNDFLHRAI EHADLIINVG HDVVEKPPFF MHRESKLKVI HVNFSSANVD
PVYFPQVEVV GDIANSIWQI KERMLKQSTW DFSYFLKVKE RLEMHLREGV DDCAFPVQPQ
RLVADVRRAM PESGIIALDN GVYKIWFARN YWAYGPNTVL LDNALATMGA GLPSAMAAKL
VYPDRKVMAI CGDGGFLMNS QELETAVRLK LDVVVMILRD DAYGMIKWKQ TAMGLGDYGL
DFQNPDFVKY AESYGAHGHR LERTEDLQSL VERCLSTPGV HVIDVPVDYS SNDRILNREI
KEKSKQL