Gene Hoch_2104 details

Gene Information

Locus tag: Hoch_2104
Symbol:
ID: 8544490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2918776
End bp: 2920542
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 73%
IMG OID: 646386811
Product: thiamine pyrophosphate protein TPP binding domain protein
Protein accession: YP_003266542
Protein GI: 262195333
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
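
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2918776
END_BP = 2920542


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1767
print(protein_length(length_bp))  # 588
```

Under these assumptions the computed values agree with the Gene Length (1767 bp) and Protein Length (588 aa) fields.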


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.444225
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGAG GCAGACGCAT CGCCGACGTC CTGGTCGCCC AGGGCGTGCG CCAACTCTTT 
ACCCTGTGCG GTGGGCACAT CGCGCCCATC CTGGTCGAGA GCCAGCGCCG CGACATCCAG
GTGGTCGACG TCCGCCACGA GGCCAACGCG GTGTTCGCCG CCGACGCCGT GGCGCGACTC
ACCGGCATCC CGGGCGTGGC CGCGGTGACC GCGGGTCCCG GGGTCACCAA CGCGCTCACC
GCGATCAAGA ACGCCCAGCT CGCGCAGTCG CCGCTGGTGC TCCTGGGCGG CGCCACGGCC
ACGCTCTTGC GCGGCCGCGG CGCGCTCCAG GACATCGACC AGATGGCGCT GATCAAGCCG
CACGTGAAGT GGGCGGCGCG GCCGCGCAGC CTGCGCGAGA TCGTGCCGGC GCTCGAGCGC
GCCTTTGCCA TCGCCCGCCG CGGCGTGCCC GGACCGGTGT TCGTCGAGCT GGCCGTGGAC
CTGCTCTACG ACGAGCGCCT GGTGCGCGAG TGGTTCCTCA AGGACCGCAG CGGCGGCGAT
CCGACCCTGG GCCAGCGGGC CCAGGACCTG TACATCCGCG GCCACCTGGC CTACGTCTTC
GCCGAGGCCC GGGCCGTGCG CCTGCGCCCG CCCGCGCCGG CGCCGCTCGA GACCCCGTCG
ATGGAGGACG TGGCCGAGGC CGCGCGCGTC ATCGCCCAGT CCGAGCGGCC GCTGATGGTG
CTCGGCAGCC AGAGCATGCT GCACCCGACC CGGGCCCGCG AGCTGGTCGC CGCGGTCGAG
CGCATCGGCG TGCCCGTGTA CCTCTCGGGC ATGGCCCGCG GCCTGCTCGG TCGCAGCCAC
AACCTGCTCA TGCGCCACAA GCGCCGCGTG GCCCTGCGCG AGGCCGACAC CGTCATCCTG
TGCGGCGTGC CCTGCGACTT CCGCCTCGAC TACGGCGCCC ACGTGGCCCG TGCCCACGTC
ATCGGCGTCA ACCTCAGCCG CGAGGACCTG AGCCTCAACC GCAAGCCCGA CCTGGCCATC
AACGCCGATC CGCTGGCCTT TCTCAACCAC CTGTCCCGGG TCATGCCCGC GCAGCCGGCG
AGCTACGAGG CCTGGCGCGA TACCCTCATC GGCCGTGAGC GCGCCCGCGA GGACGAGATC
CGGGCCATGG CCGGACACGT GTCCGGCTCC GACTCGAGTG CGTCGGGCGA GGCAGACGGC
GGCGGCATCA ACCCGCTGGC GCTGTGCCAG GCCCTCGACA ACGTGCTCGC CGACGACAGC
GTGATCGTCG GCGACGGCGG CGACTTCGTG GCCACGGCCT CGTACACCGT GGCCCCGCGT
GGGCCCTACT CGTGGCTGGA TCCCGGCGTC TTCGGCACCC TGGGCGTGGG CGCGGGCTTT
GCGCTCGGCG CCAAGCTGGT GCGCCCCAGC GCCGACGTGT GGCTGCTCTA CGGCGACGGC
GCGGCCGGCT TCAGCATCAT GGAGTTCGAC ACCTTCGCGC GTCACGGCAT CCCGGTCATC
GCCGTGGTCG GCAACGACGC GGGCTGGACC CAGATCGCGC GCGACCAGGT CGACATCCTG
GGCACCGACA CCGCCTGTCG GCTCACCCAC ATGGACTACG ACCAGGTCGC CAGCGCGTGC
GGCGCCCACG GCATCCGCAT CAATGACCTC GCCCAGGTGC CCGGCGCCCT GCGCGAGGCC
GTCGAGGTGT CGCGCGAGGG TCGGCCGGTG CTGATCAACG CCATCCTCGC GGGTTCGGAT
TTCCGCAAGG GCTCGCTGTC GATGTGA
 
Protein sequence
MNGGRRIADV LVAQGVRQLF TLCGGHIAPI LVESQRRDIQ VVDVRHEANA VFAADAVARL 
TGIPGVAAVT AGPGVTNALT AIKNAQLAQS PLVLLGGATA TLLRGRGALQ DIDQMALIKP
HVKWAARPRS LREIVPALER AFAIARRGVP GPVFVELAVD LLYDERLVRE WFLKDRSGGD
PTLGQRAQDL YIRGHLAYVF AEARAVRLRP PAPAPLETPS MEDVAEAARV IAQSERPLMV
LGSQSMLHPT RARELVAAVE RIGVPVYLSG MARGLLGRSH NLLMRHKRRV ALREADTVIL
CGVPCDFRLD YGAHVARAHV IGVNLSREDL SLNRKPDLAI NADPLAFLNH LSRVMPAQPA
SYEAWRDTLI GRERAREDEI RAMAGHVSGS DSSASGEADG GGINPLALCQ ALDNVLADDS
VIVGDGGDFV ATASYTVAPR GPYSWLDPGV FGTLGVGAGF ALGAKLVRPS ADVWLLYGDG
AAGFSIMEFD TFARHGIPVI AVVGNDAGWT QIARDQVDIL GTDTACRLTH MDYDQVASAC
GAHGIRINDL AQVPGALREA VEVSREGRPV LINAILAGSD FRKGSLSM
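
Both sequences above are printed in space-separated groups of 10 characters. A minimal sketch of how such text might be rejoined into a single string before further analysis is given below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the 10-character groups form one string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGAACGGAG GCAGACGCAT CGCCGACGTC CTGGTCGCCC AGGGCGTGCG CCAACTCTTT"
dna = join_blocks(first_line)
print(len(dna))          # 60
print(gc_fraction(dna))  # GC fraction of this first line only
```

Passing the complete gene sequence through the same helpers would allow its length and GC fraction to be compared with the Gene Length and GC content fields of the record.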