Gene Hoch_5222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5222 
Symbol 
ID8547634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7177138 
End bp7178841 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content67% 
IMG OID646389897 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003269601 
Protein GI262198392 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAG GCAACGGCGC CCATGGCAAT GGCGCGAACG GCAACGGCGC CAACGACAAG 
CACAGCCTGG CCCAGGTCGG CCCGGCGCCG GTGGACGGCT TGACGCCGAT CGCCGGCTTC
CCCAGCTCCG AGAAGGTCTA TCTCGAGCGC GACGGTGTGC GGGTGCCGGT GCGGCGCATC
GAGCTCAGCG GCGGCGAGCC CGCGCTCGAT GTCTACGACA CCTCGGGCCC CGAGAACTGC
GATCTCCATC GCGGCCTGCC CAAGCTGCGG CAGCCGTGGA TCGACGCGCG CATAGCCGAG
GACGACGGCA ACCGCACGCA GATGCACTAC GCGCGCCGCG GCCTCATCAC CGAGGAGATG
AAGTTCATCG CCCTGCGCGA GGGACTCGCG GCCGAGTTCG TGCGCGACGA GGTCGCCAGC
GGCCGCGCCA TCATCCCGGC CAACATCAAG CACCCGGAGA GCGAGCCGAT GATCATCGGC
AAGAACTTCC TGGTGAAGAT CAACGCCAAC ATCGGCAACT CGGCCGTGTC CTCGTCGATC
GGCGAGGAGG TCGACAAGCT GCGCTGGGCG ACAAAATGGG GCGCGGACAC CATCATGGAC
CTGTCCACCG GCAAGCAGAT CCACGAGACC CGCGAGTGGA TTCTGCGCAA CGCGCCGGTG
CCCGTGGGCA CGGTGCCCAT CTACCAGGCG CTGGAGAAGG TCGGTGGCGA CCCCGAAAAG
CTCAACATCG ACGTGTTTAT GGACACCCTG GTCGAACAAG CCGAGCAGGG CGTCGACTAC
TTCACCATCC ACGCCGGCGT GCTGCTGCGC TACGTGCCGC TCACGGCCAA TCGCGTCACC
GGCATCGTCT CGCGCGGCGG CTCCATCCTG GCCAAGTGGT GCATGGCCCA CCACCGCGAG
AACTTCCTGT ACACCGAGTT CGAGCGCATC TGCGAGCTGA TGAAGAAGTA CGACGTGGCC
TTTAGCCTGG GCGACGGCCT GCGTCCGGGC TCGATCGCCG ACGCCAACGA CGCCGCCCAG
CTCGGCGAAC TCGAGACCCT GGGCGAGCTC ACCGAGCTGG CCTGGAAGCA CGACGTGCAG
GTGATGATCG AGGGCCCCGG CCACGTGCCC ATGCACAAGA TCAAAGAGAA CGTCGAGCTG
CAAGAGAAGC TGTGCCACGA GGCGCCCTTC TACACCCTGG GGCCGCTCAC CACCGATATC
GCTCCCGGCT ACGATCACAT CACCTCGGCC ATCGGCGCGG CCATGATCGG TTGGTTCGGC
ACCGCCATGC TGTGCTACGT GACGCCCAAG GAGCACCTCG GCCTGCCCGA TCGCGACGAC
GTCAAAGCCG GCGTCATCGC GTACAAGATC GCGGCCCACG CCGCCGACCT GGCCAAGGGC
CACCCGGGCG CACAGAAGCG CGACGACGCG CTGTCCAAAG CGCGCTTCGA GTTCCGCTGG
GACGACCAGT TCAACCTCTC GCTCGACCCC GACACCGCGC GCGCCTTCCA CGACCAGACC
CTGCCAGCGC CCGCGGCCAA AGGCGCGCAC TTCTGCTCCA TGTGCGGCCC CAAGTTCTGC
TCGATGAAGA TCACCCAGGA CGTGCGCGAC TTCGCCGTCG CCCAGGGCGT GAGCGAGGAC
GAGGCCGTGC GCTCCGGCAT GGAGCACAAA GCCGCCGAGT TCCGCGAGCA GGGCAAGCGC
CTGTACGCGG AAACCGAGAG CTGA
 
Protein sequence
MSTGNGAHGN GANGNGANDK HSLAQVGPAP VDGLTPIAGF PSSEKVYLER DGVRVPVRRI 
ELSGGEPALD VYDTSGPENC DLHRGLPKLR QPWIDARIAE DDGNRTQMHY ARRGLITEEM
KFIALREGLA AEFVRDEVAS GRAIIPANIK HPESEPMIIG KNFLVKINAN IGNSAVSSSI
GEEVDKLRWA TKWGADTIMD LSTGKQIHET REWILRNAPV PVGTVPIYQA LEKVGGDPEK
LNIDVFMDTL VEQAEQGVDY FTIHAGVLLR YVPLTANRVT GIVSRGGSIL AKWCMAHHRE
NFLYTEFERI CELMKKYDVA FSLGDGLRPG SIADANDAAQ LGELETLGEL TELAWKHDVQ
VMIEGPGHVP MHKIKENVEL QEKLCHEAPF YTLGPLTTDI APGYDHITSA IGAAMIGWFG
TAMLCYVTPK EHLGLPDRDD VKAGVIAYKI AAHAADLAKG HPGAQKRDDA LSKARFEFRW
DDQFNLSLDP DTARAFHDQT LPAPAAKGAH FCSMCGPKFC SMKITQDVRD FAVAQGVSED
EAVRSGMEHK AAEFREQGKR LYAETES