Gene Information |
Locus tag | Hoch_5222 |
Symbol | |
ID | 8547634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 7177138 |
End bp | 7178841 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646389897 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003269601 |
Protein GI | 262198392 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
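The coordinate and length rows above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS whose final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows for Hoch_5222.
length = gene_length_bp(7177138, 7178841)
print(length, protein_length_aa(length))
```

For Hoch_5222 this gives 1704 bp and 567 aa, matching the Gene Length and Protein Length rows.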
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACAG GCAACGGCGC CCATGGCAAT GGCGCGAACG GCAACGGCGC CAACGACAAG CACAGCCTGG CCCAGGTCGG CCCGGCGCCG GTGGACGGCT TGACGCCGAT CGCCGGCTTC CCCAGCTCCG AGAAGGTCTA TCTCGAGCGC GACGGTGTGC GGGTGCCGGT GCGGCGCATC GAGCTCAGCG GCGGCGAGCC CGCGCTCGAT GTCTACGACA CCTCGGGCCC CGAGAACTGC GATCTCCATC GCGGCCTGCC CAAGCTGCGG CAGCCGTGGA TCGACGCGCG CATAGCCGAG GACGACGGCA ACCGCACGCA GATGCACTAC GCGCGCCGCG GCCTCATCAC CGAGGAGATG AAGTTCATCG CCCTGCGCGA GGGACTCGCG GCCGAGTTCG TGCGCGACGA GGTCGCCAGC GGCCGCGCCA TCATCCCGGC CAACATCAAG CACCCGGAGA GCGAGCCGAT GATCATCGGC AAGAACTTCC TGGTGAAGAT CAACGCCAAC ATCGGCAACT CGGCCGTGTC CTCGTCGATC GGCGAGGAGG TCGACAAGCT GCGCTGGGCG ACAAAATGGG GCGCGGACAC CATCATGGAC CTGTCCACCG GCAAGCAGAT CCACGAGACC CGCGAGTGGA TTCTGCGCAA CGCGCCGGTG CCCGTGGGCA CGGTGCCCAT CTACCAGGCG CTGGAGAAGG TCGGTGGCGA CCCCGAAAAG CTCAACATCG ACGTGTTTAT GGACACCCTG GTCGAACAAG CCGAGCAGGG CGTCGACTAC TTCACCATCC ACGCCGGCGT GCTGCTGCGC TACGTGCCGC TCACGGCCAA TCGCGTCACC GGCATCGTCT CGCGCGGCGG CTCCATCCTG GCCAAGTGGT GCATGGCCCA CCACCGCGAG AACTTCCTGT ACACCGAGTT CGAGCGCATC TGCGAGCTGA TGAAGAAGTA CGACGTGGCC TTTAGCCTGG GCGACGGCCT GCGTCCGGGC TCGATCGCCG ACGCCAACGA CGCCGCCCAG CTCGGCGAAC TCGAGACCCT GGGCGAGCTC ACCGAGCTGG CCTGGAAGCA CGACGTGCAG GTGATGATCG AGGGCCCCGG CCACGTGCCC ATGCACAAGA TCAAAGAGAA CGTCGAGCTG CAAGAGAAGC TGTGCCACGA GGCGCCCTTC TACACCCTGG GGCCGCTCAC CACCGATATC GCTCCCGGCT ACGATCACAT CACCTCGGCC ATCGGCGCGG CCATGATCGG TTGGTTCGGC ACCGCCATGC TGTGCTACGT GACGCCCAAG GAGCACCTCG GCCTGCCCGA TCGCGACGAC GTCAAAGCCG GCGTCATCGC GTACAAGATC GCGGCCCACG CCGCCGACCT GGCCAAGGGC CACCCGGGCG CACAGAAGCG CGACGACGCG CTGTCCAAAG CGCGCTTCGA GTTCCGCTGG GACGACCAGT TCAACCTCTC GCTCGACCCC GACACCGCGC GCGCCTTCCA CGACCAGACC CTGCCAGCGC CCGCGGCCAA AGGCGCGCAC TTCTGCTCCA TGTGCGGCCC CAAGTTCTGC TCGATGAAGA TCACCCAGGA CGTGCGCGAC TTCGCCGTCG CCCAGGGCGT GAGCGAGGAC GAGGCCGTGC GCTCCGGCAT GGAGCACAAA GCCGCCGAGT TCCGCGAGCA GGGCAAGCGC CTGTACGCGG AAACCGAGAG CTGA
Protein sequence | MSTGNGAHGN GANGNGANDK HSLAQVGPAP VDGLTPIAGF PSSEKVYLER DGVRVPVRRI ELSGGEPALD VYDTSGPENC DLHRGLPKLR QPWIDARIAE DDGNRTQMHY ARRGLITEEM KFIALREGLA AEFVRDEVAS GRAIIPANIK HPESEPMIIG KNFLVKINAN IGNSAVSSSI GEEVDKLRWA TKWGADTIMD LSTGKQIHET REWILRNAPV PVGTVPIYQA LEKVGGDPEK LNIDVFMDTL VEQAEQGVDY FTIHAGVLLR YVPLTANRVT GIVSRGGSIL AKWCMAHHRE NFLYTEFERI CELMKKYDVA FSLGDGLRPG SIADANDAAQ LGELETLGEL TELAWKHDVQ VMIEGPGHVP MHKIKENVEL QEKLCHEAPF YTLGPLTTDI APGYDHITSA IGAAMIGWFG TAMLCYVTPK EHLGLPDRDD VKAGVIAYKI AAHAADLAKG HPGAQKRDDA LSKARFEFRW DDQFNLSLDP DTARAFHDQT LPAPAAKGAH FCSMCGPKFC SMKITQDVRD FAVAQGVSED EAVRSGMEHK AAEFREQGKR LYAETES
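The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, not the database's own tooling, of how the blocked gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein row. The helper names are hypothetical. The codon table uses the NCBI amino-acid assignment, which is identical for the standard table and translation table 11; the sketch ignores table 11's alternative start codons, which is safe here only because this gene starts with ATG.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# Table 11 assigns the same amino acids as the standard table; the two
# differ only in which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first six codons of the gene sequence above, followed by its TGA stop codon.
fragment = clean_sequence("ATGAGTACAG GCAACGGC TGA")
print(translate(fragment))
print(round(gc_fraction(fragment), 2))
```

Passing the full Gene sequence value through `clean_sequence` and then `translate` should reproduce the Protein sequence row once its spaces are removed, and `gc_fraction` on the full sequence can be compared with the GC content row.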