Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3681 |
Symbol | |
ID | 5714211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009955 |
Strand | + |
Start bp | 84592 |
End bp | 86400 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641276599 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001541895 |
Protein GI | 159046223 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGATC TGACACCAAC CGTTACTACT GGACCCTTAC CGGCATCCCG GCGCGTTTAT CAGCCCGGAA CGCTGCATCC CCAAATTCGC GTCCCTATGC GCGAGATTGA TCTTCACCCT TCGGCTGGCG AACCGCCGGT TACGGTCTAT GACAGCTCTG GCCCCTATAC TGATCCCAGC GCTAAAATCG CTATCAACGG TGGCCTGCCG CGCCTGCGGG CAGACTGGAT CGCCGCACGC AACGATATCG AGCTTTATGA GGGTCGCCAT GTGCGTCCCG AGGATAATGG CTTTGCTGAG GGTGCCCGGC TGGTGCCCGA ATTCCCGGTC CGCCACAAGC CGTTGCAAGC CAAGGCAGGA CGCGCGGTCA CGCAGATGGC CTATGCCCGC GCCGGGATTA TAACACCCGA GATGGAATTT GTCGCCATCC GCGAGAACCT TGGCCGTAAG GCCGCGAAAG AGGCTCTGGT ACGTGACGGC CAGGATTGGG GCGCGGCCAT CCCCGATTTT GTGACTCCCG ATTTCGTGCG CGAAGAAGTC GCCAGGGGCC GTGCGATCAT CCCCGCCAAT ATCAACCACC CCGAGGCCGA GCCGATGGGG ATCGGGCGTA ATTTTCTGGT CAAGATTAAC GCGAATATCG GTAACTCCGC CGTCACCTCC TCGATGGAGG AAGAGGTGGA AAAGATGGTC TGGGCCACAC GTTGGGGCGC GGATACGGTG ATGGACTTGT CTACGGGCCG CAATATTCAC AACATCCGCG ACTGGATCAT CCGCAATTCT GCCGTACCCA TCGGCACCGT TCCGCTTTAT CAGGCGCTGG AAAAGGTCGA GGGTATCGCC GAAGACCTAA GCTGGGAGGT GTACCGCGAT ACGCTGATCG AACAGGCTGA ACAGGGGGTG GATTACTTCA CCATTCATGC TGGCGTGCGG CTGCATATGA TCCCGATGAC GATAAAGCGG GTGACCGGGA TTGTATCGCG CGGCGGCTCG ATCATGGCGA AATGGTGCCT GCATCATCAC CGCGAAAGTT TCCTTTACGA GCACTTCGCT GAGATCTGCG AAATTGCCCG GGCATATGAC GTCAGCTTCT CTTTGGGGGA TGGGTTGCGT CCCGGCTCTA TCGCGGACGC CAATGATGAG GCGCAATTCG CCGAACTGGA GACGTTGGGC GAACTGACGC AGATCGCATG GAAGCATGAC TGTCAGGTGA TGATCGAAGG ACCGGGCCAT GTAGCCATGC ACAAAATCAA GGAGAATATG GACAAGCAGC TGGAGGTCTG CGGCGAGGCG CCGTTTTATA CGCTCGGGCC GCTTACCACG GATATCGCAC CGGGATATGA CCACATTACC AGCGGTATTG GTGCGGCGAT GATAGGCTGG TTCGGCACGG CGATGCTTTG CTATGTCACT CCCAAAGAGC ATCTAGGTCT GCCTAACCGT GACGATGTGA AGATCGGTGT CATCACCTAC AAGATTGCGG CTCATGCTGC CGATCTGGCC AAGGGGCATC CGGGAGCGCA GATCCGAGAT GACGCGTTGT CACGCGCGCG GTTCGAGTTC CGCTGGGAGG ATCAGTTCAA CCTCTCGCTT GATCCCGAAA CCGCGCGCTC GATGCATGAC GAAACCTTGC CGAAAGAGGC GCATAAGGTC GCGCATTTCT GCTCCATGTG CGGACCGAAG TTCTGCTCGA TGCGGATCAG CCACGATATC CGCGCCGAGG CGCAGAAGGA CGGCATGGCG AAAATGGCCG AGAAGTTCCG CAAAGGCGGT GATCTTTATG TTCCTCTTGA TAAGGTAAAG GAAATATAA
|
Protein sequence | MKDLTPTVTT GPLPASRRVY QPGTLHPQIR VPMREIDLHP SAGEPPVTVY DSSGPYTDPS AKIAINGGLP RLRADWIAAR NDIELYEGRH VRPEDNGFAE GARLVPEFPV RHKPLQAKAG RAVTQMAYAR AGIITPEMEF VAIRENLGRK AAKEALVRDG QDWGAAIPDF VTPDFVREEV ARGRAIIPAN INHPEAEPMG IGRNFLVKIN ANIGNSAVTS SMEEEVEKMV WATRWGADTV MDLSTGRNIH NIRDWIIRNS AVPIGTVPLY QALEKVEGIA EDLSWEVYRD TLIEQAEQGV DYFTIHAGVR LHMIPMTIKR VTGIVSRGGS IMAKWCLHHH RESFLYEHFA EICEIARAYD VSFSLGDGLR PGSIADANDE AQFAELETLG ELTQIAWKHD CQVMIEGPGH VAMHKIKENM DKQLEVCGEA PFYTLGPLTT DIAPGYDHIT SGIGAAMIGW FGTAMLCYVT PKEHLGLPNR DDVKIGVITY KIAAHAADLA KGHPGAQIRD DALSRARFEF RWEDQFNLSL DPETARSMHD ETLPKEAHKV AHFCSMCGPK FCSMRISHDI RAEAQKDGMA KMAEKFRKGG DLYVPLDKVK EI
|
| |