Gene Information |
Locus tag | Dd1591_3870 |
Symbol | |
ID | 8118827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 4374449 |
End bp | 4376416 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644854241 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003006149 |
Protein GI | 251791428 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
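The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4374449
END_BP = 4376416


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1968
print(protein_length(length))  # 655
```

Both results agree with the Gene Length (1968 bp) and Protein Length (655 aa) fields in the record.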
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACAG AACCCACCCA GTCTACGGAA CAGAATGTTT CCGGCGAAAA GAAATCCTCC GGTCGTCGCG AACAACGCGC CGCGGCACAG CAATTTATCG ACACGCTGCG CGGCTCGACC TTTCCTAACT CGCAGCGCAT TTACCTAAGC GGCTCACGCC CGGACATCCG TGTGCCGATG CGGGAAATCC AGCTTAGCCC AACACTGCTG GGCGGTAGCC GCGATAATCC GCGCTATGAA GCCAACGAGG CCATACCGGT CTATGATACC GCCGGACCGT ATGGCGACCC TGCCGTCACC CCGGACGTAC ACCGTGGGCT CAGTAAACTG CGCGCCGGCT GGGTTGCGGA ACGCGACGAC ACGGAAACCT TATCGCACGT CAGCTCCGGC TTTACCCAGC AACGGCTGGC GGATATCGGC CTCGACCACC TGCGTTTTGA ACATCTGCCG CAGCCTAAGC GCGCCAAAGC GGGCCGTCGC GTTACCCAGT TGCATTACGC CCGTCAGGGC ATCATCACGC CGGAAATGGA ATTCATCGCC ATCCGCGAAA ACATGGGGCG CGAGCGCATT CGCGGCGAGG TTCTGCGCCG GCAGCATCCG GGCCACAGTT TCGGCGCACT GCTGCCGGAC AACATCACGC CGGAATTCGT GCGTCAGGAG GTCGCCGCCG GGCGCGCCAT CATCCCCGCC AACATCAACC ACCCGGAATC GGAACCGATG ATCATCGGCC GTAATTTTCT GGTGAAGGTG AATGCGAATA TCGGCAACTC CGCCGTGACG TCCTCCATCG AAGAAGAGGT CGAGAAACTA GTGTGGGCCA CGCGCTGGGG GGCAGACACC GTAATGGATT TATCCACCGG GCGCTATATT CATGAAACCC GCGAGTGGAT CTTGCGCAAC AGCCCGGTGC CGATCGGCAC GGTGCCTATC TATCAGGCGC TGGAAAAGGT CAACGGCATC GCGGAAAACC TCACCTGGGA ACTGTTCCGC GACACCTTGC TGGAACAAGC CGAACAAGGG GTGGACTACT TCACCATCCA CGCCGGCGTG CTGTTACGCT ATGTGCCGAT GACCGCCAAA CGCCTGACTG GCATTGTGTC GCGCGGCGGT TCCATCATGG CGAAATGGTG CCTGTCTCAC CATCAGGAAA ACTTCCTCTA CCAGCATTTC CGTGAAATCT GCGAAATCTG TGCCGCCTAT GACGTCGCGC TGTCGCTGGG CGATGGCCTG CGCCCCGGCT CGATTCAGGA TGCCAACGAC GAAGCGCAGT TCGCCGAACT GCACACGCTG GGCGAGCTGA CTAAAATCGC CTGGGAATAC GACGTGCAGG TAATGATCGA AGGCCCCGGT CATGTGCCGA TGCAGATGAT CCGCCGCAAC ATGACCGAAG AACTGGAGCA CTGCCACGAG GCGCCGTTCT ACACGCTGGG CCCGCTCACC ACCGACATCG CACCCGGCTA CGACCATTTC ACCTCCGGCA TCGGCGCGGC GATGATCGGC TGGTTCGGCT GCGCCATGCT GTGCTACGTC ACCCCGAAAG AGCATTTGGG GCTGCCGAAC AAAAACGACG TCAAGCAGGG GCTGATCGCC TACAAGATCG CCGCTCATGC CGCCGATTTG GCGAAGGGCC ACCCCGGCGC GCAGATCCGC GATAACGCCA TGTCCAAGGC ACGTTTTGAA TTCCGCTGGG AAGATCAGTT CAATCTGGCG TTGGACCCGG AAACCGCCCG CGCCTACCAC GACGAAACCC TGCCGCAGGA ATCCGGTAAA GTGGCGCATT TCTGCTCAAT GTGCGGACCG 
AAATTCTGCT CGATGAAGAT CACACAGGAA GTGCGCGACT ACGCCGCCCG TCAGGAGGCA CAGGCACAAC CGGTGGATGT CGGTATGGCG CAGATGTCCG CCGAATTCCG CGCCCGCGGC AGCGAGTTGT ACCACACCGC CACCAGCCTG CCGCAGGAGG AACGCTGA
|
Protein sequence | MSTEPTQSTE QNVSGEKKSS GRREQRAAAQ QFIDTLRGST FPNSQRIYLS GSRPDIRVPM REIQLSPTLL GGSRDNPRYE ANEAIPVYDT AGPYGDPAVT PDVHRGLSKL RAGWVAERDD TETLSHVSSG FTQQRLADIG LDHLRFEHLP QPKRAKAGRR VTQLHYARQG IITPEMEFIA IRENMGRERI RGEVLRRQHP GHSFGALLPD NITPEFVRQE VAAGRAIIPA NINHPESEPM IIGRNFLVKV NANIGNSAVT SSIEEEVEKL VWATRWGADT VMDLSTGRYI HETREWILRN SPVPIGTVPI YQALEKVNGI AENLTWELFR DTLLEQAEQG VDYFTIHAGV LLRYVPMTAK RLTGIVSRGG SIMAKWCLSH HQENFLYQHF REICEICAAY DVALSLGDGL RPGSIQDAND EAQFAELHTL GELTKIAWEY DVQVMIEGPG HVPMQMIRRN MTEELEHCHE APFYTLGPLT TDIAPGYDHF TSGIGAAMIG WFGCAMLCYV TPKEHLGLPN KNDVKQGLIA YKIAAHAADL AKGHPGAQIR DNAMSKARFE FRWEDQFNLA LDPETARAYH DETLPQESGK VAHFCSMCGP KFCSMKITQE VRDYAARQEA QAQPVDVGMA QMSAEFRARG SELYHTATSL PQEER
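As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates a nucleotide string codon by codon and computes GC content. It uses only the Python standard library. The codon table is built from the standard genetic code, on the assumption that translation table 11 assigns the same amino acids to codons and differs only in which start codons are permitted. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the 10-base grouping with spaces is simply stripped before processing.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, ordered by codon TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (such as the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCTACAG AACCCACCCA GTCTACGGAA"))  # MSTEPTQSTE
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would be the natural way to check the complete protein and the listed GC content.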