Gene Dd1591_3870 details


Gene Information

Locus tag: Dd1591_3870
Symbol:
ID: 8118827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dickeya zeae Ech1591
Kingdom: Bacteria
Replicon accession: NC_012912
Strand:
Start bp: 4374449
End bp: 4376416
Gene Length: 1968 bp
Protein Length: 655 aa
Translation table: 11
GC content: 61%
IMG OID: 644854241
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_003006149
Protein GI: 251791428
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
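
The coordinate and length fields above agree with each other, which can be checked with simple arithmetic. The Python sketch below assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. Neither assumption is stated in the record, and the function name is illustrative only.

```python
def lengths_from_coordinates(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and a CDS that ends with
    a single stop codon (both are assumptions, not stated in the record).
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # includes the stop codon
    protein_length = codons - 1           # the stop codon encodes no residue
    return gene_length, protein_length


gene_length, protein_length = lengths_from_coordinates(4374449, 4376416)
print(gene_length, protein_length)  # prints: 1968 655
```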


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTACAG AACCCACCCA GTCTACGGAA CAGAATGTTT CCGGCGAAAA GAAATCCTCC 
GGTCGTCGCG AACAACGCGC CGCGGCACAG CAATTTATCG ACACGCTGCG CGGCTCGACC
TTTCCTAACT CGCAGCGCAT TTACCTAAGC GGCTCACGCC CGGACATCCG TGTGCCGATG
CGGGAAATCC AGCTTAGCCC AACACTGCTG GGCGGTAGCC GCGATAATCC GCGCTATGAA
GCCAACGAGG CCATACCGGT CTATGATACC GCCGGACCGT ATGGCGACCC TGCCGTCACC
CCGGACGTAC ACCGTGGGCT CAGTAAACTG CGCGCCGGCT GGGTTGCGGA ACGCGACGAC
ACGGAAACCT TATCGCACGT CAGCTCCGGC TTTACCCAGC AACGGCTGGC GGATATCGGC
CTCGACCACC TGCGTTTTGA ACATCTGCCG CAGCCTAAGC GCGCCAAAGC GGGCCGTCGC
GTTACCCAGT TGCATTACGC CCGTCAGGGC ATCATCACGC CGGAAATGGA ATTCATCGCC
ATCCGCGAAA ACATGGGGCG CGAGCGCATT CGCGGCGAGG TTCTGCGCCG GCAGCATCCG
GGCCACAGTT TCGGCGCACT GCTGCCGGAC AACATCACGC CGGAATTCGT GCGTCAGGAG
GTCGCCGCCG GGCGCGCCAT CATCCCCGCC AACATCAACC ACCCGGAATC GGAACCGATG
ATCATCGGCC GTAATTTTCT GGTGAAGGTG AATGCGAATA TCGGCAACTC CGCCGTGACG
TCCTCCATCG AAGAAGAGGT CGAGAAACTA GTGTGGGCCA CGCGCTGGGG GGCAGACACC
GTAATGGATT TATCCACCGG GCGCTATATT CATGAAACCC GCGAGTGGAT CTTGCGCAAC
AGCCCGGTGC CGATCGGCAC GGTGCCTATC TATCAGGCGC TGGAAAAGGT CAACGGCATC
GCGGAAAACC TCACCTGGGA ACTGTTCCGC GACACCTTGC TGGAACAAGC CGAACAAGGG
GTGGACTACT TCACCATCCA CGCCGGCGTG CTGTTACGCT ATGTGCCGAT GACCGCCAAA
CGCCTGACTG GCATTGTGTC GCGCGGCGGT TCCATCATGG CGAAATGGTG CCTGTCTCAC
CATCAGGAAA ACTTCCTCTA CCAGCATTTC CGTGAAATCT GCGAAATCTG TGCCGCCTAT
GACGTCGCGC TGTCGCTGGG CGATGGCCTG CGCCCCGGCT CGATTCAGGA TGCCAACGAC
GAAGCGCAGT TCGCCGAACT GCACACGCTG GGCGAGCTGA CTAAAATCGC CTGGGAATAC
GACGTGCAGG TAATGATCGA AGGCCCCGGT CATGTGCCGA TGCAGATGAT CCGCCGCAAC
ATGACCGAAG AACTGGAGCA CTGCCACGAG GCGCCGTTCT ACACGCTGGG CCCGCTCACC
ACCGACATCG CACCCGGCTA CGACCATTTC ACCTCCGGCA TCGGCGCGGC GATGATCGGC
TGGTTCGGCT GCGCCATGCT GTGCTACGTC ACCCCGAAAG AGCATTTGGG GCTGCCGAAC
AAAAACGACG TCAAGCAGGG GCTGATCGCC TACAAGATCG CCGCTCATGC CGCCGATTTG
GCGAAGGGCC ACCCCGGCGC GCAGATCCGC GATAACGCCA TGTCCAAGGC ACGTTTTGAA
TTCCGCTGGG AAGATCAGTT CAATCTGGCG TTGGACCCGG AAACCGCCCG CGCCTACCAC
GACGAAACCC TGCCGCAGGA ATCCGGTAAA GTGGCGCATT TCTGCTCAAT GTGCGGACCG
AAATTCTGCT CGATGAAGAT CACACAGGAA GTGCGCGACT ACGCCGCCCG TCAGGAGGCA
CAGGCACAAC CGGTGGATGT CGGTATGGCG CAGATGTCCG CCGAATTCCG CGCCCGCGGC
AGCGAGTTGT ACCACACCGC CACCAGCCTG CCGCAGGAGG AACGCTGA
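
A GC content figure like the one in the gene information can in principle be recomputed from the gene sequence as displayed. The Python sketch below first strips the spaces and line breaks that separate the 10-base blocks, then counts G and C. The helper names clean_sequence and gc_content are hypothetical, and the rounding rule used by the database is not stated, so the sketch returns an unrounded percentage.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# Example using only the first two 10-base blocks of the gene sequence.
fragment = clean_sequence("ATGTCTACAG AACCCACCCA")
print(len(fragment), gc_content(fragment))  # prints: 20 50.0
```

Pasting the full gene sequence into clean_sequence and passing the result to gc_content would give the value for the whole gene.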
 
Protein sequence
MSTEPTQSTE QNVSGEKKSS GRREQRAAAQ QFIDTLRGST FPNSQRIYLS GSRPDIRVPM 
REIQLSPTLL GGSRDNPRYE ANEAIPVYDT AGPYGDPAVT PDVHRGLSKL RAGWVAERDD
TETLSHVSSG FTQQRLADIG LDHLRFEHLP QPKRAKAGRR VTQLHYARQG IITPEMEFIA
IRENMGRERI RGEVLRRQHP GHSFGALLPD NITPEFVRQE VAAGRAIIPA NINHPESEPM
IIGRNFLVKV NANIGNSAVT SSIEEEVEKL VWATRWGADT VMDLSTGRYI HETREWILRN
SPVPIGTVPI YQALEKVNGI AENLTWELFR DTLLEQAEQG VDYFTIHAGV LLRYVPMTAK
RLTGIVSRGG SIMAKWCLSH HQENFLYQHF REICEICAAY DVALSLGDGL RPGSIQDAND
EAQFAELHTL GELTKIAWEY DVQVMIEGPG HVPMQMIRRN MTEELEHCHE APFYTLGPLT
TDIAPGYDHF TSGIGAAMIG WFGCAMLCYV TPKEHLGLPN KNDVKQGLIA YKIAAHAADL
AKGHPGAQIR DNAMSKARFE FRWEDQFNLA LDPETARAYH DETLPQESGK VAHFCSMCGP
KFCSMKITQE VRDYAARQEA QAQPVDVGMA QMSAEFRARG SELYHTATSL PQEER
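
The protein sequence follows from the gene sequence by codon translation. The Python sketch below makes that relationship concrete, using the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares. Table 11 differs in which start codons it permits, and this sketch does not handle alternative start codons. That is enough here because the gene begins with ATG. The function name translate is illustrative only.

```python
from itertools import product

# Codon table built in TCAG order; these amino acid assignments are the
# ones shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so block-formatted sequences can be pasted in.
    Alternative start codons are NOT treated specially (an assumption
    that holds for sequences beginning with ATG).
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTCTACAG AACCCACCCA GTCTACGGAA"))  # prints: MSTEPTQSTE
# The final 18 bases end in the TGA stop codon.
print(translate("CCGCAGGAGGAACGCTGA"))                # prints: PQEER
```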