Gene Dd703_3720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd703_3720 
SymbolthiH 
ID8087090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya dadantii Ech703 
KingdomBacteria 
Replicon accessionNC_012880 
Strand
Start bp4310150 
End bp4311283 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content62% 
IMG OID644837798 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002989297 
Protein GI242241116 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.526508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAACT CGTTTGCACA ACGCTGGCGA CAGCTGGACT GGGACGACCT GACGCTGCGC 
ATCAACGGTA AAACCCGCGC CGACGTCGAA CAGGCGCTGA ACGCCGAACA CCCCGACATG
GAGGATTTCA TGGCGCTGCT GTCTCCGGCC GCCGCCTCTT ATCTGGAACC GCTGGCCCAA
CGCGCCCAAC AGCTGACCCG GCAGCGCTTC GGCAATACCG TCGGCTTTTA CGTGCCGCTC
TATCTCTCCA ATCTCTGCGC CAACGACTGC ACCTACTGCG GCTTCTCCAT GAGCAACCAC
CTCAAGCGCA AAACCTTGAA CGACGAGGAA ATTCGGCGGG AATGCGGTGC CATCAAGGAA
TTGGGGTTCG ACCACCTGCT GCTAGTGACG GGCGAGCACC AACGTAAGGT GGGGATGGAC
TATTTCCGGC GCGTATTTCC ACTGATTCGC CCGCACTTCA GCGCGCTGAT GATGGAAGTG
CAGCCGCTGG CGCAGGAAGA GTACGCGGAG CTGAAAACGC TGGGGCTGGA CGGGGTGATG
GTGTATCAGG AAACCTACCA TCAGGCGACT TACGCCCGCC ACCACCTGCA CGGGAAAAAA
CAGGATTTCG CCTGGCGGCT GAACACCCCG GATCGGCTGG GCCGCGCCGG GATCGACAAG
ATCGGGCTGG GCGCGTTGAT CGGCCTGTCC GACAGTTGGC GCACCGACTG CTATATGGTG
GCGGAACACT TGCGCCATCT GCAACGCCAC TATTGGCAAA GCCGCTACTC GTTGTCGTTT
CCGCGCCTGC GCCCGTGCGC CGGCGGCATC GAACCCGCTT CGCTGATGGA TGAAGCGCAA
CTGATGCAGA CGATCTGCGC CTTCCGGCTG CTGGCGCCCG ATGTAGAACT GTCGCTGTCG
ACCCGGGAAT CGCCCCATTT TCGCGACCAC ATGATCCCGA TCGCCATCAA CAACGTCAGC
GCTTTCTCCA AAACCCAGCC GGGCGGCTAC GCCGACGGCC ATGCGGAATT GGAGCAGTTC
TCCCCGCACG ACGACCGCCG GCCGGAAGCA GTGGCCGATG TCTTGACGCG CGCCGGCCTG
CAGCCGGTCT GGAAAGACTG GGAGGGATAT CTGGGCCGGA CGATCGCAAA CTGA
 
Protein sequence
MDNSFAQRWR QLDWDDLTLR INGKTRADVE QALNAEHPDM EDFMALLSPA AASYLEPLAQ 
RAQQLTRQRF GNTVGFYVPL YLSNLCANDC TYCGFSMSNH LKRKTLNDEE IRRECGAIKE
LGFDHLLLVT GEHQRKVGMD YFRRVFPLIR PHFSALMMEV QPLAQEEYAE LKTLGLDGVM
VYQETYHQAT YARHHLHGKK QDFAWRLNTP DRLGRAGIDK IGLGALIGLS DSWRTDCYMV
AEHLRHLQRH YWQSRYSLSF PRLRPCAGGI EPASLMDEAQ LMQTICAFRL LAPDVELSLS
TRESPHFRDH MIPIAINNVS AFSKTQPGGY ADGHAELEQF SPHDDRRPEA VADVLTRAGL
QPVWKDWEGY LGRTIAN