Gene Dde_2278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_2278 
SymbolthiH 
ID3757289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp2299179 
End bp2300588 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content56% 
IMG OID637783169 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_388770 
Protein GI78357321 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.751586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTTG ATTCCCGTTC CCTGCCAGGC TTCATAGACG AAGAAAAAAT CGAATCTGTG 
ATTGCCGCAA CGGCCAAACC TGACGCTGTG CGTGTGCGCG AAATTCTCGC CAAGGCACGT
GAAGCAAAGG GCCTTGATGC CGAAGAGACC GCAACCCTGC TGCAACTCGA TAACGAAGAA
CTGGATGCGG AGCTGTTTGC CACAGCCAAA AAGGTTAAAC AGACCATCTA CGGTAACCGG
CTTGTTCTTT TTGCTCCTCT TTATATTACC AACGAATGCT ATAACCGGTG TGCCTATTGC
GGATTTAACG CCACAAACAG CGATCTGAAG CGCCGGACTC TCTCGGAAGA TGAAATCCGG
GCCGAAGTGG AAGTGCTGGA ACGTCTGGGG CATAAGCGCC TGCTGCTTGT GTACGGAGAG
CACCCGCGCC TTGATGCCGA CTGGATGGCA CGCACCATTC AGGTGGTGTA TGATACTGTT
TCTGAAAAAA GCGGTGAAAT CCGCCGTGTG AACATCAACT GTGCCCCGCA GACCGTGGAC
GGCTTCAGAA AGCTGCACGA TGTGGGCATA GGTACCTACC AGTGTTTTCA GGAAACCTAC
CACAAGGCGA CGTATGACAA GGCGCATCTG GGCGGTCCCA AAAAGGATTA CCTGTGGCGG
TTGTATGCCA TGCACCGCGC CATGGAGGCC GGCATCGACG ACGTGGGCAT GGGCCCCCTG
CTCGGTCTGT ACGACTACCG GTTTGAGATT CTTGCACTGA TGCAGCATGC CGCCGATCTG
GAAAAACATT TCGGCGTAGG CCCGCATACC ATCTCTTTCC CCAGGCTGGA ACCGGCCCTC
AATGCCGATA TGGCATTCAA TCCGCCGCAC CCGCTCACCG ATTCCCAGTT TAAACGAATG
GTTGCCGTGC TCCGGCTGGC AGTGCCGTAT ACAGGGCTTA TTCTCAGCAC GCGTGAAAAT
GCAGCCATGC GGCGTGAACT GCTCGAGCTG GGCGTTTCGC AGATCAGTGC GGGTTCGCGC
ACCTATCCGG GTGCCTACAG CGACCCGAGC TACGACCGGC CCGATGTGCA GCAGTTCTGC
GTAGGCGACA GCCGCAGTCT GGACGAGGTC ATAGCAGAGC TTGTCTCTTT GGGATACCTG
CCCTCGTGGT GCACGGCCTG TTACCGTCTG GGCCGTACCG GCGAACACTT TATGGAGCTG
GCAAAAAAAG GCTTCATTCA GGAATTCTGC CATCCCAACG CGCTGCTTAC CTTCAATGAA
TATCTGCATG ACTACGCTTC TGAATCGACA CGCGAAGCGG GCAGAAAGCT TATTGAAAAA
GAGGCGGCAG GCTGTCCGGA AAACAGGCGC GAGCTTGTTG CTTCGCGTCT GCAGCGCATA
GACGGCGGCG AGCGCGATTT GTACATCTGA
 
Protein sequence
MSFDSRSLPG FIDEEKIESV IAATAKPDAV RVREILAKAR EAKGLDAEET ATLLQLDNEE 
LDAELFATAK KVKQTIYGNR LVLFAPLYIT NECYNRCAYC GFNATNSDLK RRTLSEDEIR
AEVEVLERLG HKRLLLVYGE HPRLDADWMA RTIQVVYDTV SEKSGEIRRV NINCAPQTVD
GFRKLHDVGI GTYQCFQETY HKATYDKAHL GGPKKDYLWR LYAMHRAMEA GIDDVGMGPL
LGLYDYRFEI LALMQHAADL EKHFGVGPHT ISFPRLEPAL NADMAFNPPH PLTDSQFKRM
VAVLRLAVPY TGLILSTREN AAMRRELLEL GVSQISAGSR TYPGAYSDPS YDRPDVQQFC
VGDSRSLDEV IAELVSLGYL PSWCTACYRL GRTGEHFMEL AKKGFIQEFC HPNALLTFNE
YLHDYASEST REAGRKLIEK EAAGCPENRR ELVASRLQRI DGGERDLYI