Gene GWCH70_0614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0614 
Symbol 
ID7978803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp678419 
End bp679444 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content46% 
IMG OID644797601 
Productthiamine/molybdopterin biosynthesis ThiF/MoeB-like protein 
Protein accessionYP_002948775 
Protein GI239826151 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTGAAC GGTATTCTCG ACAAGAGTTA TTTGCGCCAA TTGGCGCAGA AGGACAAAAA 
AAGATTATGC GGAAACATGT GCTCATTATT GGTGCGGGTG CGCTAGGAAC AGGAAATGCA
GAGGCGCTTG TGCGGGCAGG CGTTGGCAAA GTCACCATTG TTGACCGCGA TTATGTCGAA
TGGAGCAATT TGCAACGTCA GCAATTATAT AGCGAAGCGG ACGCAAAAGA ATGTATCCCA
AAAGCAATTG CTGCAAAGCG GCGGCTTGAA GAGGTAAATT CTGATGTCGC AATTGATGCC
ATCGTCGGCG ATGTAACGGC ACAAGAGCTT GAAGAGCTTA TTGCAGAGCG AAAGCCCGAC
CTTTTGATTG ATGCGACAGA TAATTTTGAT ATACGTATGA TTATTAACGA TGCTGCGTAT
AAATATCGCA TCCCGTGGAT TCACGGCGCG TGTGTCGGAA GCTATGGCAT TAGTTACGCG
TTTATCCCAG GGAAGACCCC ATGTTTTCAC TGTCTGCTCG AAACGGTGCC AGTAGGCGGT
TTGACATGTG ATACAGCAGG AATTATCAGC CCTGCTGTGC AAATGGTCGT CGCCTATCAA
GTAACGGAAG CATTAAAAAT TCTTGTTGAA GATTGGGCGG CGCTGCGCAA TAAACTTGTG
TCGTTTGATT TATGGAAAAA TCAGCATACG GCGATTCGCA TTGATCAAGT GAAAAAAGAA
GATTGCCCTA CTTGCGGCAC TCATCCATCG TATCCGTACC TTTCTTATGA TCAACAGACA
AAAACAGCGG TATTATGCGG ACGAAATTCC GTACAAATTC GCCCGGCTGC GCCTCGAAAC
TACAACTTGC AAGAGCTGGC TGAATTATTT GTCAAACAAG GATTGCCCGT AGATGTCAAC
CCGTATCTTG TCTCTGTATC GCTTGGAGAG CGACGGCTTG TTGTCTTTCA AGACGGACGC
GCGCTCATTC ATGGGACAAA GGATATTCAA GAGGCAAAAA CGATTTATTA TCGCTATTTA
GGCTAG
 
Protein sequence
MIERYSRQEL FAPIGAEGQK KIMRKHVLII GAGALGTGNA EALVRAGVGK VTIVDRDYVE 
WSNLQRQQLY SEADAKECIP KAIAAKRRLE EVNSDVAIDA IVGDVTAQEL EELIAERKPD
LLIDATDNFD IRMIINDAAY KYRIPWIHGA CVGSYGISYA FIPGKTPCFH CLLETVPVGG
LTCDTAGIIS PAVQMVVAYQ VTEALKILVE DWAALRNKLV SFDLWKNQHT AIRIDQVKKE
DCPTCGTHPS YPYLSYDQQT KTAVLCGRNS VQIRPAAPRN YNLQELAELF VKQGLPVDVN
PYLVSVSLGE RRLVVFQDGR ALIHGTKDIQ EAKTIYYRYL G