Gene GWCH70_0354 details

Gene Information

Locus tag: GWCH70_0354
Symbol:
ID: 7977467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 404275
End bp: 405855
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 47%
IMG OID: 644797345
Product: hypothetical protein
Protein accession: YP_002948545
Protein GI: 239825921
COG category:
COG ID:
TIGRFAM ID: [TIGR03605] SagB-type dehydrogenase domain
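
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the 1581 bp include the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; the coordinate convention (1-based, inclusive)
# is an assumption that happens to reproduce the listed gene length.
START_BP = 404275
END_BP = 405855


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of gene_len bp."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1581 526
```

Both results agree with the Gene Length and Protein Length fields of the record.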


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCTAG AGACATTTTT ACACCATCTT CATTTTGACA TCGATAAGAT TATGCCGCCA 
AATTGGGAGG TAGATTGGGA AGATGCGCCG CTTGCGTATA AGCTGTACCG TAACTTGCCA
GTGATTCCGC TTTCTCCAGA AGTGCCGTTA ACACTCGAGG GAAGAGAAGC GTCTGGAAAG
CTCGGCCTAG AAGAAATAGG TCATTTTCTC TGGTACGTTT TCGGCCTTAC TCAATTTTCT
CAGCTAGCCT TTTCCATGGG TCCCACAGAA CAAACGGTAA ACCTAATGCA CTTGTACCGG
CGGTTTGTTC CCTCCGGCGG GGCGTTGTAT CCAAACGAAT TATACGTGTA TTTGAAAATA
AAGGATATTC CAGATGGAGT GTACCATTAC GATGTGGCAC ACCATCGCTT GGTGTTGTTG
CGGGAAGGCA ATTTCGATTC CTATCTAACT AGGGCGCTGG GCAATCGCTG TGACGTATCG
GCTTGTTTCG GTGTTGTTTT TGTATCGACA ATGTTTTGGA AGAATTTCTT TAAATACAAT
AATTTTGCTT ACCGTCTGCA AGGGCTGGAT GCTGGCGTGC TAATTGGACA GCTGTTGGAA
GCGGCGAAAC GGTTCGGCTT CGCATCGGCA GTGTATTTCC AATTTCTTGA CAGGGCCGTC
AACCATCTGC TTGGACTGTC CGAACAGGAG GAGAGTGTGT ATGCGGTTAT TCCATTATCT
TCGGAGTCTT CCATCACTTG GTTTGATAAC GATAATAACT TAAAAGAAAG TGCCTCTGCC
ACTGAATTGT GCAGAGAATT GCCAGCAGTT CAGCATCATT ACTATGTTCG GTCGCGAAGG
ATTATCGACT ATCCGATGCT GAGAAAAATG AATGAGGCAT CGATGTTGGA ATCGCCGCGA
TCATTTCGGC AGATTAAGGG AGATAAGAGA GATGCCTGTG GGATGCAAGC AGTAGTTCTG
CCTTGTGTGA AGCGGTTATC GTATGATCTG GCGTCAGTCT GCCAGAAGCG GCATTCACCA
GATATGGATT TTGTTTTGGG AAAGGTAAGC CAAGAACAAT TGGCAGCTTT GCTAAAAGAG
GCGACGCTTT CTTTCTCGTA TCGAAATGAT TTGGATGGAG AACACGAGAA GCCGCCGTCC
CGTGTCTCCC TGTATGGCTG TTTTTATAAC GTTGAAGGCG TTCCAGATGG AGCTTACTAC
TATGACAGTG CTGCTCATGC GCTAGGTCGG ATACGTTCCG GAGATTATCG ACACTACCTG
CAATACGGAA TGTCAATGGA CAATGTAAAT CTATTCCAAG TACCGCTCTG TCTACACGTG
GCAGGAGACA GGGATCACCT CCAAATGGCA TTAGGGTACA GAGGATATCG CATTCAACAG
ATGGAGGCGG GGATGCTCGT GCAACGACTG CTCTTGGTGG CGTCTGCCAT GGGGATGGGT
GGGCACCCGC TTCTCGGATT TGATGTAAAC TTATGCGATA AACTTTACAA GATCGATTCG
CAAGGGAAAA CAAGCTTAAT CCAAATCCCG ATCGGACCCT ATCGTCCCCG CGCCTGGTTA
AAAGGGAGTT TGCGCAGCTA G
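
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be joined before it can be analysed. The sketch below shows one way to do that and to compute a GC fraction and check for a terminal stop codon. The helper names are hypothetical, and the demo runs only on short fragments copied from the record, not on the full gene.

```python
# Stop codons of NCBI translation table 11 (the table listed in the record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Fragments copied from the first and last lines of the gene sequence.
first_blocks = clean_sequence("ATGGAGCTAG AGACATTTTT")
last_blocks = clean_sequence("AAAGGGAGTT TGCGCAGCTA G")

print(first_blocks, gc_content(first_blocks))  # ATGGAGCTAGAGACATTTTT 0.35
print(ends_with_stop(last_blocks))             # True
```

Applying `clean_sequence` to the whole gene sequence and passing the result to `gc_content` gives a value to compare with the GC content field of the record.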
 
Protein sequence
MELETFLHHL HFDIDKIMPP NWEVDWEDAP LAYKLYRNLP VIPLSPEVPL TLEGREASGK 
LGLEEIGHFL WYVFGLTQFS QLAFSMGPTE QTVNLMHLYR RFVPSGGALY PNELYVYLKI
KDIPDGVYHY DVAHHRLVLL REGNFDSYLT RALGNRCDVS ACFGVVFVST MFWKNFFKYN
NFAYRLQGLD AGVLIGQLLE AAKRFGFASA VYFQFLDRAV NHLLGLSEQE ESVYAVIPLS
SESSITWFDN DNNLKESASA TELCRELPAV QHHYYVRSRR IIDYPMLRKM NEASMLESPR
SFRQIKGDKR DACGMQAVVL PCVKRLSYDL ASVCQKRHSP DMDFVLGKVS QEQLAALLKE
ATLSFSYRND LDGEHEKPPS RVSLYGCFYN VEGVPDGAYY YDSAAHALGR IRSGDYRHYL
QYGMSMDNVN LFQVPLCLHV AGDRDHLQMA LGYRGYRIQQ MEAGMLVQRL LLVASAMGMG
GHPLLGFDVN LCDKLYKIDS QGKTSLIQIP IGPYRPRAWL KGSLRS