Gene Information |
Locus tag | GWCH70_0354 |
Symbol | |
ID | 7977467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 404275 |
End bp | 405855 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644797345 |
Product | hypothetical protein |
Protein accession | YP_002948545 |
Protein GI | 239825921 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03605] SagB-type dehydrogenase domain |
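The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes (the record does not state this) that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(404275, 405855)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Both values match the Gene Length and Protein Length rows, so the record is consistent under these assumptions.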
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCTAG AGACATTTTT ACACCATCTT CATTTTGACA TCGATAAGAT TATGCCGCCA AATTGGGAGG TAGATTGGGA AGATGCGCCG CTTGCGTATA AGCTGTACCG TAACTTGCCA GTGATTCCGC TTTCTCCAGA AGTGCCGTTA ACACTCGAGG GAAGAGAAGC GTCTGGAAAG CTCGGCCTAG AAGAAATAGG TCATTTTCTC TGGTACGTTT TCGGCCTTAC TCAATTTTCT CAGCTAGCCT TTTCCATGGG TCCCACAGAA CAAACGGTAA ACCTAATGCA CTTGTACCGG CGGTTTGTTC CCTCCGGCGG GGCGTTGTAT CCAAACGAAT TATACGTGTA TTTGAAAATA AAGGATATTC CAGATGGAGT GTACCATTAC GATGTGGCAC ACCATCGCTT GGTGTTGTTG CGGGAAGGCA ATTTCGATTC CTATCTAACT AGGGCGCTGG GCAATCGCTG TGACGTATCG GCTTGTTTCG GTGTTGTTTT TGTATCGACA ATGTTTTGGA AGAATTTCTT TAAATACAAT AATTTTGCTT ACCGTCTGCA AGGGCTGGAT GCTGGCGTGC TAATTGGACA GCTGTTGGAA GCGGCGAAAC GGTTCGGCTT CGCATCGGCA GTGTATTTCC AATTTCTTGA CAGGGCCGTC AACCATCTGC TTGGACTGTC CGAACAGGAG GAGAGTGTGT ATGCGGTTAT TCCATTATCT TCGGAGTCTT CCATCACTTG GTTTGATAAC GATAATAACT TAAAAGAAAG TGCCTCTGCC ACTGAATTGT GCAGAGAATT GCCAGCAGTT CAGCATCATT ACTATGTTCG GTCGCGAAGG ATTATCGACT ATCCGATGCT GAGAAAAATG AATGAGGCAT CGATGTTGGA ATCGCCGCGA TCATTTCGGC AGATTAAGGG AGATAAGAGA GATGCCTGTG GGATGCAAGC AGTAGTTCTG CCTTGTGTGA AGCGGTTATC GTATGATCTG GCGTCAGTCT GCCAGAAGCG GCATTCACCA GATATGGATT TTGTTTTGGG AAAGGTAAGC CAAGAACAAT TGGCAGCTTT GCTAAAAGAG GCGACGCTTT CTTTCTCGTA TCGAAATGAT TTGGATGGAG AACACGAGAA GCCGCCGTCC CGTGTCTCCC TGTATGGCTG TTTTTATAAC GTTGAAGGCG TTCCAGATGG AGCTTACTAC TATGACAGTG CTGCTCATGC GCTAGGTCGG ATACGTTCCG GAGATTATCG ACACTACCTG CAATACGGAA TGTCAATGGA CAATGTAAAT CTATTCCAAG TACCGCTCTG TCTACACGTG GCAGGAGACA GGGATCACCT CCAAATGGCA TTAGGGTACA GAGGATATCG CATTCAACAG ATGGAGGCGG GGATGCTCGT GCAACGACTG CTCTTGGTGG CGTCTGCCAT GGGGATGGGT GGGCACCCGC TTCTCGGATT TGATGTAAAC TTATGCGATA AACTTTACAA GATCGATTCG CAAGGGAAAA CAAGCTTAAT CCAAATCCCG ATCGGACCCT ATCGTCCCCG CGCCTGGTTA AAAGGGAGTT TGCGCAGCTA G
|
Protein sequence | MELETFLHHL HFDIDKIMPP NWEVDWEDAP LAYKLYRNLP VIPLSPEVPL TLEGREASGK LGLEEIGHFL WYVFGLTQFS QLAFSMGPTE QTVNLMHLYR RFVPSGGALY PNELYVYLKI KDIPDGVYHY DVAHHRLVLL REGNFDSYLT RALGNRCDVS ACFGVVFVST MFWKNFFKYN NFAYRLQGLD AGVLIGQLLE AAKRFGFASA VYFQFLDRAV NHLLGLSEQE ESVYAVIPLS SESSITWFDN DNNLKESASA TELCRELPAV QHHYYVRSRR IIDYPMLRKM NEASMLESPR SFRQIKGDKR DACGMQAVVL PCVKRLSYDL ASVCQKRHSP DMDFVLGKVS QEQLAALLKE ATLSFSYRND LDGEHEKPPS RVSLYGCFYN VEGVPDGAYY YDSAAHALGR IRSGDYRHYL QYGMSMDNVN LFQVPLCLHV AGDRDHLQMA LGYRGYRIQQ MEAGMLVQRL LLVASAMGMG GHPLLGFDVN LCDKLYKIDS QGKTSLIQIP IGPYRPRAWL KGSLRS
|
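As a rough cross-check of the Sequence fields, the sketch below strips the 10-character grouping, computes GC content, and translates a nucleotide sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1. The two tables differ in permitted start codons, and this gene starts with ATG, so that difference should not matter here. The helper names are hypothetical and not part of the source database.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed valid for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two 10-base groups of the gene sequence from the record.
print(translate("ATGGAGCTAG AGACATTTTT"))  # MELETF
print(gc_content("ATGC"))                  # 0.5
```

Passing the full Gene sequence field to `translate` and `gc_content` allows a comparison against the Protein sequence and GC content rows of the record.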