Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3404 |
Symbol | |
ID | 7976183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3434646 |
End bp | 3435860 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644800168 |
Product | 2-alkenal reductase |
Protein accession | YP_002951307 |
Protein GI | 239828683 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000199596 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATATT ACGATGATCA CTACGAATAC CAGCAAAAAC AAAAGGGAAA CCGTGGCCGC TGGTTTTTTT CCGCGTTAGT TGGCGCTGTT TTAGGAGGGT TATTAGTAGT AATTTCTGTT CCAGCTCTTT CAAAATGGAA TGTGCTTCCT TATCAAGTTA CACCGAGAGA GAGTGAACAG GTACAAAATG AAGAAACAGC AAAAGAACCT GCTATACGGC AACAAGTTTC TGTCGATGTG TCAAGCCAAG TAACAAAAGC GATTGATAAA GTATCCGATG CGGTTGTTGG CATTGTAAAC ATTCAAGCAG CGAACTTTTG GTCGCAGGGC GGAGAAGCGG GAACTGGTTC AGGCGTCATC TATAAAAAAG AAAATGGGAA AGCATTTATT GTGACGAACC ATCATGTTGT CGAAAACGCC AGTGAATTAG AAGTAAGCTT AAAAGATGGA ACAAGAGTGC CAGCGAAGCT GTTGGGAAGC GATGTGTTAA TGGACTTAGC AGTATTGGAA ATTGATGCGA AGCATGTCAA AAAAGTAGCT GAGTTTGGCA ATTCAGATAC AGTCAAACCG GGAGAACCAG TCATTGCGAT CGGCAATCCG CTTGGGTTGC AATTCGCTGG CTCTGTTACG CAAGGAATTA TATCAGGAAC GAATCGAACC GTAGAAGTTG ACTTAGACCA AGATGGTACT CCGGATTGGA ATGCAGAAGT ATTGCAAACG GATGCTGCCA TTAACCCGGG CAATAGCGGT GGCGCCCTTG TCAATATTCA AGGGCAAGTT ATTGGCATTA ACTCCATGAA AATTGCCCAA GAAGCAGTGG AGGGAATTGG ATTTGCCATC CCAATCAATA CAGCGATTCC GGTAATTTCA GATTTAGAAA AATACGGACA AGTACGCCGC CCATATATGG GGGTAGAACT TCGCTCCTTA AGCGATATTT CTTCTTATCA TTTGCAAGCA ACCTTGCACT TGCCAAAAGA TGTAACAGAA GGTGTGGCTG TCATTCAAGT AGTGCCAATG TCTCCAGCAG CGCAAGCGGG ATTAAAGCAA TTTGATGTGA TCGTAGCATT AGATGATCAC AAAATTCGTG ATGTGTTAGA TTTAAGAAAA TATTTGTACA CAAAAAAATC GATTGGCGAT ACAATGAAAG TAACATTTTA TCGAGACGGC AAAAAACATA CAGTGACGAT AAAATTAGAG AAAGAATCGT TTTAA
|
Protein sequence | MGYYDDHYEY QQKQKGNRGR WFFSALVGAV LGGLLVVISV PALSKWNVLP YQVTPRESEQ VQNEETAKEP AIRQQVSVDV SSQVTKAIDK VSDAVVGIVN IQAANFWSQG GEAGTGSGVI YKKENGKAFI VTNHHVVENA SELEVSLKDG TRVPAKLLGS DVLMDLAVLE IDAKHVKKVA EFGNSDTVKP GEPVIAIGNP LGLQFAGSVT QGIISGTNRT VEVDLDQDGT PDWNAEVLQT DAAINPGNSG GALVNIQGQV IGINSMKIAQ EAVEGIGFAI PINTAIPVIS DLEKYGQVRR PYMGVELRSL SDISSYHLQA TLHLPKDVTE GVAVIQVVPM SPAAQAGLKQ FDVIVALDDH KIRDVLDLRK YLYTKKSIGD TMKVTFYRDG KKHTVTIKLE KESF
|
| |