Gene GWCH70_3404 details

Gene Information

Locus tag: GWCH70_3404
Symbol:
ID: 7976183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3434646
End bp: 3435860
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 42%
IMG OID: 644800168
Product: 2-alkenal reductase
Protein accession: YP_002951307
Protein GI: 239828683
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
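
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the gene is an untranslated stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields.
start_bp = 3434646
end_bp = 3435860

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1215 404
```

Under these assumptions the result agrees with the listed Gene Length (1215 bp) and Protein Length (404 aa).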


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000199596
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATATT ACGATGATCA CTACGAATAC CAGCAAAAAC AAAAGGGAAA CCGTGGCCGC 
TGGTTTTTTT CCGCGTTAGT TGGCGCTGTT TTAGGAGGGT TATTAGTAGT AATTTCTGTT
CCAGCTCTTT CAAAATGGAA TGTGCTTCCT TATCAAGTTA CACCGAGAGA GAGTGAACAG
GTACAAAATG AAGAAACAGC AAAAGAACCT GCTATACGGC AACAAGTTTC TGTCGATGTG
TCAAGCCAAG TAACAAAAGC GATTGATAAA GTATCCGATG CGGTTGTTGG CATTGTAAAC
ATTCAAGCAG CGAACTTTTG GTCGCAGGGC GGAGAAGCGG GAACTGGTTC AGGCGTCATC
TATAAAAAAG AAAATGGGAA AGCATTTATT GTGACGAACC ATCATGTTGT CGAAAACGCC
AGTGAATTAG AAGTAAGCTT AAAAGATGGA ACAAGAGTGC CAGCGAAGCT GTTGGGAAGC
GATGTGTTAA TGGACTTAGC AGTATTGGAA ATTGATGCGA AGCATGTCAA AAAAGTAGCT
GAGTTTGGCA ATTCAGATAC AGTCAAACCG GGAGAACCAG TCATTGCGAT CGGCAATCCG
CTTGGGTTGC AATTCGCTGG CTCTGTTACG CAAGGAATTA TATCAGGAAC GAATCGAACC
GTAGAAGTTG ACTTAGACCA AGATGGTACT CCGGATTGGA ATGCAGAAGT ATTGCAAACG
GATGCTGCCA TTAACCCGGG CAATAGCGGT GGCGCCCTTG TCAATATTCA AGGGCAAGTT
ATTGGCATTA ACTCCATGAA AATTGCCCAA GAAGCAGTGG AGGGAATTGG ATTTGCCATC
CCAATCAATA CAGCGATTCC GGTAATTTCA GATTTAGAAA AATACGGACA AGTACGCCGC
CCATATATGG GGGTAGAACT TCGCTCCTTA AGCGATATTT CTTCTTATCA TTTGCAAGCA
ACCTTGCACT TGCCAAAAGA TGTAACAGAA GGTGTGGCTG TCATTCAAGT AGTGCCAATG
TCTCCAGCAG CGCAAGCGGG ATTAAAGCAA TTTGATGTGA TCGTAGCATT AGATGATCAC
AAAATTCGTG ATGTGTTAGA TTTAAGAAAA TATTTGTACA CAAAAAAATC GATTGGCGAT
ACAATGAAAG TAACATTTTA TCGAGACGGC AAAAAACATA CAGTGACGAT AAAATTAGAG
AAAGAATCGT TTTAA
 
Protein sequence
MGYYDDHYEY QQKQKGNRGR WFFSALVGAV LGGLLVVISV PALSKWNVLP YQVTPRESEQ 
VQNEETAKEP AIRQQVSVDV SSQVTKAIDK VSDAVVGIVN IQAANFWSQG GEAGTGSGVI
YKKENGKAFI VTNHHVVENA SELEVSLKDG TRVPAKLLGS DVLMDLAVLE IDAKHVKKVA
EFGNSDTVKP GEPVIAIGNP LGLQFAGSVT QGIISGTNRT VEVDLDQDGT PDWNAEVLQT
DAAINPGNSG GALVNIQGQV IGINSMKIAQ EAVEGIGFAI PINTAIPVIS DLEKYGQVRR
PYMGVELRSL SDISSYHLQA TLHLPKDVTE GVAVIQVVPM SPAAQAGLKQ FDVIVALDDH
KIRDVLDLRK YLYTKKSIGD TMKVTFYRDG KKHTVTIKLE KESF