Gene Bcer98_4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_4001 
Symbol 
ID5343422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp4058885 
End bp4060066 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content40% 
IMG OID640841481 
Product2-alkenal reductase 
Protein accessionYP_001377177 
Protein GI152977660 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTTTA TTGATGAAGA AAATCGTATG AAACGTACAA GGGAAAAGAA GCACAAAGGT 
CTTGTTATTT CTAGCATAGC GGGGACTGTT GTAGGAGCTT CATGGTTTGC ATTTGGAGCT
CCTTTGTTTT CAAAGTCCGA GACTAAAAAT ATGCAGCAAG CAGAAGCAAG TGAAGGAAAC
ATAGTGAATG AACCATCACA AATGATGCAA CATTCCGGCT TTGTAGATGC TGTTGATCGC
GCTTCCGAAG CGGTCGTTGG TGTAATCAAT ATTCAGCGTG ATCATTTTTC TGAAGTAGAT
GCAGAAGCTG GCACAGGATC AGGTGTTATT TATAAAAAAA CAAATGGTCA TGCTTATATT
GTCACAAATC ATCATGTCAT TGCTGGTGCG AATCGAATTG AAATTAGCTT AAGTGATGGG
ACAAAAGTTC CTGGAAAAAT ATTAGGAAGC GACATTGTGA CCGATTTAGC TGTTATAGAA
ATAAATGCGA AGCCTGTCAA AAAGGTAATT GAAATTGGTG ATTCGAATAC AGTACGTCGC
GGGGAAGCAG TTATTGCGAT CGGAAACCCA CTTGGATTAC AGTTTTCAGG GACAGTAACA
CAAGGAATTA TATCTGCAAA CGAACGAATT GTTCCTGTAG ATTTAGACCA AGATGGACAT
TATGACTGGC AAGTGGAAGT GTTACAGACC GATGCGGCTA TTAATCCAGG AAATAGTGGC
GGGGCGCTTA TAAATGCAGC GGGGAAACTG ATTGGAATTA ACTCCATGAA AATCGCTGCA
AAAGAAGTAG AAGGAATTGG TTTAGCGATT CCTGTGACAA GGGCCGTTCC TATTATGAAT
GAGCTTGAAA AGTATGGTAC TGTAAGAAGG CCATATATTG GAATTGAACT TCGGTCATTA
AATGAAATTC CAAACTACTA CTGGGAAGAA ACGTTACACT TGCCGGGTGG CGTTACAAAT
GGTGTGTGTA TTTTAGACGT GAAAAGTCCA TCACCAGGTG CTGCCGCAGG CCTTAGAGAG
CATGACGTTA TTGTCGCGGT AGATGGAAAG CCAATACAAG ATATCATTGC TTTTCGGACA
GCTTTATATA ATAAGAAAAT CGATGATAAA ATGACTCTTA CTTTTTATCG TGGTACAAAG
CGCTCCACAA CAACAGTGAA ATTGGCTAGT CAAAAATATT AA
 
Protein sequence
MSFIDEENRM KRTREKKHKG LVISSIAGTV VGASWFAFGA PLFSKSETKN MQQAEASEGN 
IVNEPSQMMQ HSGFVDAVDR ASEAVVGVIN IQRDHFSEVD AEAGTGSGVI YKKTNGHAYI
VTNHHVIAGA NRIEISLSDG TKVPGKILGS DIVTDLAVIE INAKPVKKVI EIGDSNTVRR
GEAVIAIGNP LGLQFSGTVT QGIISANERI VPVDLDQDGH YDWQVEVLQT DAAINPGNSG
GALINAAGKL IGINSMKIAA KEVEGIGLAI PVTRAVPIMN ELEKYGTVRR PYIGIELRSL
NEIPNYYWEE TLHLPGGVTN GVCILDVKSP SPGAAAGLRE HDVIVAVDGK PIQDIIAFRT
ALYNKKIDDK MTLTFYRGTK RSTTTVKLAS QKY