Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK2564 |
Symbol | soxA |
ID | 3024013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 2687119 |
End bp | 2688354 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637546787 |
Product | sarcosine oxidase, alpha subunit |
Protein accession | YP_084153 |
Protein GI | 52142677 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0492] Thioredoxin reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.13728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATG TCATAATTAT CGGAGCTGGC CCAGCGGGTT TATCAGCCTC TATCTCTTGC GCTCGATTTG GACTGAATGT ACTTGTTATT GATGAATTTA TGAAGCCTGG CGGGAGATTG TTAGGACAAT TACATCAAGA GCCTTCTGGA GAATGGTGGA ACGGAATAGA AGAATCGAAG CGACTTCACG ATGAAGCAAA ATCACTTTCA GTTCATATTC GATGCGGTAT TTCCGTCTAT AATTTAGATA AAGATGAAAG TTGTTGGTTC GTACATACGA ATATCGGTAC GTTAGAAGCA CCGTTCGTTT TACTTGCTAC TGGCGCTGCT GAGTACTCCA TTCCCCTTCC CGGTTGGACA CTTCCAGGAG TCATGTCAAT CGGGGCAGCC CAAGTAATGA CAAATGTTCA TCGCGTTCAA GTTGGGGAAA AAGGAATCAT TATTGGTGCT AACATTTTGT CATTCGCTAT TTTGAATGAA TTACAATTAG CAGGCATCAA AGTTGAACAT ATCGTACTTC CTGAAAAAAG TGAATTAAGT CAAAAGGCTG GAGAACCGGA AGAAGTTTTA AATTCTCTAT TACATGCTGC ACACCTTGCC CCGTCACCTT TATTACGTAT AGGTAGTAAA TTGATGAAAT ATAATTGGGC TAAACAAGCT GGATTAACCT TCTATCCAAA CAGCGGAATG AAAATAAACG GTACACCTCT TCACCTTCGA AAAGCAGCGC TTGAAATTAT CGGAACAGAT CAAGTTGAAG GTGTACGCAT TGTTAATATT GATACGAAAG GAAATATCGT AACTGGATCG GAACAAATAT ATGAGGCAGA CTTCGTTTGT ATTGCCGGTG GCTTATATCC TCTTGCTGAA CTAGCTGCTG TAGCTGGCTG CCCTTTCCGT TACATTCCGG AATTAGGCGG TCATGTTCCG CTCCATTCTG AAACAATGGA AACCCCTCTT TCTGGTTTAT TTGTAGCCGG CAATATAACT GGCATTGAAA GCGGGAAAAT TGCAATGGCA CAAGGAACTG TTGCTGGATA TTCAATTGTA AAACAAGCAA ATAAAAAATC TCATTCGGTT GAACAACACT TACAACAAGC GATCCAACAT GTACATGCCG TACGTCAACA AGCTACTATT CAATTTAATC CGATGGTTGA TATCGGTAGA CGGAAGATGA ATGAAATTTG GCAGGATTAT TCCCTTGCAT ATGCCCATAC TAAAAAGAGC TGCTGA
|
Protein sequence | MNDVIIIGAG PAGLSASISC ARFGLNVLVI DEFMKPGGRL LGQLHQEPSG EWWNGIEESK RLHDEAKSLS VHIRCGISVY NLDKDESCWF VHTNIGTLEA PFVLLATGAA EYSIPLPGWT LPGVMSIGAA QVMTNVHRVQ VGEKGIIIGA NILSFAILNE LQLAGIKVEH IVLPEKSELS QKAGEPEEVL NSLLHAAHLA PSPLLRIGSK LMKYNWAKQA GLTFYPNSGM KINGTPLHLR KAALEIIGTD QVEGVRIVNI DTKGNIVTGS EQIYEADFVC IAGGLYPLAE LAAVAGCPFR YIPELGGHVP LHSETMETPL SGLFVAGNIT GIESGKIAMA QGTVAGYSIV KQANKKSHSV EQHLQQAIQH VHAVRQQATI QFNPMVDIGR RKMNEIWQDY SLAYAHTKKS C
|
| |