Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BT9727_2599 |
Symbol | soxA |
ID | 2855823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | + |
Start bp | 2673299 |
End bp | 2674534 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637514022 |
Product | sarcosine oxidase, alpha subunit |
Protein accession | YP_036924 |
Protein GI | 49477876 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0492] Thioredoxin reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGATG TCATAATTAT CGGAGCTGGA CCAGCGGGTT TATCAGCCTC TATCTCTTGT GCTCGTTTTG GACTAAATGT ACTTGTTATT GATGAATTTA TGAAGCCTGG CGGGAGATTG TTAGGACAAT TACATCAAGA GCCTTCTGGA GAATGGTGGA ACGGAATAGA AGAATCGAAG CGACTTCAAG AAGAAGCAAA ATCACTTTCA GTTCATATTC GATGTGGTAT TTCCGTCTAT AATTTAGATA AAGATGAAAG TTGTTGGTTC GTACATACGA ATATCGGTAC GTTAGAAGCA CCGTTCATTT TACTTGCTAC TGGCGCTGCT GAGTACTCCA TTCCCCTTCC CGGTTGGACA CTTCCAGGAG TCATGTCAAT CGGGGCAGCC CAAGTAATGA CAAATGTTCA TCGCGTTCAA GTTGGGGAAA AAGGAATCAT TATTGGTGCT AACATTTTGT CATTCGCTAT TTTGAATGAA TTACAATTAG CAGGCATCAA AGTTGAACAT ATCGTACTTC CTGAAAAAAG TGAATTAAGT CAAAAGGCTG GAGAACCGGA AGAAGTTTTA AATTCTCTAT TACATGCTGC ACACCTTGCC CCGTCACCTT TATTACGTAT AGGTAGTAAA TTGATGAAAT ATAATTGGGC TAAACAAGCT GGATTAACCT TCTATCCAAA CAGCGGAATG AAAATAAACG GTACACCTCT TCACCTTCGA AAAGCAGCGC TTGAAATTAT CGGAACAGAT CAAGTTGAAG GTGTACGCAT TGTTAATATT GATACGAAAG GAAATATCGT AACTGGATCG GAACAAATAT ATGAGGCAGA CTTCGTTTGT ATTGCCGGTG GCTTATATCC TCTTGCTGAA CTAGCTGCTG TAGCTGGCTG CCCTTTCCGT TACATTCCGG AATTAGGCGG TCATGTTCCG CTCCATTCTG AAACAATGGA AACCCCTCTT TCTGGTTTAT TTGTAGCCGG CAATATAACT GGCATTGAAA GCGGGAAAAT TGCAATGGCA CAAGGAACTG TTGCTGGATA TTCAATTGTA AAACAAGCAA ATAAAAAATC TCATTCGGTT GAACAACACT TACAACAAGC GATCCAACAT GTACATGCCG TACGTCAACA AGCTGCTATT CAATTTAATC CGATGGTTGA TATCGGTAGA CGGAAGATGA ATGAAATTTG GCAGGATTAT TCCCTTGCAT ATGCCCATAC TAAAAAGAGC TGCTGA
|
Protein sequence | MNDVIIIGAG PAGLSASISC ARFGLNVLVI DEFMKPGGRL LGQLHQEPSG EWWNGIEESK RLQEEAKSLS VHIRCGISVY NLDKDESCWF VHTNIGTLEA PFILLATGAA EYSIPLPGWT LPGVMSIGAA QVMTNVHRVQ VGEKGIIIGA NILSFAILNE LQLAGIKVEH IVLPEKSELS QKAGEPEEVL NSLLHAAHLA PSPLLRIGSK LMKYNWAKQA GLTFYPNSGM KINGTPLHLR KAALEIIGTD QVEGVRIVNI DTKGNIVTGS EQIYEADFVC IAGGLYPLAE LAAVAGCPFR YIPELGGHVP LHSETMETPL SGLFVAGNIT GIESGKIAMA QGTVAGYSIV KQANKKSHSV EQHLQQAIQH VHAVRQQAAI QFNPMVDIGR RKMNEIWQDY SLAYAHTKKS C
|
| |