Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5746 |
Symbol | soxB |
ID | 7381627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 774286 |
End bp | 775536 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643649299 |
Product | sarcosine oxidase beta subunit |
Protein accession | YP_002547536 |
Protein GI | 222106745 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.267033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTATT CGGCTTTGTC GATTTTTCTG AACGGTCTTC GTGGCAATAA AAACTGGGCC GCCCATTGGC GACAGCCGGA GCCAAAGCCC CATTATGATG TGGTGATTGT TGGCGGCGGT GGCCATGGGC TGGCCACCGC CTATTACCTT TCCAAAACCT TCGGCATCAA AAACATCGCC GTGGTGGAAA AGGGCTATAT CGGTTCCGGC AATGTCGGGC GCAACACCAC TATCATCCGC TCCAATTATC TGTTGCCGGG CAACAATCCC TTCTACGAAT TCTCCATGAA ACTATGGGAA GGGCTGGAGC AGGATTTCAA TTTCAATGCC ATGGTGTCGC AGCGCGGTGT GGTCAATCTC TACCATTCTG ATGCCCAGCG TGATGCCTAT ACAAGGCGCG GCAATGCCAT GCGCCTGCAT GGCGTGGATG CCGAATTGCT GGACCGCGAC GCCATCAAGG CCATGCTGCC GTTTCTGGAT TACGACAATG CTCGCTTTCC CATTCAAGGC GGTTTGATGC AGCGGCGCGG TGGCACGGTG CGCCATGATG CCGTGGCTTG GGGCTATGCT CGGGGTGCGG ATACTCACGG GGTAGACATT CTTCAAAACT GCGAAGTCAC TGGAATTCGC CGCGAAAATG GTAAGGCGGT GGGGGTGGAA ACCACACGTG GCTTCATCGG TTGCGGCAAG CTGGCGCTGG CGGCTGCGGG CAATTCATCC GGCGTGGCGG AAATGGCGGG GTTGAAATTG CCGATGGAAA GCCATGTGCT GCAAGCCTTT GTCTCGGAGG GGCTAAAACC CTTTATCGAT TGCGTCGTCA CCTTTGGCGC GGGCCATTTC TACGTATCAC AATCGGATAA GGGCGGGCTG GTGTTTGGCG GCGATATTGA TGGCTATAAT TCTTACGCCC AGCGCGGCAA TCTCGCCACC GTTGAGCATG TGGCGGAAGC GGGCAAGGCG ATGATTCCGG CGCTGTCGCG CATCCGCGTA CTGCGCAGCT GGGGCGGTGT GATGGACATG AGCATGGATG GCTCGCCGAT TATCGACAAG GTGCATCTGG ACAATCTCTA TCTCAATTCC GGCTGGTGCT ATGGCGGCTT CAAAGCCACG CCTGCCTCGG GCTATTGCTT TGCCCATCTC ATCGCCAAGG GCGAGAGCCA CGAAACGGCG CGTGCCTTCC GGCTGGATCG GTTTGAACGC GGCCACATCA TCGATGAAAA GGGCCAGGGT GCCCAGCCGA ACCTTCACTA A
|
Protein sequence | MRYSALSIFL NGLRGNKNWA AHWRQPEPKP HYDVVIVGGG GHGLATAYYL SKTFGIKNIA VVEKGYIGSG NVGRNTTIIR SNYLLPGNNP FYEFSMKLWE GLEQDFNFNA MVSQRGVVNL YHSDAQRDAY TRRGNAMRLH GVDAELLDRD AIKAMLPFLD YDNARFPIQG GLMQRRGGTV RHDAVAWGYA RGADTHGVDI LQNCEVTGIR RENGKAVGVE TTRGFIGCGK LALAAAGNSS GVAEMAGLKL PMESHVLQAF VSEGLKPFID CVVTFGAGHF YVSQSDKGGL VFGGDIDGYN SYAQRGNLAT VEHVAEAGKA MIPALSRIRV LRSWGGVMDM SMDGSPIIDK VHLDNLYLNS GWCYGGFKAT PASGYCFAHL IAKGESHETA RAFRLDRFER GHIIDEKGQG AQPNLH
|
| |