Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_3774 |
Symbol | soxB |
ID | 7388230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 3140646 |
End bp | 3141899 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643652545 |
Product | sarcosine oxidase beta subunit |
Protein accession | YP_002550726 |
Protein GI | 222149769 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAAT ATTCGGTTTT TGCCGTAGCC CGCGAAGCGC TTCGCGCCCA TACCGGCTGG AGTGCGCAAT GGACATCGCC AGAGCCACGC AAATCCTATG ATGTCATTAT TATTGGCGGT GGCGGCCATG GTCTGGGGGC TGCTTATTAT CTGGCCAAGG AACATGGCAT CACCAATATT GCCGTGATTG AAAAAAGCTG GATTGGCGGC GGCAACACCG GGCGCAACAC CACCATTATT CGCTCCAACT ATCTCTATGA AGAGAGCATG GACATTTACG AGCACTCCCT GAAACTCTGG GAGGGCTTGA GCCAGGAGCT TAATTACAAT GTGATGTATT CGGCCCGTGG CGTGATGATG CTGTCGCACA ATGTGCATGA CAAGCAGAGT TTTAAGCGCC ATGTTCATGC CAACACGCTC TATGGCATTG ATAATGAGTG GCTGACGCCG GAACAGGCTA AGGCCTTCTG TCCGCCGCTG GATATTGCCA AAACCGCCCG CTACCCCATC AATGGAGCCG CCCTGCAACG GCGCGGCGGC ACGGCCCGCC ATGATGCCGT GGCTTGGGGC TATGCCCGCG CAGCCTCTGA TCGCGGTGTG CATATCATTC AAAATTGCGA AGTGACGGGT ATTCGCCGGG GAGCAAGTGG TGAAGTCACG GGTGTTGATA CCTCCAAGGG CTTTATAGGT GCCAAGAAAA TCGGTGTTTC TGCCTCCGGC CATAATTCCA TGGTGATGGG CATGGCAGAT GTGCGCCTGC CGATTCATTC CACGCCGTTG CAGGCGCTGG TGTCGGAGCC GCTGAAGCCG ATTTTCCCCT GCGTGGTGAT GTCCAACACC GTGCATGCCT ATATCTCGCA ATCCGATAAG GGCGAGTTGG TGATTGGGGC TGGCACCGAC CAGTATAATT CCTATTCCCA GACAGGCGGC CTGCAAATCA TCACCCACAC GCTGGATGCG ATTTGCGAGC TGTTCCCCAT CTTCCGCCGC GTCAAGATGA TGCGCCAGTG GGGCGGGATT ACTGATAACA CCGCCGACCG CTCGCCCATC CAGAGCGTAA CCCCCGTGCC AAACCTGTTT GTCAATTGCG GCTGGGGTAC GGGCGGCTTC AAGGCCACAC CGGGCTCTGC CAATCTGTTT GCCCATCTGA TTGCCAAGGG CGAACCGCAT GCGCTGGCCA AGGGCCTGAC GCTGGATCGG TTCCGCACCG GGCGATTGAT TGATGAGGCG GCAGCCGCCG CCGTGGCACA CTAG
|
Protein sequence | MRKYSVFAVA REALRAHTGW SAQWTSPEPR KSYDVIIIGG GGHGLGAAYY LAKEHGITNI AVIEKSWIGG GNTGRNTTII RSNYLYEESM DIYEHSLKLW EGLSQELNYN VMYSARGVMM LSHNVHDKQS FKRHVHANTL YGIDNEWLTP EQAKAFCPPL DIAKTARYPI NGAALQRRGG TARHDAVAWG YARAASDRGV HIIQNCEVTG IRRGASGEVT GVDTSKGFIG AKKIGVSASG HNSMVMGMAD VRLPIHSTPL QALVSEPLKP IFPCVVMSNT VHAYISQSDK GELVIGAGTD QYNSYSQTGG LQIITHTLDA ICELFPIFRR VKMMRQWGGI TDNTADRSPI QSVTPVPNLF VNCGWGTGGF KATPGSANLF AHLIAKGEPH ALAKGLTLDR FRTGRLIDEA AAAAVAH
|
| |