Gene RSc1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc1103 
SymbolsoxA2 
ID1219915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp1159152 
End bp1162163 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content66% 
IMG OID637237469 
Productsarcosine oxidase subunit alpha 
Protein accessionNP_519224 
Protein GI17545822 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0428535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AAGACCGTCT CGGTACGGGT GGCCGTATCA ATCGTGCGAT TCCGCTGACG 
TTTACGTTCA ACGGCCGCAC GTATCAGGGT TTCCAGGGCG ACACGCTGGC GTCTGCGCTG
CTCGCGAATG GCGTGCATTT CGTCGCGCGC AGCTTCAAGT ACCACCGTCC GCGCGGGATC
ATGACGGCGG GCGTCGAGGA GCCGAACGCG GTGGTGCAGC TCGAGTCGGG CCCGTACAGC
GTGCCGAATG CGCGCGCGAC CGAGATCGAG CTATACCAGG GGCTCATCGC CACCAGCGTG
AACGCCGAAC CGTCGCTCGA AAACGATCGC TATGCGATCA GCCAGATGTT TTCGCGCTTC
CTGCCCGCCG GTTTCTACTA CAAGACCTTC ATGTGGCCGC GCAAGATGTG GCCGAAGTAC
GAAGAAAAGA TCCGCGAAGC GGCCGGCCTT GGCAAGGCGC CCGACATGCG CGACGCGGAC
CGCTACGACA AGTGTTACGC GCACTGCGAC GTGCTCGTCG TGGGTGGCGG CCCGACGGGG
CTCGCGGCCG CACACGCGGC GGCGATGGCC GGGGCGCGCG TCATCCTGGT TGAAGACCAG
CGCGAGCTTG GCGGCAGCCT GCTGTCGTGC CGCGCGGAAA TCGGCGGCAA GCCAGCGCTG
CAGTGGGTCG AGAAGATCGA AGCCCAGCTG CGCAAGCTGC CCGACGTGAG TATCCTCACG
CGCAGTACCG CGTTCGGCTA TCAGGATCAC AACCTCGTGA CCGTCACGCA GCGCCTGACG
GATCATCTGC CGATCTCGAT GCGCAAGGGC ACGCGCGAGC TGCTGTGGAA GGTCCGCGCC
AAGCGCGTCA TTCTCGCAAC GGGCGCGCAC GAGCGGCCGA TCGTGTTCGG CAACAACGAC
CTGCCGGGTG TGATGCTCGC GGGCGCCGTG TCCACGTACA TCCATCGCTT CGGCGTGCTG
CCGGGGCGTG ACGCCGTCGT GTTCACGAAC AACGACCGCG CCTACCAGAC CGCGCTCGAT
CTGAAGGCGT GCGGTGCGAA GGTCACGGTC GTCGACGCGC GCGCACCCGG CAACGGTGCG
CTGCCCGCCG TTGCGAAGCG CCAGGGCGTA ACGGTGATGC ATGGCGCGGT GATCACGGCT
GCGTCCGGCA AGTGGCGCGT GTCATCGGTC GACGTCGCGT CTTACGCGAA TGGGCAGGTG
GGCGGCAAGC AGAAGACGCT GCCGTGCGAC CTTGTCGCGA CGTCGGGCGG TTTCAGCCCG
GTGCTGCACC TGTTCGCGCA ATCGGGCGGC AAGGCCCAGT GGAACGATGA CAAGGCGTGC
TTCGTGCCGG GCAAGACCGT GCAGGCCGAG GCGAGCGTCG GCGCGGCAGC GGGTGAATTC
GCCCTTGCGC ACGCGCTGCA GCTTGCAGTG GATGCGGGGG CCGAGGCTGC ACAGGCGGCG
GGTTGCACGG CCGCGCAACG CGCTGTCGCA CCGCGGGTCG CGGAAACGGC CGAAGGCGCG
CTGCAACCGC TGTGGCTCAT CGGTAGCCGC GAGGCCGCTG CACGCGGGCC GAAGCAGTTC
GTTGACTTCC AGAATGACGT GGCGGTCACC GACATCCTGC TCGCCGCGCG CGAGGGTTTC
GAGTCGGTCG AGCACGTCAA GCGCTACACG GCGATGGGCT TCGGCACCGA TCAGGGCAAG
CTCGGCAACA TCAACGGGAT GGCGATTCTC GCCCAGGCGC TCGGCAAGTC GATTCCGGAA
ACGGGCACGA CGACGTTCCG CCCCAACTAC ACACCCGTGT CGTTCGGCAC GTTCGCGGGC
CGCGAGCTGG GCAACTTCCT CGACCCGGTC CGCAAGACCT GCATTCATGA GTGGCATGTC
GAGCACGGTG CGCTGTTCGA AGACGTCGGC AACTGGAAAC GACCCTGGTA TTTTCCGAAG
AACGGCGAAG ACCTGCATGC GGCCGTGAAG CGCGAGTGCC TGGCGGTGCG CAACGGTGTC
GGCATGCTCG ATGCGTCCAC GCTCGGCAAG ATCGACATCC AGGGCCCCGA CGCGGTGAAG
CTGCTGAACT GGGTGTACAC GAACCCGTGG GGCAAGCTCG ACGTCGGCAA GTGCCGCTAC
GGGCTGATGC TCGATGAGAA CGGGATGGTG TTCGACGACG GCGTGACCGT GCGCCTCGCC
GACCAGCATT TCATGATGAC GACCACGACG GGCGGCGCAG CGCGGGTGCT CACCTGGCTC
GAGCGCTGGC TGCAGACCGA GTGGCCCGAC ATGAAGGTGC GGCTCGCGTC CGTCACCGAC
CACTGGGCGA CGTTCGCGGT GGTCGGCCCG AAGAGCCGCA AGGTCGTGCA GAAGGTGTGC
CAGGACATCG ACTTCGGCAA CGAAGCGTTC CCGTTCATGA GCTATCGCAA CGGCACCGTC
GCGGGCGCCA AGGCGCGCGT GATGCGGATC AGCTTCTCGG GCGAACTGGC CTACGAAGTG
AACGTGCCGG CCAATGCCGG GCGCGCGGTG TGGGAAGCGC TGATGGCCGC GGGTGCCGAG
TTCGACATCA CGCCGTACGG CACCGAAACG ATGCACGTGC TGCGCGCGGA AAAGGGCTAC
ATCATCGTCG GCCAGGACAC CGACGGTTCG ATCACGCCAT CCGACCTCGG CATGGGCGGC
CTCGTCGCGA AGACGAAGGA CTGCCTCGGC AAGCGTTCGC TCGCGCGTTC CGATACCGCA
AAGGCGGGCC GCAAGCAGTT CGTCGGCCTG TTGACCGACG ATGCGCAGTG CGTGCTGCCG
GAGGGCGCGC AGATCATCGA CAAGGACACG CAGGTCCGCG TGACGGAACC GACGCCGATG
ATCGGCCACG TGACGTCGAG CTACTACAGC CCGATCCTGC AACGTTCGAT CGCGCTGGCG
GTGGTGAAGG GTGGTCTGGG CAAGATGGGC GAGAGCGTCG TGATTCCGCT GGCCAACGGC
AGGCGTGTCA CCGCGAAGAT CGCGAGCCCG GTTTTCTACG ATACGGAAGG GGTGCGTCAG
CATGTGGAAT GA
 
Protein sequence
MSQKDRLGTG GRINRAIPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI 
MTAGVEEPNA VVQLESGPYS VPNARATEIE LYQGLIATSV NAEPSLENDR YAISQMFSRF
LPAGFYYKTF MWPRKMWPKY EEKIREAAGL GKAPDMRDAD RYDKCYAHCD VLVVGGGPTG
LAAAHAAAMA GARVILVEDQ RELGGSLLSC RAEIGGKPAL QWVEKIEAQL RKLPDVSILT
RSTAFGYQDH NLVTVTQRLT DHLPISMRKG TRELLWKVRA KRVILATGAH ERPIVFGNND
LPGVMLAGAV STYIHRFGVL PGRDAVVFTN NDRAYQTALD LKACGAKVTV VDARAPGNGA
LPAVAKRQGV TVMHGAVITA ASGKWRVSSV DVASYANGQV GGKQKTLPCD LVATSGGFSP
VLHLFAQSGG KAQWNDDKAC FVPGKTVQAE ASVGAAAGEF ALAHALQLAV DAGAEAAQAA
GCTAAQRAVA PRVAETAEGA LQPLWLIGSR EAAARGPKQF VDFQNDVAVT DILLAAREGF
ESVEHVKRYT AMGFGTDQGK LGNINGMAIL AQALGKSIPE TGTTTFRPNY TPVSFGTFAG
RELGNFLDPV RKTCIHEWHV EHGALFEDVG NWKRPWYFPK NGEDLHAAVK RECLAVRNGV
GMLDASTLGK IDIQGPDAVK LLNWVYTNPW GKLDVGKCRY GLMLDENGMV FDDGVTVRLA
DQHFMMTTTT GGAARVLTWL ERWLQTEWPD MKVRLASVTD HWATFAVVGP KSRKVVQKVC
QDIDFGNEAF PFMSYRNGTV AGAKARVMRI SFSGELAYEV NVPANAGRAV WEALMAAGAE
FDITPYGTET MHVLRAEKGY IIVGQDTDGS ITPSDLGMGG LVAKTKDCLG KRSLARSDTA
KAGRKQFVGL LTDDAQCVLP EGAQIIDKDT QVRVTEPTPM IGHVTSSYYS PILQRSIALA
VVKGGLGKMG ESVVIPLANG RRVTAKIASP VFYDTEGVRQ HVE