Gene RSP_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2689 
SymbolsoxA 
ID3720380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp1333021 
End bp1336002 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content70% 
IMG OID640070865 
Productputative sarcosine oxidase, alpha subunit 
Protein accessionYP_352745 
Protein GI77463241 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGC GTCTCGCACG GGGCGGACGG CTGATCGACC GCAGCCGCGC CATCGACTTC 
ACCTTCAACG GCAAGCGGAT GCGCGGCTTC GCGGGCGACA CGCTGGCCGC GGCGCTTCTC
GCCAACGACC AGATGCTGGT CGGGCGCAGC TTCAAGTATC ACCGTCCGCG CGGCATCGTG
GCGGCGGGCG CCGAAGAGCC GAACGCGCTG GTCCAGCTCG GCACCGGCGG CCGGTCGGAG
CCGAACCAGC GCACCACCAC GACCGAGCTT TTCGCGGGGC TGACGGCGGC CAGCCAGAAC
CACTGGCCGA GCCTCGAATT CGACGTGGGC GCGGTCAATG CGGCAGCGTC GCGCTTCCTG
CCCGCAGGCT TCTACTACAA GACCTTCCTG CAGCCGCGCG CCGCGTGGAA ACACCTGTTC
GAGCCGGTCA TCCGCCGCTC GGCGGGCCTC GGACGGCCGC CGGAAGAACC CGATGCGGAC
CGCTACGAGC AGGCCTATGC CTTCTGCGAC CTCCTTGTGG TGGGCGGAGG CATCGCGGGG
CTTCAGGCAG CGGTCAGCGC CTCGGCCTCG GGCCGGAAGG TGATGCTCCT CGAGCAGACG
CCGCACTGGG GCGGACGCGC CCCTGTCGAT GACGTGCTGA TCGACGGGCG CCCGGCGGCG
GACTGGGTGG CGGACACGGT GGCGGCACTC GAGGCCGCGC CCAATGTCAC GCTCCGCACC
CGCTGCATGG CGGCCGGCGT CTACGACCAC GGCTATGTGC TGGCCGAGGA GCGCGTGGCC
GATCATACGC CGGGCGACGG ACGGCCGAAG AAGCGCCTCT GGCGCATCCG CGCGGGCAAG
GTCCTCACGG CGACGGGAGC CATCGAGCGT CCGCTGCCCT TCGCCGGCAA CGATATTCCG
GGCGTCATGC TCGCCTCGGC GGTGCGCGAC TATCTGGTGA ACTGGGCCGT CTCTCCCGGC
GACCGGGTGG TGATCGTCAC GAACAACGAC GACGCCTACC GCACCGGCAT CGCCGTTCAC
CGCGCGGGGC TGACCGTGCC GGCCGTGCTC GATGCGCGGG CCGAGGCCCA CGGCGCGCTG
CCCGAAGAGG TGCGCAGCCT CGGCATCCCG GTCCTCACCA ATCGCGCGGT GGCCAAGGTC
AAGGGCGGCA AGCGCGTGAC CGGCGTCACC GTCTGCGCGC AGGCGGGCGA GGGGGCGGTG
CTCGACGATT TCGACTGCGA TGCGGTCGCC ATGTCGGGCG GCTGGTCGCC GGTCGTCCAT
CTCTGGAGCC ACTGCGGCGG CAAGCTGATC TGGGACGAGG CGCAGGCGGC CTTCCGGCCC
GACCCGGCCC GCCCGCCGAT CACCCATGAC GGCTCGCCGA TGGTGGCGGT TGCGGGCTCG
GCCAATGGCG AGCTGCTCTC GGCCGATGTG CTGGCCGATG CCCTCCGCGC CGCGGGCGGC
GAGGGCGAGG CGCCCCGGGC GCAGAGCCCC GAGGAGGCGC CGACCGAGCC GGTCTGGATC
ATGCCGCAGG GCGCCCCGCC GGCGCTCCGC TCGAAGATGT GGCTCGACTA TCAGAACGAC
GTGAAAGTGT CGGACGTGCA GCTCGCCGCC CGCGAGGGCT ACGAGTCGGT CGAGCACACC
AAGCGCTATA CGACGCTCGG CATGGCCACC GATCAGGGCA AGCTCAGCAA CATCAACGGG
CTGGCCGTGC TCGCAGGCTC GCTCAATGCG CCGATCCCCG CGGTGGGCAC CACCACCTTC
CGCCCGCCCT ACACGCCCGT CACCCTCGGC GCGCTGGTGG GCGAGGCGCG GGGCGAGATC
TTCCAGCCGC TGCGCCGTAC GCCGATGCAC GACTGGCACG AGGCCCATGG CGCCTACTGG
GAGCCGGTGG GCCTCTGGCG CCGGCCCTAC TGCTACAGCC GTCCGGGCGA GAGCCATGGC
GATGCGGTGG CCCGCGAGGT CACCAACGCG CGCACGAAGC TCGGTCTCCT CGACGCCTCG
ACGCTGGGCA AGATCCTCGT GAAGGGGCCC GATGCGGGCC GCTTCCTCGA CATGCTCTAC
ACCAACGTCA TGTCGAGCCT GCCGGTCGGG CGCTGCCGCT ACGGCCTCAT GTGCAACGAG
AACGGCTTCC TGATGGACGA TGGCGTGGTG GTCCGTCTCT CCGAGGACAG CTGGCTCTGC
CACACGACCT CGGGCGGGGC GGACCGGATC CATGCCCATA TGGAGGACTG GCTCCAGTGC
GAATGGTGGG ACTGGCAGGT CTATACCGCC AATCTCACCG AGCAGTTCGC GCAGGTGGCC
ATCGTCGGCC CCAACGCGCG CCTGCTGCTG GAAAAGCTCG GCGGCATGGA TGTCTCGAAG
GAGGCGCTGC CCTTCATGCA CTGGGCGGAA GGCACCATCG CGGGCATTCC CGCGCGGGTG
TTCCGCATCA GCTTCTCGGG CGAGCTCTCC TACGAGGTGG CGGTGCCCGC GGGGCAGGGG
CTGGCCTTCT GGCAGGCCTG CCTCGAGGCG GGCGCCGAAT TCGGCCTCAT GCCCTACGGC
ACCGAGGCGC TGCATGTGAT GCGGGCCGAG AAGGGCTTCA TCATGATCGG CGACGAGACC
GACGGCACGG TGGTGCCGCA GGACCTGAAC CTCGGCTGGG CGATCTCGAA GAAGAAGGCG
GATTTCATCG GCAAGCGCGG CATGGAGCGA ACCTTCCTGT CGAGCCCCGA CCGCTGGAAG
CTCGTGGGTC TCGAGACGCT CGACGGCTCG GTCCTGCCCG ACGGCGCCAT CGCGCCCGCG
GCGGGCTCGA ATGGGAACGG GCAGCGCAAC ACTCAGGGCC GCGTGACCTC GACCTACTGG
TCGCCCACGC TGAAGAAGGG CATCGCCATG GGCCTCGTCC ATCGCGGGAC GGACCGGATG
GGCGAGGTGA TCGAGTTCCC GAAGATCTGG GGTGGCGTGG TGCAGGCGCG GATCGTCGAT
CCGGTGTTCT ACGACAAGGC GGGAGAGAAG CAGGATGTCT GA
 
Protein sequence
MSTRLARGGR LIDRSRAIDF TFNGKRMRGF AGDTLAAALL ANDQMLVGRS FKYHRPRGIV 
AAGAEEPNAL VQLGTGGRSE PNQRTTTTEL FAGLTAASQN HWPSLEFDVG AVNAAASRFL
PAGFYYKTFL QPRAAWKHLF EPVIRRSAGL GRPPEEPDAD RYEQAYAFCD LLVVGGGIAG
LQAAVSASAS GRKVMLLEQT PHWGGRAPVD DVLIDGRPAA DWVADTVAAL EAAPNVTLRT
RCMAAGVYDH GYVLAEERVA DHTPGDGRPK KRLWRIRAGK VLTATGAIER PLPFAGNDIP
GVMLASAVRD YLVNWAVSPG DRVVIVTNND DAYRTGIAVH RAGLTVPAVL DARAEAHGAL
PEEVRSLGIP VLTNRAVAKV KGGKRVTGVT VCAQAGEGAV LDDFDCDAVA MSGGWSPVVH
LWSHCGGKLI WDEAQAAFRP DPARPPITHD GSPMVAVAGS ANGELLSADV LADALRAAGG
EGEAPRAQSP EEAPTEPVWI MPQGAPPALR SKMWLDYQND VKVSDVQLAA REGYESVEHT
KRYTTLGMAT DQGKLSNING LAVLAGSLNA PIPAVGTTTF RPPYTPVTLG ALVGEARGEI
FQPLRRTPMH DWHEAHGAYW EPVGLWRRPY CYSRPGESHG DAVAREVTNA RTKLGLLDAS
TLGKILVKGP DAGRFLDMLY TNVMSSLPVG RCRYGLMCNE NGFLMDDGVV VRLSEDSWLC
HTTSGGADRI HAHMEDWLQC EWWDWQVYTA NLTEQFAQVA IVGPNARLLL EKLGGMDVSK
EALPFMHWAE GTIAGIPARV FRISFSGELS YEVAVPAGQG LAFWQACLEA GAEFGLMPYG
TEALHVMRAE KGFIMIGDET DGTVVPQDLN LGWAISKKKA DFIGKRGMER TFLSSPDRWK
LVGLETLDGS VLPDGAIAPA AGSNGNGQRN TQGRVTSTYW SPTLKKGIAM GLVHRGTDRM
GEVIEFPKIW GGVVQARIVD PVFYDKAGEK QDV