Gene Rsph17029_1347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1347 
Symbol 
ID4896761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1395639 
End bp1398620 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content71% 
IMG OID640111934 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_001043229 
Protein GI126462115 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0394007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC GTCTCGCACG GGGCGGACGG CTGATCGACC GCAGCCGCGC CATCGACTTC 
ACCTTCAACG GCAGGCGGAT GCGCGGTTTC GCGGGCGACA CGCTCGCCGC GGCGCTTCTC
GCCAACGACC AGATGCTGGT CGGGCGCAGC TTCAAGTATC ACCGTCCGCG CGGCATCGTG
GCGGCGGGCG CCGAAGAGCC GAACGCGCTC GTCCAGCTCG GCACCGGCGG CCGGTCGGAG
CCGAACCAGC GCACCACCAC GACCGAGCTC TTTGCGGGGC TGACGGCGGC CAGCCAGAAC
CACTGGCCGA GCCTCGAATT CGACGTGGGC GCGGTCAATG CGGCGGCGTC GCGCTTCCTG
CCCGCAGGCT TCTACTACAA GACCTTCCTG CAGCCGCGCG CCGCGTGGAA ACACCTGTTC
GAGCCGGTCA TCCGCCGCTC GGCGGGCCTC GGACGGCCGC CGGAAGCGCC CGATGCGGAC
CGCTACGAGC AGGCCTATGC CTTCTGCGAC CTTCTCGTGG TGGGCGGCGG CATCGCGGGG
CTTCAGGCGG CGCTCAGCGC CTCGGCCTCG GGCCGGAAGG TGATGCTGCT CGAGCAGACG
CCGCACTGGG GCGGACGGGC CCCGGTGGAT GACGTGCTGA TCGACGGGCG CCCGGCGGCG
GACTGGGTGG CGGACACGGT GGCAGCGCTC GAGGCCGCGC CCAATGTCAC GCTCCGCACC
CGCTGCATGG CGGCCGGCGT CTACGACCAC GGCTATGTGC TGGCCGAGGA GCGCGTGGCC
GACCATACGC CGGGCGACGG ACGGCCGAAG AAGCGGCTCT GGCGCATCCG CGCGGGCAAG
GTCATCACGG CGACCGGCGC CATCGAGCGG CCGCTGCCCT TCGCGGGCAA CGACATTCCG
GGCGTCATGC TCGCCTCGGC GGTGCGCGAC TATCTGGTGA ACTGGGCCGT CTCGCCCGGC
GACCGGGTGG TGATCGTCAC GAACAACGAC GACGCCTACC GCACCGCCAT CGCCGTGCAC
CGTGCGGGGC TGACCGTGCC GGCCGTGCTT GATGCGCGGG CCGAGGCCCA CGGCGCGCTG
CCCGAAGAGG TGCGCAGCCT CGGCATCCCG GTCCTCACCA ACCGCGCGGT GGCCAAGGTC
AAGGGCAGCA AGCGGGTGAC GGGCGTCACC GTCTGCGCCC AAGCGGGCGA GGGGGCGGTG
CTCGACGATT TCGACTGCGA TGCGGTCGCC ATGTCGGGCG GCTGGTCGCC GGTCGTCCAT
CTCTGGAGCC ACTGCGGCGG CAAGCTGATC TGGGACGAGG CGCAGGCGGC CTTCCGGCCC
GACCCGGCCC GCCCGCCGAT CACCCATGAC GGCTCGCCGA TGGTGGCGGT TGCGGGCTCG
GCCAATGGCG AGATGCTCTC GGCCGACGTG CTGGCCGATG CCCTCCGCGC CGTGGGCGGC
GAGGGCGAGG CGCCCCGGGC GCAGAGCCCC GAGGAGGCGC CGACCGAGCC GGTCTGGATC
ATGCCGCAGG GCGCCCCGCC GGCGCTGCGC TCGAAGATGT GGCTCGACTA TCAGAACGAC
GTGAAAGTGT CGGACGTGCA GCTCGCCGCC CGCGAGGGCT ACGAGTCGGT CGAGCATACC
AAGCGCTACA CGACGCTCGG CATGGCCACC GATCAGGGCA AGCTCAGCAA CATCAACGGG
CTGGCCGTGC TCGCAGGCTC GCTCAATGCG CCGATCCCCG CGGTGGGCAC CACCACCTTC
CGCCCGCCCT ACACGCCCGT CACCCTCGGC GCGCTGGTGG GCGAGGCGCG GGGCGAGATC
TTCCAGCCGC TGCGCCGCAC GCCCATGCAC GACTGGCACG AGGCCAACGG CGCCTATTGG
GAGCCGGTGG GCCTCTGGCG CCGGCCCTAC TGCTACAGCC GTCCGGGCGA GAGCCATGGC
GATGCGGTGG CCCGCGAGGT CACCAACGCG CGCACGAAGC TCGGGCTGCT CGACGCCTCG
ACGCTGGGCA AGATCCTCGT GAAGGGGCCC GATGCGGGCC GCTTCCTCGA CATGCTCTAC
ACCAACGTCA TGTCGAGCCT GCCGGTCGGG CGCTGCCGCT ATGGCCTCAT GTGCAACGAG
AACGGCTTCC TGATGGACGA TGGCGTGGTG GTCCGTCTCT CCGAGGACAG CTGGCTCTGC
CACACGACCT CGGGCGGGGC GGACCGGATC CATGCCCATA TGGAGGACTG GCTCCAGTGC
GAATGGTGGG ACTGGCAGGT CTATACCGCC AATCTCACCG AGCAGTTCGC GCAGGTGGCC
ATCGTCGGCC CCAACGCGCG CCTGCTGCTG GAAAAGCTCG GCGGCATGGA TGTCTCGAAG
GAGGCGCTGC CCTTCATGCA CTGGGCGGAA GGCACCCTCG CGGGCATTCC CGCGCGGGTG
TTCCGCATCA GCTTCTCAGG CGAGCTCTCC TACGAGGTGG CGGTGCCCGC GGGGCAGGGG
CTGGCCTTCT GGCAGGCCTG CCTCGAGGCG GGCGCCGAAT TCGGCCTCAT GCCCTACGGC
ACCGAGGCGC TGCATGTGAT GCGGGCCGAG AAGGGCTTCA TCATGATCGG CGACGAGACC
GACGGTACGG TGGTGCCGCA GGACCTGAAC CTCGGCTGGG CGATCTCGAA GAAGAAGGCG
GACTTCATCG GCAAGCGCGG CATGGAGCGG ACCTTCCTGT CGAGCCCCGA CCGCTGGAAG
CTCGTGGGTC TCGAGACGCT CGACGGCTCG GTCCTGCCCG ATGGCGCCAT CGCGCCCGCG
GCGGGCTCGA ATGGGAACGG GCAGCGCAAC ACCCAGGGCC GCGTGACCTC GACCTACTGG
TCGCCCACGC TGAAGAAGGG GATCGCCATG GGCCTCGTCC ATCGCGGGAC GGACCGGATG
GGCGAGGTGA TCGAGTTCCC GAAGATCTGG GGCGGCGTGG TGCAGGCGCG GATCGTCGAT
CCGGTGTTCT ACGACAAGGC GGGAGAGAAG CAGGATGTCT GA
 
Protein sequence
MSTRLARGGR LIDRSRAIDF TFNGRRMRGF AGDTLAAALL ANDQMLVGRS FKYHRPRGIV 
AAGAEEPNAL VQLGTGGRSE PNQRTTTTEL FAGLTAASQN HWPSLEFDVG AVNAAASRFL
PAGFYYKTFL QPRAAWKHLF EPVIRRSAGL GRPPEAPDAD RYEQAYAFCD LLVVGGGIAG
LQAALSASAS GRKVMLLEQT PHWGGRAPVD DVLIDGRPAA DWVADTVAAL EAAPNVTLRT
RCMAAGVYDH GYVLAEERVA DHTPGDGRPK KRLWRIRAGK VITATGAIER PLPFAGNDIP
GVMLASAVRD YLVNWAVSPG DRVVIVTNND DAYRTAIAVH RAGLTVPAVL DARAEAHGAL
PEEVRSLGIP VLTNRAVAKV KGSKRVTGVT VCAQAGEGAV LDDFDCDAVA MSGGWSPVVH
LWSHCGGKLI WDEAQAAFRP DPARPPITHD GSPMVAVAGS ANGEMLSADV LADALRAVGG
EGEAPRAQSP EEAPTEPVWI MPQGAPPALR SKMWLDYQND VKVSDVQLAA REGYESVEHT
KRYTTLGMAT DQGKLSNING LAVLAGSLNA PIPAVGTTTF RPPYTPVTLG ALVGEARGEI
FQPLRRTPMH DWHEANGAYW EPVGLWRRPY CYSRPGESHG DAVAREVTNA RTKLGLLDAS
TLGKILVKGP DAGRFLDMLY TNVMSSLPVG RCRYGLMCNE NGFLMDDGVV VRLSEDSWLC
HTTSGGADRI HAHMEDWLQC EWWDWQVYTA NLTEQFAQVA IVGPNARLLL EKLGGMDVSK
EALPFMHWAE GTLAGIPARV FRISFSGELS YEVAVPAGQG LAFWQACLEA GAEFGLMPYG
TEALHVMRAE KGFIMIGDET DGTVVPQDLN LGWAISKKKA DFIGKRGMER TFLSSPDRWK
LVGLETLDGS VLPDGAIAPA AGSNGNGQRN TQGRVTSTYW SPTLKKGIAM GLVHRGTDRM
GEVIEFPKIW GGVVQARIVD PVFYDKAGEK QDV