Gene Rsph17025_1874 details


Gene Information

Locus tag: Rsph17025_1874
Symbol:
ID: 5084299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1923262
End bp: 1926243
Gene Length: 2982 bp
Protein Length: 993 aa
Translation table: 11
GC content: 71%
IMG OID: 640483435
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_001168070
Protein GI: 146277911
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
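
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes that the coordinates are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1923262
end_bp = 1926243

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2982 993
```

Under these assumptions the computed values agree with the reported Gene Length (2982 bp) and Protein Length (993 aa).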


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGC GTCTGGCCCG GGGCGGGCGG CTGATCGACC GCAATCACCC TCTGGGTTTC 
ACCTTCAACG GCAAGCGGAT GCGCGGCTTT GCCGGGGATA CGCTTGCGGC GGCCCTGCTG
GCCAACGACC AGATGCTGGT CGGGCGCAGC TTCAAGTATC ACCGCCCGCG CGGCATCGTG
GCGGCGGGCG CCGAAGAGCC GAACGCGCTT GTCCAGCTTG GCACCGGCGG CCGGTCGGAA
CCCAACCAGC GCACCACCAC GACCGAGCTG TTCGCGGGCC TCTCGGCCGC GAGCCAGAAC
CACTGGCCGA GCCTCGAGTT CGACGTGGGC GCGGTGAACG CGGCGGCGGG GCGCTTCCTG
CCCGCGGGCT TCTACTACAA GACCTTCCTT CAGCCCCGGC TGGCGTGGAA GCACCTGTTC
GAGCCGGTGA TCCGCCGCTC GGCGGGCCTC GGGCGGCCGC CGGAAGAGCC CGATGCCGAC
CGTTACGAGC AGGCCTACGC CTTCTGCGAC CTCCTCGTGG TGGGCGGCGG CATCGCGGGG
TTGCAGGCGG CGCTGAGTGC CTCGGCCTCG GGGCAAAAGG TGATGCTGCT CGAGCAGACG
CCGCACTGGG GCGGGCGCGC CCCGGTCGAT GACGTGCTGA TCGAGGGCCG GCCGGTGGCG
GACTGGGTGG CGGCCACGGT GGCTTCGCTC GAGGCCGCGC CGAACGTCAC GCTGCGCACC
CGCTGCATGG CGGCCGGGGT GCACGACCAC GGCTATGTGC TGGCCGAAGA GCGGGTGGCC
GATCATACGC CGGGCGACGG GCGGCCGAAG AAGCGGCTCT GGCGCATCCG CGCGGGCAAG
GTGGTGACGG CGACGGGCGC CATCGAACGG CCGCTGCCCT TCGCGGGCAA CGACATTCCG
GGCGTGATGC TCGCCTCGGC GGTGCGCGAC TATCTGGTGA ACTGGGCCGT CTCGCCCGGC
GACCGGGTGG TGATCGTGAC GAACAACGAC GACGCCTACC GCACCGCCAT CGCCGTCCAC
CGCGCGGGGC TGACGGTGCC GGCCGTGCTC GATGCGCGGG CCGAGGCCGA CGGCGCGCTG
CCCGAAGAGG TGCGCAGCCT CGGCATCCCG GTCCTGACCA ACCGCGCGGT GGCGAAGGTC
AAGGGCGGCA AGCGCGTGAC CGGCGTCGCC GTCTGCGCCC AGGCGGGCGA GGGGGCGGTG
CTCGACGAGT TTGCCTGCGA TGCGGTCGCC ATGTCGGGCG GCTGGTCGCC GGTCGTCCAT
CTCTGGAGCC ACTGCGGCGG CAAGCTGATC TGGGACGAGG CGCAGGCGGC CTTCCGTCCC
GATCCTGCCC GCCCGCCGAT CACCCACGAC GGCTCGGCGA TGGTGGCGGC CGCGGGTTCG
GCCAATGGTG AGCTGCTCTC GGCCGATGTG CTGGCCGATG CCATCCGCGC CGTGGGCGGC
GAGGGCCCGG CCCCCCGGGC GCAGAGCCCC GAGGAGGCTC CGACCGAGCC GGTCTGGATC
ATGCCGCAGG GGGCCACGCC CGCGCTGCGC TCGAAGATGT GGCTCGACTA CCAGAACGAC
GTGAAGGTGT CGGACGTGCA GCTTGCCGCC CGCGAGGGCT ACGAGTCGGT CGAGCATACC
AAGCGCTACA CGACGCTCGG CATGGCCACC GATCAGGGCA AGCTCAGCAA CATCAACGGG
CTTGCGGTGC TGGCGGGATC GCTCAACGCG CCGATCCCCG CGGTTGGCAC CACCACCTTC
CGCCCGCCCT ACACGCCCGT CACCTTCGGC GCGCTGGTGG GCGAGGCTCG GGGCGAGATC
TTCCAGCCGC TGCGCCGCAC GCCGATGCAC GACTGGCACG AGGCCCATGG CGCCTACTGG
GAGCCGGTGG GCCTCTGGCG TCGGCCCTAC TGCTACAGCC GTCCCGGCGA GAGCCACGGC
GACGCGGTGG CCCGCGAGGT CACCAACGCG CGCACCAAGC TCGGGCTGCT CGACGCCTCG
ACGCTGGGCA AGATCCTCGT GAAGGGGCCC GATGCGGGCC GCTTCCTCGA CATGCTCTAC
ACCAACGTCA TGTCGAGCCT GCCCGTGGGC CGCTGCCGCT ACGGCCTCAT GTGCAACGAG
AACGGCTTCC TGATGGATGA CGGGGTCGTG GCGCGGATCT CCGAGGACAG CTGGCTCTGC
CACACGACCT CGGGCGGGGC CGACCGGATC CACGCCCACA TGGAGGATTG GCTCCAGTGC
GAATGGTGGG ACTGGCAGGT CCATACCGCC AACCTGACCG AGCAGTTCGC GCAGGTGGCC
ATCGTCGGCC CCAACGCGCG CAGGCTGCTG GAAAAGCTCG GCGGGATGGA CGTCTCGAAG
GAGGCGCTGC CCTTCATGCA CTGGGCGGAA GGCACGATCG CGGGCATCCC CGCGCGCGTG
TTCCGCATCA GCTTCTCGGG TGAGCTGTCC TACGAGGTGG CGGTTCCTGC GGGGCAGGGG
CTGGCCTTCT GGCAGGCCTG CCACGAGGCG GGGGCCGAGT TCGGCGCCAT GCCCTACGGC
ACCGAGGCGC TGCATGTGAT GCGGGCCGAG AAGGGCTTCA TCATGATCGG CGACGAGACC
GACGGGACGG TGATCCCGCA GGACCTGAAC CTCGGCTGGG CCATCTCGAA GAAGAAGGCC
GACTTCATCG GCAAGCGCGG GATGGAGCGG GCCTTCCTCG CCAGCCCCGA CCGCTGGAAG
CTCGTGGGGC TCGAGACGCT CGACGGCTCG GTGCTGCCGG ATGGCGCCAT CGCGCCCGCG
CCCGGCTCGA ACGCGAATGG CCAGCGCAAC ACGCAAGGCC GCGTGACCTC GACCTACTGG
TCGCCGACGC TGAAGAAGGG GATCGCCATG GGCCTCGTCC ATCGTGGCCC CGAGCGGATG
GGCGAGGTGA TCGAGTTCCC GAAGATCTGG GGCGGCGTGG TGCAGGCGCG GATCGTGGAT
CCGGTGTTCT ACGACAAGGC GGGAGAGAAG CAGGATGTCT GA
 
Protein sequence
MSTRLARGGR LIDRNHPLGF TFNGKRMRGF AGDTLAAALL ANDQMLVGRS FKYHRPRGIV 
AAGAEEPNAL VQLGTGGRSE PNQRTTTTEL FAGLSAASQN HWPSLEFDVG AVNAAAGRFL
PAGFYYKTFL QPRLAWKHLF EPVIRRSAGL GRPPEEPDAD RYEQAYAFCD LLVVGGGIAG
LQAALSASAS GQKVMLLEQT PHWGGRAPVD DVLIEGRPVA DWVAATVASL EAAPNVTLRT
RCMAAGVHDH GYVLAEERVA DHTPGDGRPK KRLWRIRAGK VVTATGAIER PLPFAGNDIP
GVMLASAVRD YLVNWAVSPG DRVVIVTNND DAYRTAIAVH RAGLTVPAVL DARAEADGAL
PEEVRSLGIP VLTNRAVAKV KGGKRVTGVA VCAQAGEGAV LDEFACDAVA MSGGWSPVVH
LWSHCGGKLI WDEAQAAFRP DPARPPITHD GSAMVAAAGS ANGELLSADV LADAIRAVGG
EGPAPRAQSP EEAPTEPVWI MPQGATPALR SKMWLDYQND VKVSDVQLAA REGYESVEHT
KRYTTLGMAT DQGKLSNING LAVLAGSLNA PIPAVGTTTF RPPYTPVTFG ALVGEARGEI
FQPLRRTPMH DWHEAHGAYW EPVGLWRRPY CYSRPGESHG DAVAREVTNA RTKLGLLDAS
TLGKILVKGP DAGRFLDMLY TNVMSSLPVG RCRYGLMCNE NGFLMDDGVV ARISEDSWLC
HTTSGGADRI HAHMEDWLQC EWWDWQVHTA NLTEQFAQVA IVGPNARRLL EKLGGMDVSK
EALPFMHWAE GTIAGIPARV FRISFSGELS YEVAVPAGQG LAFWQACHEA GAEFGAMPYG
TEALHVMRAE KGFIMIGDET DGTVIPQDLN LGWAISKKKA DFIGKRGMER AFLASPDRWK
LVGLETLDGS VLPDGAIAPA PGSNANGQRN TQGRVTSTYW SPTLKKGIAM GLVHRGPERM
GEVIEFPKIW GGVVQARIVD PVFYDKAGEK QDV
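
The relationship between the gene sequence and the protein sequence above can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the method used to produce this record. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code; table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in the conventional TCAG codon order (TTT, TTC, TTA, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown in this record.
first_line = "ATGAGCACGC GTCTGGCCCG GGGCGGGCGG CTGATCGACC GCAATCACCC TCTGGGTTTC"
print(translate(first_line))  # prints: MSTRLARGGRLIDRNHPLGF
```

The printed residues match the first two blocks of the protein sequence (MSTRLARGGR LIDRNHPLGF). Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.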