Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pden_0515 |
Symbol | |
ID | 4578702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008686 |
Strand | + |
Start bp | 484673 |
End bp | 487606 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639767832 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_914323 |
Protein GI | 119383267 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA CTGGACCCTT TCGCCTGCCG CGCGGCGGCC GCCTGATCGA CCGCGCCTAT CAGTTGCCCT TTCGCTTCGA CGGCCGCCAG ATGCGCGGGG TCGCGGGCGA TACGCTTGCC TCGGCCTTGC TGGCGAACGG CCAGCTGATG ATGGGCCGCA GCTTCAAGTA TCACCGCCCG CGCGGCCCCA TCGCCTCGGG TGCCGAAGAG CCGAACGCGC TGCTCGGCCT GGGTCAGGGC GGCCGGTTCG AGCCGAACCA GCGCGCCACC ACCACGCCGC TGGTCGGCGG CATGGTCACC GCCAGCCAGA ACGCCTGGCC CAGCCTGAAC GCCGATATCG GCGCGATCAA CAACTGGCTC TATCGCTTTT TCCCGGCGGG GTTCTACTAT AAGACCTTCA TGCATCCGCG CCCGTTCTGG AAGCATGTGT TCGAGCCGAT CATCCGCCGC TCGGCCGGCC TGGGCAAGGC GCCGACCGAG GCCGACCCGG ACAAATACGA ACAGGCCTAT GCCCATGTGG ATGTCGTGGT GGTCGGCGGC GGCATCGCCG GGCTGACCGC GGCGCGCGAC GCGGCCCGCG ACGGCAAGTC GGTCTGGCTG GTCGAGCAGA CCCCGCATTT CGGCGGCCGC ACCCCCACCG ACCATGCCGA CGGCCAGGCC CGCGTCGACG CGTTGCTGGC GGAACTGCAG GGCATGGCCA ACGTCACCCT GCGCCGCTCG ACCCAGGCGA CGGGGCTTTA CGACCATGGC TATCTGCTGG TGCGGGAATC GCTGGCCGAT CACGACCCGA ACGCCGGCAT TCCGCGCCAG CGCCTGTGGC GCATCCGCGC CGGCCATGTG GTCGTCGCCA CCGGCGCGCT GGAACGGCCG CTGAGCTTTG CCGGCAACGA CGTGCCGGGC GTGATCCTGG CCTCGGCCGT GCGCGACTAC ATCAACGATT ACGGCGTCGC GCCGGGCCGC AGGATCGTCG TGGTGACCAA TAACGACGAC GCCTATCGCA CCGCGCTGGT GGCGCTGGAT GCGGGGCTGG ACGTGGCCGC GGTGATCGAT GCCCGCAGCA CGGCCGAGGG TGCATTGCCC GCGGCGGTGC GCGCGCGGGG CGTGCGCGTG CTGACCGGCA GCGCCATCGC CGGCGTCAAG GGCGGCCACG GCGTCGAGGC GGTCAAGCTT TGCGCGCATT CCGGCTCGGG TCAGGTCACC GAGACCATCG ATGCCGATTG CGTCGCCATG TCCGGCGGCT GGTCCCCGGT GGTGCATCTG TGGAGCCATT GCGGCGGCAA GCTGAACTGG TCCGACGCGC AGTCGATGTT CGCCCCGGAC CCGAACCGCC CGCCGACCGG CGCGGACGGC AAGGCGATGG CAAGCTGCGT GGGTGCGGCG GCGGGCGACC TGCTGGTCCC GGAACTGACC GGCGCGCAGG CGGAATCGGC CACCATGCCG GTCTGGGTCA TGCCGGCCCA CGCCCCGCGC AAGATGAAAT TCAAGATGTG GCTCGACTTC CAGAACGACG TGAAGGTCTC GGATGTCGAA CTGGCCGCGC AGGAAGGCTA TCACAGCGTC GAGCATACCA AGCGCTACAC CACGCTGGGC ATGGCGACCG ACCAGGGAAA GGTAAGCAAT ATCAATGGAC TCGCCGTTCT GTCCAATGCG CTGAACCAGC CGATCCCGGC CACGGGCACG ACGACCTTCC GCCCGCCCTA TACGCCGCTG ACCCTGGGCA CCATCGCCGG CGAGGCGCGG GGCGAGATCT TCCAGCCCTT GCGCAAGACG CCCATGCATG GCTGGCACGA AGCGCAGGGC GCGTTCTTCG AGCCGGTCGG CCATTGGCGC CGCCCCTATT GCTATCCGAA GGGCAGCGAA AGCCATGGCG ACGCGGTGGC GCGCGAGATC CGGGCGGTGC GCGCCTCGGT CGGGACGCTC GACGCCTCGA CGCTGGGCAA GATCATCGTC AAAGGACCCG ATGCGGGCAG GTACCTCGAC ATGATGTATA CCGGCATGAT GTCCAGCCTG CCCATCGGCA AGTGCCGCTA TGGCCTGATC TGCAGCGAGA ACGGCTTTCT CATCGACGAC GGCGTGGCCG CGCGGCTGTC CGAGGACACC TGGCTTGTCC ACACCACCAC CGGCGGCGCC GAGCGCATGC ACGGCCATTT CGAGGACTGG CTGCAATGCG AATGGTGGGA CTGGAAGGTC TGGACCGCCA ACGTCACCGA GCAATGGGCG CAGGTCGCCG TGGTCGGCCC CAAGGCCCGC GTGCTGCTGG AACGGCTGGG CGGCAAGATC GACCTTTCGC CCGAGGCGCT GCCCTTCATG GGCTGGATCG AGGGCGAGAT CGCCGGCATC CCCGCCCGCG TCTATCGCAT CAGCTTTTCG GGCGAGCTCA GCTTCGAGGT GGCGGTGCCC GCGAACCGGG GCCTGGAGCT TTGGGAAAAG CTGCATGAGG CCGGCCGCGA CCTGAACGTC ACCCCCTACG GCACCGAGGC CATGCACGTC ATGCGCGCCG AAAAGGGGTT CATCATGATC GGCGACGAAA CCGACGGCAC GGTGATCCCG CAGGATCTGG GCATGTCCTG GGCGATCAGC AAGAAGAAGG CCGACTATAT CGGCAAGCGC GCGCAAGAGC GCAGCTTCAT GACCGATCCC GGCCGCTGGA AGCTGGTGGG CCTGGAAAGC CTGGATGGAC GGGTGCTGCC CGACGGCGTC TATGCGGTCG ATCAGGGCGC GAATGCCAAC GGCCAGCGCA AGGTGCAGGG CCGCGTGACC TCGACCTACA TGTCGCCGAC CCTGGACCGG CCCATCGCCA TGGGGCTGGT GCGGCAGGGC CCCGAGCGCA TGGGCGAGGT GCTGGAGTTC CCCGTCGCGG GTCAGGAAAG CTACAAGGCG CGGATCGTCG ATCCGGTCTT CTACGACAAG GAAGGGAGCC GGGCGAATGG CTGA
|
Protein sequence | MTETGPFRLP RGGRLIDRAY QLPFRFDGRQ MRGVAGDTLA SALLANGQLM MGRSFKYHRP RGPIASGAEE PNALLGLGQG GRFEPNQRAT TTPLVGGMVT ASQNAWPSLN ADIGAINNWL YRFFPAGFYY KTFMHPRPFW KHVFEPIIRR SAGLGKAPTE ADPDKYEQAY AHVDVVVVGG GIAGLTAARD AARDGKSVWL VEQTPHFGGR TPTDHADGQA RVDALLAELQ GMANVTLRRS TQATGLYDHG YLLVRESLAD HDPNAGIPRQ RLWRIRAGHV VVATGALERP LSFAGNDVPG VILASAVRDY INDYGVAPGR RIVVVTNNDD AYRTALVALD AGLDVAAVID ARSTAEGALP AAVRARGVRV LTGSAIAGVK GGHGVEAVKL CAHSGSGQVT ETIDADCVAM SGGWSPVVHL WSHCGGKLNW SDAQSMFAPD PNRPPTGADG KAMASCVGAA AGDLLVPELT GAQAESATMP VWVMPAHAPR KMKFKMWLDF QNDVKVSDVE LAAQEGYHSV EHTKRYTTLG MATDQGKVSN INGLAVLSNA LNQPIPATGT TTFRPPYTPL TLGTIAGEAR GEIFQPLRKT PMHGWHEAQG AFFEPVGHWR RPYCYPKGSE SHGDAVAREI RAVRASVGTL DASTLGKIIV KGPDAGRYLD MMYTGMMSSL PIGKCRYGLI CSENGFLIDD GVAARLSEDT WLVHTTTGGA ERMHGHFEDW LQCEWWDWKV WTANVTEQWA QVAVVGPKAR VLLERLGGKI DLSPEALPFM GWIEGEIAGI PARVYRISFS GELSFEVAVP ANRGLELWEK LHEAGRDLNV TPYGTEAMHV MRAEKGFIMI GDETDGTVIP QDLGMSWAIS KKKADYIGKR AQERSFMTDP GRWKLVGLES LDGRVLPDGV YAVDQGANAN GQRKVQGRVT STYMSPTLDR PIAMGLVRQG PERMGEVLEF PVAGQESYKA RIVDPVFYDK EGSRANG
|
| |