Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pden_4907 |
Symbol | |
ID | 4583468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008688 |
Strand | - |
Start bp | 410161 |
End bp | 413091 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639772210 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_918663 |
Protein GI | 119387629 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.232501 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGC TGACCGGAGG GCTGATCGAC CGCGCCCGCA ACCTGCGCTT CACCTTCGAC GGCCGGACCT ATGCCGGCCA TCCCGGCGAC ACGCTGGCCT CGGCGCTGTT GGCGAACGGC GTGCGGCTGA TGGGCCGCAG CTTCAAGTAT CACCGGCCGC GCGGCGTCTT TTCGGCCGGG TCCGAGGAGC CGAATGCGCT GGTGGAACTG CGCAGCGGCG CCCGGAAAGA GCCCAACAGC CGTGCCACCG TGGCCGAGCT TTACGATGGG CTTGAGGCCG CCAGCCAGAA CCATGTCGGC CCGCTGGGCT TCGACCTGCT GGCGGTGAAC GACCTGTTCT CGTCCTTCTT CGCGGCGGGT TTCTATTACA AGACCTTCAT GTGGCCGCGC GCCTTCTGGG AAAAGCTTTA CGAGCCGGCG ATCCGCCGGG CGGCGGGCTT GGGCAGCCTG TCGATGCAGC CCGATCCCGA TGCCTATGAC AAGGGCTTTT TGCATTGCGA CCTGCTGGTC ATCGGCGGCG GCGCGGCGGG TCTTTCGGCG GCGCTGACCG CTAGCCGGGC CGGGGCGCGG GTCATCCTGG CCGACGAGGA TTTCCGCCTG GGCGGTCGGC TGCTTGCCGA AAGCCATCTG TTGGACGATG CCCCGGCGAC CGAATGGGTC GCGCAGGCCG AGGCCGAGCT TGCGGCGCTG CCCAATGTCC GCATCCTGTG TCGGACCACG GTGATCGGCG CCTTCGACCA CGGCGTCTAT GGCGCGGTCG AGCGTGTCGC CGATCACCTG CCCGAGCCGG GGCGGCAGGT GCGCCAGACG CTTTGGCGGA TCACCGCCAG CCGCGCCGTG CTGGCCGCCG GTGCCATCGA ACGGCATATC CCGTTCCGGA ACAACGACCG TCCCGGCATC ATGCTGGCCG GCGCCATGCG CGCCTATGCC AACCGCTGGG CGGCCAGCCC GGCCAGGCGG GTGGCGATCT TTACCAATAA CGACGACGGC CACCGCACGG CGCTGGACCT TGCCGCCAAG GGGATCGAGG TCGCGGCGGT GATCGACAGC CGTTCGGACG CGCAGGCGCA GGGCCACTAC CGGCTGGTCC ACGGCATGGT CTGCGATGCC AAGGGCCGCC TGGGCCTGCG CGGGGTCCAG ATCGACAGCG ACGGCCGCAG GGAATGGCTG GATTGCGGCG CGCTCGGCGT CGCGGGCGGC TGGAACCCGA ACGTGCATCT GGCCTCGCAC CATCGCGGCC GCCCGGTCTG GGACGCGGCG CTGCAGGCCT TCGTCCCCGG CGAGGGCGGG CCGTGCGGGC TGATCGCCGC CGGTGCCGCG AAGGGGCAGG GCAGCACCGC TGCCGCGCTG CGCTCGGGCG CCGGGGCGGC GGTCGAGGCG CTGGCCCAGC TTGGCATCGA GGCCCGGTCC GCCGAGCTGC CGGCGGCCGA GGATGCGCCG GCCGGTCTGC GCCCGCTTTG GCACGTGCCG GGCAAGGGTC GCGCCTGGGT CGATTTCCAG AACGACGTGA CCGTCAAGGA CATCAAGCTG GCGCATCAGG AAAACATGCG CCCGGTCGAA CACCTGAAGC GCTGGACCAC GCTGGGCATG GCCACAGACC AGGGCAAGAC CGCGAATGTC ACCGCCCTGG CGGTCATGTC GGAACTGACC GGCAAGCCGA TCCCGGAAAC CGGCACGACG ATCTTTCGCC CGCCTTACAG CGGCGTGTCG TTGTCGGTGC TGGGCGGCGG CGACACGGGG CCGCATTTCC GCCCGCGCCG CCTGACGCCC ACGCATGAAT GGGCGCGTGC GCAGGGCGCC GTCTTTGTCG AGGTCGGACC CTGGGTGCGG GCGCAATACT TCCCCCGCCC GGGCGAGACG CATTGGCGCC AGACCGTGGA CCGCGAGGTC CTGGCCACCC GTGCGGGCGT CGGCATCTGC GACGTGACCA CGCTTGGCAA GGTGGATGTG CAGGGCGCGG ATGCCGGCGA GTTTCTCGAC CGGATCTATG CCAATGCCAT GAAAAGCCTC AAGGTCGGCA TGGTCCGCTA TGGCCTCATG CTGCGCGAGG ACGGGATGGC CTGGGACGAC GGCACCTGCG CCCGGCTGGC CGAGGATCAT TACGTCACCA CCACCACCAC CGCCCAGGCG GGGCCGGTCT ATCGGCAGAT GGAATTCGCC CGCCAATGCC TCTGGCCCGA GCTGGACGTG CAGCTGATCT CGACCACCGA CGCCTGGGCG CAGATCGCCG TGGCCGGTCC CAATGCACGC CGGCTGCTGG AGCGGATCGT GGACGGGTTC GACCTGTCGA ACGAGGCCTT TCCCTTCATG GCCTGCGCTG GCCTGATGGT TTGCGGCGGG CTTCGGGCGC GGCTCTTCCG CATCAGCTTC TCGGGCGAGC TGGCCTATGA GATCGCGGTG CCGGCCCGCT ATGGCCAGGC GCTGGTCGAG CGGCTGATGG AGAAAGGCGC CGATCTGGGC GCCACCCCCT ACGGCACCGA GGCGCTTGGC GTGATGCGCA TCGAAAAGGG CCATGCCGCC GGCAACGAGC TGAACGGGCA GACCACGGCG CAGATGCTGG GACTGGGCCG CATGGTCAGC AGCAAGAAAG ACGGCATCGG CGCCGTGATG TCCCGGCGCG AGGGGCTGGC GGCCGAAACC CGGGTGCTGG TGGGGCTGCA ACCGGTCGAT CCGGCGCAGC CCGTCCTTGC CGGCATGCAT CTTTTCGCCG AGGGGGCCGA GCATCGGACC GAGACCGACC AGGGCTGGAT CACCTCGGCC TGTCATTCGC CGCATGTCGG ATCAAGCATC GGGCTGGGCT TCCTGGCCGA TGGCGGCACC CGCATGGGCG AGACGGTGAT CGCCGCCAAT CCGCTGCAGG GGCAAAAGGT GGCGCTGCGC GTCGTCCCGC CGCATTTCAT CGACCCGGAA GGAGGGCGCC TGCGTGACTG A
|
Protein sequence | MTRLTGGLID RARNLRFTFD GRTYAGHPGD TLASALLANG VRLMGRSFKY HRPRGVFSAG SEEPNALVEL RSGARKEPNS RATVAELYDG LEAASQNHVG PLGFDLLAVN DLFSSFFAAG FYYKTFMWPR AFWEKLYEPA IRRAAGLGSL SMQPDPDAYD KGFLHCDLLV IGGGAAGLSA ALTASRAGAR VILADEDFRL GGRLLAESHL LDDAPATEWV AQAEAELAAL PNVRILCRTT VIGAFDHGVY GAVERVADHL PEPGRQVRQT LWRITASRAV LAAGAIERHI PFRNNDRPGI MLAGAMRAYA NRWAASPARR VAIFTNNDDG HRTALDLAAK GIEVAAVIDS RSDAQAQGHY RLVHGMVCDA KGRLGLRGVQ IDSDGRREWL DCGALGVAGG WNPNVHLASH HRGRPVWDAA LQAFVPGEGG PCGLIAAGAA KGQGSTAAAL RSGAGAAVEA LAQLGIEARS AELPAAEDAP AGLRPLWHVP GKGRAWVDFQ NDVTVKDIKL AHQENMRPVE HLKRWTTLGM ATDQGKTANV TALAVMSELT GKPIPETGTT IFRPPYSGVS LSVLGGGDTG PHFRPRRLTP THEWARAQGA VFVEVGPWVR AQYFPRPGET HWRQTVDREV LATRAGVGIC DVTTLGKVDV QGADAGEFLD RIYANAMKSL KVGMVRYGLM LREDGMAWDD GTCARLAEDH YVTTTTTAQA GPVYRQMEFA RQCLWPELDV QLISTTDAWA QIAVAGPNAR RLLERIVDGF DLSNEAFPFM ACAGLMVCGG LRARLFRISF SGELAYEIAV PARYGQALVE RLMEKGADLG ATPYGTEALG VMRIEKGHAA GNELNGQTTA QMLGLGRMVS SKKDGIGAVM SRREGLAAET RVLVGLQPVD PAQPVLAGMH LFAEGAEHRT ETDQGWITSA CHSPHVGSSI GLGFLADGGT RMGETVIAAN PLQGQKVALR VVPPHFIDPE GGRLRD
|
| |