Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2709 |
Symbol | |
ID | 4597277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2881583 |
End bp | 2884498 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639777315 |
Product | glycine dehydrogenase |
Protein accession | YP_923899 |
Protein GI | 119716934 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.142199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGACC ACCACCCTCT CGCCGACCTC GACGACGCGC TGCCGTTCGT CGACCGCCAC ATCGGGCTGC GCCCCGGCGA CGTCGAGACG ATGCTCGCCC GGCTCGGCTT CGACTCGCTC GACGCGCTGA TGGCCGCGGC GGTGCCCGGC GGCATCCGGG CCGCCGACGA GCTGGACCTG CCGGCCCCGC TGAGCGAGGA GGCCACGGCG CGCGAGCTGC GGGCGATCGC CCAGGCCAAC CGTCCGGGGG AGGCGATGAT CGGGCTCGGC TACCACGCCA CGACGACCCC GCCGGTGATC CGGCGCAACG TGCTCGAGGA CCCGAGCTGG TACACCGCCT ACACCCCCTA CCAGCCCGAG ATCTCCCAGG GCCGGCTCGA GGCGCTGCTG AACTTCCAGA CCGTCGTCGG CGACCTCACC GGGCTGCCGA TCGCCAACGC CTCGCTGCTC GACGAGGGCA CGGCCGCGGC GGAGGCGATG ACGCTGGTGC GCCGCGCCCA TCGGAACGCT GTCGGGCCGT TCGTGGTCGA CGCCGACGCG CTGCCACAGT CCATCGAGGT GGTCCGCACC CGCGCCGCGG GGCTGGGCAT CGACGTCGTG GTCGCCGACC TGACCGACGG CCTGCCGGCG GACGTGCTCG AGGGCGGGCT GTGCGGCGTG CTCGTGCAGT ACCCCGGCGC GTCCGGCCGG GTGCTGGACC CCAGGCGCGT GATCGAGCAG GCCCACGAGC AGGATGCGCT CGCCGTCGTG GCCGCCGACC TGCTCGCCCT CACCCTGCTC GAGTCGCCCG GCGAGCTCGG GGCCGACGTG GTGGTCGGCT CGAGCCAGCG GTTCGGCGTC CCGCTGTTCT ACGGCGGCCC GCATGCCGGG TTCATGTCGG TGGCCGCCGG CCTGGAGCGG CACCTGCCCG GCCGCCTGGT CGGCGTCTCG GTGGACGCCG AGGGGCGCCC GGCGTACCGG CTCGCCCTCC AGACCCGCGA GCAGCACATC CGCCGGGACA AGGCGACCTC CAACATCTGC ACCGCCCAGG TGCTGCTCGC GGTGGTGGCG TCGATGTACG CCGTCTACCA CGGGCCGGAG GGCCTGCGCG CGATCGCGAC CCGCGCGCAC CGCTACGCCG CGGTCCTCGC CGCGGCGCTC GGCCAGGGCG GGTTCCCGGT ACGCCACGAG AGCTACTTCG ACACCCTCGT CGTGCAGGCG ACCGGCCGGG CCGCCGCGGT CGTCGCAGCG GCCCGGGCGC TCGGCGTGCA GCTGCGGCTG GTCGACGCCG ACCACGTGGG CATCAGCACC TCCGAGTGCA CGACCCGGTC CACGGTCGCC AGCGTGCTCA AGGCGTTCGG CCTGGCGACC GGCCCGGCCG ACGGCTCGGT CGACCTGGAC GCGGTGGACG CCGCGACCGG CGACGCCCTG CCGGAGGCCC TGCGCCGGAC CACGCCGTAC CTGACCCACG AGGTGTTCAC CAGCTATCAC AGCGAGACCG CGATGCTGCG CTACCTGCGC CGGCTCTCGG CGCGCGACTA CGCGCTGGAT CGTGGCATGA TCCCGCTCGG CTCCTGCACG ATGAAGCTCA ACGCCACCAC CGAGATGGAG CCGATCTCGC TGCCCGGGTT CGCCGACCTG CACCCGTTCG CGCCCGCCGA GGACGCCCAG GGCTACCGGC AGCTGGTCAC CGAGCTCGAG GGCTGGCTCG CGGAGGTCAC CGGCTACGAC TCCGTCTCGA TCCAGCCGAA CGCCGGCTCC CAGGGCGAGC TCGCCGGCCT GCTCGCGATC CGCGGCTACC ACCGTGCCAA CGGCGACGAG GCCCGCAACG TGGTGCTGAT CCCGTCCTCC GCGCACGGCA CGAACGCCGC CTCGGCGGTG CTCGCGGGGA TGCGGGTCGT GGTCGTGAAG TCGGGGCCGG GCGGCGAGGT CGACCTCGAC GACCTGCGCG CTCAGTGTGC CGCCCACGCG GACGACCTCG CGGCGATCAT GGTGACCTAC CCGTCGACGC ACGGCGCGTA CGAGGAGACC ATCACGGAGC TGTGCGAGAT CGTGCACGCC CACGGCGGGC AGGTGTACGT CGACGGCGCG AACCTGAACG CCCTGCTGGG CTACGCCAAG CCCGGCGAGT TCGGCGGCGA CGTCTCGCAC CTGAACCTGC ACAAGACCTT CTGCATCCCG CACGGCGGCG GCGGTCCCGG GGTGGGGCCG GTCGCGGTGC GCGCCCACCT GGCGCCGTAC CTGCCCTCGC ACGGGATGCA CCCGGACGAG ACCAAGCGCA CCGGCATCGG CCCGGTCAGC GCCGCGCCGT ACGGCTCGGC GGGGATCCTG CCGATCTCGT GGGCCTACAT CCGGCTGATG GGCGCGGCCG GCCTGACCCG GGCCACCGCG GCCGCCGTGC TGTCGGCGAA CTACGTCGCC GCCCGGCTCG GCGAGCACTA CCCGGTGCTC TACCGCGGCC ACGGCGGGCT GGTCGCGCAC GAGTGCATCC TGGACCTGCG CGGCCTGACG AAGGAGAGCG GCGTCACCGT CGACGACGTC GCCAAGCGGC TGATCGACTA CGGCTTCCAC GCCCCGACGA TGTCCTTCCC GGTCGCCGGC ACGCTGATGG TCGAGCCGAC GGAGTCCGAG GACCTCGGCG AGATCGACCG GTTCATCGAC GCGATGATCG CGATCCGCGG CGAGATCGAC CGGGTCGGCG CGGGGGAGTG GACCCCCGAG GACTCGCCGC TGCGCGGGGC GCCGCACACC TCGCGCGCGC TCGTCGGCGA GTGGGACCGT CCGTACTCCC GTGAGCTCGC GGTCTTCCCG ACCGGCCTCA CCGGCCCCAA CGGCGTGGCC GACAAGTACT GGCCGCCGGT GGCGCGGATC GACCAGGCGT ACGGCGACCG GCACCTGGTC TGCTCCTGCC CGCCTGTGGA AGCCTTCGCC GAGTGA
|
Protein sequence | MSDHHPLADL DDALPFVDRH IGLRPGDVET MLARLGFDSL DALMAAAVPG GIRAADELDL PAPLSEEATA RELRAIAQAN RPGEAMIGLG YHATTTPPVI RRNVLEDPSW YTAYTPYQPE ISQGRLEALL NFQTVVGDLT GLPIANASLL DEGTAAAEAM TLVRRAHRNA VGPFVVDADA LPQSIEVVRT RAAGLGIDVV VADLTDGLPA DVLEGGLCGV LVQYPGASGR VLDPRRVIEQ AHEQDALAVV AADLLALTLL ESPGELGADV VVGSSQRFGV PLFYGGPHAG FMSVAAGLER HLPGRLVGVS VDAEGRPAYR LALQTREQHI RRDKATSNIC TAQVLLAVVA SMYAVYHGPE GLRAIATRAH RYAAVLAAAL GQGGFPVRHE SYFDTLVVQA TGRAAAVVAA ARALGVQLRL VDADHVGIST SECTTRSTVA SVLKAFGLAT GPADGSVDLD AVDAATGDAL PEALRRTTPY LTHEVFTSYH SETAMLRYLR RLSARDYALD RGMIPLGSCT MKLNATTEME PISLPGFADL HPFAPAEDAQ GYRQLVTELE GWLAEVTGYD SVSIQPNAGS QGELAGLLAI RGYHRANGDE ARNVVLIPSS AHGTNAASAV LAGMRVVVVK SGPGGEVDLD DLRAQCAAHA DDLAAIMVTY PSTHGAYEET ITELCEIVHA HGGQVYVDGA NLNALLGYAK PGEFGGDVSH LNLHKTFCIP HGGGGPGVGP VAVRAHLAPY LPSHGMHPDE TKRTGIGPVS AAPYGSAGIL PISWAYIRLM GAAGLTRATA AAVLSANYVA ARLGEHYPVL YRGHGGLVAH ECILDLRGLT KESGVTVDDV AKRLIDYGFH APTMSFPVAG TLMVEPTESE DLGEIDRFID AMIAIRGEID RVGAGEWTPE DSPLRGAPHT SRALVGEWDR PYSRELAVFP TGLTGPNGVA DKYWPPVARI DQAYGDRHLV CSCPPVEAFA E
|
| |