Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0424 |
Symbol | |
ID | 9244263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 513705 |
End bp | 514973 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Glycine hydroxymethyltransferase |
Protein accession | YP_003678377 |
Protein GI | 297559403 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.3186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACTG ACAACACGCT CAACCAGACG CTGGGCGAAC TGGACCCCGA GGTGGCGGCC GCGGTCGACG CGGAGCTGGC CCGCCAGCGC GACACCCTGG AGATGATCGC CTCGGAGAAC TTCGCCCCCC AGGCGGTGAT CGAGGCCCAG GGCACCGTCC TGACCAACAA GTACGCGGAA GGGTACCCGG GCCGCCGCTA CTACGGCGGG TGCGAGCACG TCGACGTCGT CGAGCAGCTC GCCATCGACC GCGCCAAGGC GCTGTTCGGC GCCGAGCACG CCAACGTGCA GCCGCACTCG GGCGCGCAGG CCAACACGGC CGTGTACTTC GCGCTGCTCA AGCCGGGTGA CACCATCCTG GGCCTGGACC TGGCCCACGG CGGCCACCTG ACCCACGGCA TGAAGATCAA CTACTCCGGC AAGATCCTCA ACGCGGTGGC CTACCACGTG CGCGACGAGG ACGGCACCGT CGACTACGAC GAGGTCGAGG CGCTCGCCGA GGAGCACCGG CCCAAGATGA TCGTCGCCGG GTGGTCGGCC TACCCGCGCC AGCTGGACTT CGCCCGCTTC CGCAAGATCG CGGACTCGGT CGGCGCGCTC CTGATGGTGG ACATGGCGCA CTTCGCCGGT CTGGTCGCGG CGGGGCTGCA CCCCAACCCG GTGCCGCACG CCGACGTGGT CACCACGACC ACGCACAAGA CCCTGGGCGG CCCGCGCGGC GGCATGATCC TGGCCAAGGC CGAGCTGGGC AAGAAGATCA ACTCCGCGGT GTTCCCCGGC ATGCAGGGCG GGCCGCTGGA GCACGTGATC GCGGCCAAGG CGGTGGCCCT CAAGGTCGCC GCGGGCGAGG AGTTCGCCGA CCGCCAGCGC CGCACGGTCT CGGGCGCCAG GCTGCTCGCC GAGCGGCTGA CGCGGCCGGA CGCGGCCGAG GTCGGCGTGA AGGTGCTCTC GGGGGGCACG GACGTGCACC TGGTCCTGGT GGACCTGGTG AACTCCGAGC TCAACGGCCA GGAGGCCGAG GACCGCCTGC ACTCGATCGG GATCACGGTC AACCGCAACG CGGTGCCCAA CGACCCGCGC CCGCCGATGG TCACCTCCGG TCTGCGGATC GGCACCCCGG CGCTGGCCAC CCGCGGTTTC GGCGACGAGG ACTTCGCCGA GGTCGCGGAC GTCATCGCCG AGGCGCTCAA GCCGGAGTTC GACGAGGCCG CGCTGCGCGG CCGGGTCCAG GCGCTGACCG CGAAGTACCC GCTCTACCCG AACCTGTAG
|
Protein sequence | MATDNTLNQT LGELDPEVAA AVDAELARQR DTLEMIASEN FAPQAVIEAQ GTVLTNKYAE GYPGRRYYGG CEHVDVVEQL AIDRAKALFG AEHANVQPHS GAQANTAVYF ALLKPGDTIL GLDLAHGGHL THGMKINYSG KILNAVAYHV RDEDGTVDYD EVEALAEEHR PKMIVAGWSA YPRQLDFARF RKIADSVGAL LMVDMAHFAG LVAAGLHPNP VPHADVVTTT THKTLGGPRG GMILAKAELG KKINSAVFPG MQGGPLEHVI AAKAVALKVA AGEEFADRQR RTVSGARLLA ERLTRPDAAE VGVKVLSGGT DVHLVLVDLV NSELNGQEAE DRLHSIGITV NRNAVPNDPR PPMVTSGLRI GTPALATRGF GDEDFAEVAD VIAEALKPEF DEAALRGRVQ ALTAKYPLYP NL
|
| |