Gene Ndas_0424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0424 
Symbol 
ID9244263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp513705 
End bp514973 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content71% 
IMG OID 
ProductGlycine hydroxymethyltransferase 
Protein accessionYP_003678377 
Protein GI297559403 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.3186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACTG ACAACACGCT CAACCAGACG CTGGGCGAAC TGGACCCCGA GGTGGCGGCC 
GCGGTCGACG CGGAGCTGGC CCGCCAGCGC GACACCCTGG AGATGATCGC CTCGGAGAAC
TTCGCCCCCC AGGCGGTGAT CGAGGCCCAG GGCACCGTCC TGACCAACAA GTACGCGGAA
GGGTACCCGG GCCGCCGCTA CTACGGCGGG TGCGAGCACG TCGACGTCGT CGAGCAGCTC
GCCATCGACC GCGCCAAGGC GCTGTTCGGC GCCGAGCACG CCAACGTGCA GCCGCACTCG
GGCGCGCAGG CCAACACGGC CGTGTACTTC GCGCTGCTCA AGCCGGGTGA CACCATCCTG
GGCCTGGACC TGGCCCACGG CGGCCACCTG ACCCACGGCA TGAAGATCAA CTACTCCGGC
AAGATCCTCA ACGCGGTGGC CTACCACGTG CGCGACGAGG ACGGCACCGT CGACTACGAC
GAGGTCGAGG CGCTCGCCGA GGAGCACCGG CCCAAGATGA TCGTCGCCGG GTGGTCGGCC
TACCCGCGCC AGCTGGACTT CGCCCGCTTC CGCAAGATCG CGGACTCGGT CGGCGCGCTC
CTGATGGTGG ACATGGCGCA CTTCGCCGGT CTGGTCGCGG CGGGGCTGCA CCCCAACCCG
GTGCCGCACG CCGACGTGGT CACCACGACC ACGCACAAGA CCCTGGGCGG CCCGCGCGGC
GGCATGATCC TGGCCAAGGC CGAGCTGGGC AAGAAGATCA ACTCCGCGGT GTTCCCCGGC
ATGCAGGGCG GGCCGCTGGA GCACGTGATC GCGGCCAAGG CGGTGGCCCT CAAGGTCGCC
GCGGGCGAGG AGTTCGCCGA CCGCCAGCGC CGCACGGTCT CGGGCGCCAG GCTGCTCGCC
GAGCGGCTGA CGCGGCCGGA CGCGGCCGAG GTCGGCGTGA AGGTGCTCTC GGGGGGCACG
GACGTGCACC TGGTCCTGGT GGACCTGGTG AACTCCGAGC TCAACGGCCA GGAGGCCGAG
GACCGCCTGC ACTCGATCGG GATCACGGTC AACCGCAACG CGGTGCCCAA CGACCCGCGC
CCGCCGATGG TCACCTCCGG TCTGCGGATC GGCACCCCGG CGCTGGCCAC CCGCGGTTTC
GGCGACGAGG ACTTCGCCGA GGTCGCGGAC GTCATCGCCG AGGCGCTCAA GCCGGAGTTC
GACGAGGCCG CGCTGCGCGG CCGGGTCCAG GCGCTGACCG CGAAGTACCC GCTCTACCCG
AACCTGTAG
 
Protein sequence
MATDNTLNQT LGELDPEVAA AVDAELARQR DTLEMIASEN FAPQAVIEAQ GTVLTNKYAE 
GYPGRRYYGG CEHVDVVEQL AIDRAKALFG AEHANVQPHS GAQANTAVYF ALLKPGDTIL
GLDLAHGGHL THGMKINYSG KILNAVAYHV RDEDGTVDYD EVEALAEEHR PKMIVAGWSA
YPRQLDFARF RKIADSVGAL LMVDMAHFAG LVAAGLHPNP VPHADVVTTT THKTLGGPRG
GMILAKAELG KKINSAVFPG MQGGPLEHVI AAKAVALKVA AGEEFADRQR RTVSGARLLA
ERLTRPDAAE VGVKVLSGGT DVHLVLVDLV NSELNGQEAE DRLHSIGITV NRNAVPNDPR
PPMVTSGLRI GTPALATRGF GDEDFAEVAD VIAEALKPEF DEAALRGRVQ ALTAKYPLYP
NL