Gene Ndas_3009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3009 
Symbol 
ID9246862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3592137 
End bp3593321 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content74% 
IMG OID 
ProductPhosphoglycerate kinase 
Protein accessionYP_003680925 
Protein GI297561951 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0329661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGACGA TCGACGACCT CGACGTCTCC GGCAGGCGCG TGTTCGTCCG GGCCGACCTG 
AACGTGCCCC TGGACGGCGA CCGCATCACC GACGACGGGC GCATCCGCGC GGCCGTGCCC
ACCATCTCCG CGCTGCGCGA GCGCGGCGCC CGCGTCATCG TCGCCGCCCA CCTGGGCCGC
CCCAAGGGAG CCCCGGACCC CCGCTACTCT CTGCGCCCGG TCGCCGCCCG CCTGGGCGAA
CTGCTCGGCG CCGAGGTGGC CTTCGCCTCC GACACCGCCG GGGAGTCGGC CCGCGCCACC
TCCGAGGCCC TCACGGACGG TCAGGTCGCC CTGCTGGAGA ACGTGCGCTT CGAGCCGGGG
GAGACCAGCA AGGACGACGC CGAGCGCGGG GAGCTCGCCG ACCGCTTCGC CCAGCTCGCC
GACCTGTACG TGGGCGACGC CTTCGGCGCC GTGCACCGCA AGCACGCCAG CGTCTACGAC
CTGCCCGGCA GGCTGCCGCA CGCCGTCGGC GGCCTGGTGC TGAACGAGGT CGAGGTGCTG
CGCAGGCTCA CCGGCGCCCC CCAGCGGCCC TACGCCGTGG TCCTGGGCGG CTCCAAGGTC
TCCGACAAGC TCGGCGTCAT CGACAACCTG CTGGGCACCG CGGACCGCCT GCTCATCGGC
GGCGGCATGG TCTTCACCTT CCTCAAGGCC CAGGGCCACG AGGTCGGCTC CAGCCTGCTG
GAGGCCGACC AGCTCGACAC CGTCAAGGGC TACCTGGAGC GCGCCGAGCG CGAGGGCGTG
GAGATCGTCC TGCCGGTGGA CGTGGTGGCC GCCGAGAAGT TCTCCGCCGA CGCCGCGCAC
GACGCGGTCG CCGTCGATGC CATCCCGTCC GACCGGATGG GCCTGGACAT CGGCCCCCGC
AGCCAGGAGC TCTTCGCGCG GAAGCTGGCC GACGCCCGCA CCGTGTTCTG GAACGGCCCG
ATGGGCGTCT TCGAGATGGA GCCCTACGCC GGGGGCACCC GCGCGCTGGC CCAGGCCCTG
ATCGACTCCG GCGCCTTCAC CGTGGTCGGC GGCGGCGACT CCGCCGCGGC CGTGCGCGCG
CTGGGCTTCG ACGAGGCGGC CTTCGGCCAC ATCTCCACCG GCGGCGGCGC CAGCCTGGAG
TACCTGGAGG GCAAGGACCT GCCCGGTATC GACGCCCTGA AGTAA
 
Protein sequence
MRTIDDLDVS GRRVFVRADL NVPLDGDRIT DDGRIRAAVP TISALRERGA RVIVAAHLGR 
PKGAPDPRYS LRPVAARLGE LLGAEVAFAS DTAGESARAT SEALTDGQVA LLENVRFEPG
ETSKDDAERG ELADRFAQLA DLYVGDAFGA VHRKHASVYD LPGRLPHAVG GLVLNEVEVL
RRLTGAPQRP YAVVLGGSKV SDKLGVIDNL LGTADRLLIG GGMVFTFLKA QGHEVGSSLL
EADQLDTVKG YLERAEREGV EIVLPVDVVA AEKFSADAAH DAVAVDAIPS DRMGLDIGPR
SQELFARKLA DARTVFWNGP MGVFEMEPYA GGTRALAQAL IDSGAFTVVG GGDSAAAVRA
LGFDEAAFGH ISTGGGASLE YLEGKDLPGI DALK