Gene Ndas_0978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0978 
Symbol 
ID9244823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1197660 
End bp1198823 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content78% 
IMG OID 
ProductPrephenate dehydrogenase 
Protein accessionYP_003678928 
Protein GI297559954 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACA GCAGCGGTGC GCGGGGGGAC GGGGAGCAGC GGCTCCCGGT GCGCCGGGTC 
ACGGTGGTCG GCACGGGGCT CATCGGCACG TCGGTGGCGC TCGCCCTGCG CTCGCGCGGC
GTCGACGTCG CCCTCTCCGA CCCCGACGCC GCCTCCCTGC GCCTGGCCTG CGACCTGGGC
GCGGGACACC CCCTGGAGGA GGCCGACGAC CCGACCTCGG CCCCGGCCGA CCTCGCGGTC
ATCGCCGCGC CGCCCGCCGT CATCCCCTCC GTGCTGCGCC GGGCCCAGGA CCGGGGACTG
GCCCACGTCT ACACCGACGT GGCCAGCGTG AAGGCGAGCG TGCTGGCCGA GGCCGAGCGG
CTGGGCTGCG ACATGGCCAC GTTCGTGGCC GGGCACCCCA TGGGCGGCAG GGAGAAGCAG
GGCCCCGGCG CCGCGCGCGC CGACCTCTTC CTGGGCCGTT CGTGGGCGCT GTGCCCCACC
GGCAAGGCCG ACGCGCAGGC CGTCGCCGTC GTCGCCGAGG TCGCCCGCAT GTGCGGCGCG
GACCCGCTGG TGCTGGACGC GGCGGCGCAC GACCGGGCGG TGGCGCTGGT CTCGCACGCG
CCCCACGTGG CCTCCTCGGC GGTGGCGGCC CGGCTGCTCT CGGGCGACGA CGCGGCCCTG
ACCCTGGCGG GGCAGGGCGT GCGCGACGTC ACCCGGGTGG CGGGCGGCGA CCCCGCCATG
TGGTCCGAGA TCCTCACGCA CAACGCCGCC CCGGTCGCGG AGGTCCTGCA CGCGGTGGCC
GCCGACCTCG CGGCGACCGC CGACGCCCTG CGCGCGCTGG CGGACCGGCC GGGGCCGGAG
GAGGCGTCGG ACCGCGTCCT CGCTCCGGTG CACGACCTGC TGGAGCGCGG CAGGTCCGGC
CACGGCCGCA TCCCCGGCAA GCACGGCACG GTACGGCTGC CCGACTACAC GGTCATGCCG
GTGGTCATCC CCGACGAACC CGGTGCGCTG GGCCGCCTGT TCGCCGCGGC GGCGGAGGCC
GGGGTCAACA TCGAGGACGT GCGCATCGAG CACACGCCCG GCCTGCCGCT GGGCGTGGCG
CAGCTGCACG TGCTTCCCGA GGCGGTGGAC ACGCTCGCGC GGGCGCTGGC CGCCGACGGC
TGGTCGGTGC ACCCGGGCCT CTGA
 
Protein sequence
MSDSSGARGD GEQRLPVRRV TVVGTGLIGT SVALALRSRG VDVALSDPDA ASLRLACDLG 
AGHPLEEADD PTSAPADLAV IAAPPAVIPS VLRRAQDRGL AHVYTDVASV KASVLAEAER
LGCDMATFVA GHPMGGREKQ GPGAARADLF LGRSWALCPT GKADAQAVAV VAEVARMCGA
DPLVLDAAAH DRAVALVSHA PHVASSAVAA RLLSGDDAAL TLAGQGVRDV TRVAGGDPAM
WSEILTHNAA PVAEVLHAVA ADLAATADAL RALADRPGPE EASDRVLAPV HDLLERGRSG
HGRIPGKHGT VRLPDYTVMP VVIPDEPGAL GRLFAAAAEA GVNIEDVRIE HTPGLPLGVA
QLHVLPEAVD TLARALAADG WSVHPGL