Gene Ndas_1718 details

Gene Information

Locus tag: Ndas_1718
Symbol:
ID: 9245568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2088603
End bp: 2089724
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_003679653
Protein GI: 297560679
COG category:
COG ID:
TIGRFAM ID:
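
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, which is an assumption here since the record does not state its convention. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2088603
END_BP = 2089724


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1122 373
```

Both results agree with the Gene Length and Protein Length fields listed in the record.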


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.192134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTAC TGGTCACGGG TGGTGCCGGG TTCATCGGAT CGAGGGTCGC GGAGGAACTC 
CGCCTGGCCG GGCACGAGGC GGTCACCCTC GACGCCTACC TGCCCCAGGC CCACACCGGG
AGGGACACCG AGCACAGAAC CGTGGACGTC GTGGGCGACG TCCGGGACGG CGAGATCGTC
GAACGCGCCC TGCGGGGCGT GGACGCGGTC TGCCACCAGG CGGCGATGGT GGGGCTGGGC
TCCGCGGACT TCCTGGACGC CCCCGACTAC GTCCGCTGCA ACGACCTGGG CACGGCGGTC
CTGCTGGCCG CCATGGCCAG GACCGGCGTC CGCGACATCG TCATGGCCGG TTCGATGGTC
GTCTACGGGG AGGGGCGGTA CACGTGCCCC GAGCACGGCG ACGTCCGCCC CGGCCCGAGG
GCGGAGGCGG ACCTGCGGGC GGGGGTGTTC GACCCGCCGT GCCCCCGGTG CGGGGCGCCG
CTCGTGCCCG GCCTGGTGGG GGAGGACGCG CCGAGCGACC CGCGCAACGT CTACGCCACC
ACGAAGCTGG CGCAGGAGCA CCTGTGCGCG GCGTGGGCCC GGTCCGTCGG CGGACGCGCG
GTGTCGCTGC GCTACCACAA CGTGTACGGG CCGGGGATGC CGCGCGACAC CCCGTACGCG
GGCGTGGCCT CCTTCTTCCG CTCGGCGCTG GCCCGGGGCG AGGCGCCCCG CGTGTTCGAG
GACGGCCGCC AGCGGCGCGA CTTCGTGCAC GTGGGCGACG TGGCGCGGGC CAACGTGGCG
GCCCTGGAGG CCGTCGCGGG CAGGGCCCCG GGGGAGCTGT CCGCCTTCAA CACCGGCAGC
GGGACCCCGC ACACCATCGG CGAGATGGCC CGGGCGCTCG CGGACGCGCA CGGCGGCCCG
GAGCCCCTGG TCACCGGCGA GTACCGGCTC GGCGACGTCC GGCACATCAC CGCCTCCTCC
GACCGGCTGA GGCGGGAGCT GTCCTGGCGG CCCAGGGTCG GCTTCGCCGA GGGGATGGCC
GAGTTCGCCC GCGCCGAACT GCGCGGTTCG ACCGGTTCGG CGGCGTCGGA GGCCGAGGTG
TCCGCGTGTC GCGACCGCCT TTCCCGGCCT GTGTCCACAT GA
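
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the block formatting and compute a GC percentage like the one reported in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) DNA sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGCGCGTAC TGGTCACGGG TGGTGCCGGG TTCATCGGAT CGAGGGTCGC GGAGGAACTC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full blocked sequence to `gc_percent` gives the value for the whole gene.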
 
Protein sequence
MRVLVTGGAG FIGSRVAEEL RLAGHEAVTL DAYLPQAHTG RDTEHRTVDV VGDVRDGEIV 
ERALRGVDAV CHQAAMVGLG SADFLDAPDY VRCNDLGTAV LLAAMARTGV RDIVMAGSMV
VYGEGRYTCP EHGDVRPGPR AEADLRAGVF DPPCPRCGAP LVPGLVGEDA PSDPRNVYAT
TKLAQEHLCA AWARSVGGRA VSLRYHNVYG PGMPRDTPYA GVASFFRSAL ARGEAPRVFE
DGRQRRDFVH VGDVARANVA ALEAVAGRAP GELSAFNTGS GTPHTIGEMA RALADAHGGP
EPLVTGEYRL GDVRHITASS DRLRRELSWR PRVGFAEGMA EFARAELRGS TGSAASEAEV
SACRDRLSRP VST
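
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG-ordered amino-acid string, which translation table 11 shares with the standard code for codons after the first. It makes no attempt to handle the alternative start codons of table 11, which does not matter here because this gene begins with ATG. The names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA string, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCGCGTAC TGGTCACGGG TGGTGCCGGG"))  # MRVLVTGGAG
print(translate("GTGTCCACAT GA"))  # VST
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.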