Gene Ndas_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1031 
Symbol 
ID9244877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1270457 
End bp1271590 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content80% 
IMG OID 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_003678980 
Protein GI297560006 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00494367 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0229045 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGCG CGACCGACCG CGCGGTGGGC GTGGTGGGCG CCTCCGGGGC CGTGGGCCGG 
GCCGCAGCCC GCCGCCTGCG CGCCCTGGGC CACACCCGCC TGCTGCTCGG GGGGCGCCGC
ACCGCGCCCC TGGAGGAGCT GGCCGCCGAA CTGGGCCCCG GCACCGCCGT CCGGGCGGTG
GACGCCGACT CCCCGGAGTC GCTGCGGGCC TTCTGCTCCG GACTCGACGT GGTGCTCAAC
TGCGCCGGGC CCTCCTACCG CATCGCCGAC GCCGTGGCGG TGCGGGCCCT GGACGCCGGA
GCCGACTACG TGGACGTGAC GGGCGACGGG CCCGCGCACG ACCGCCTCAG CCGCACCCCC
GCCGCACGGG ACCACGCGAT CGTCCTGTCG GCGGGGGTGC TCCCCGGCCT GTCCGCCCTG
CTGCCGCGCT GGTTCGCCGC CCGGCACGGC CTGGAGCGGA TGAGCGCCCA CGCGGGCGGG
CTGGAGAGGT GCACCGAGGC CGCCGCGGGC GACCTGCTGC TCTCCCTGCC CGGCGCCGAC
GACCCGACCG CCGTCTTCGG ACGGCCCCTG GCCGCCTGGC GGGAGGGCCG GGTCGTGGAG
AGGGCGCTGC GCGCCGCCGA CGGCGTCCGG CCGCCCGGGT TCCCGGGCAC CGCGTTCGTC
CAGCCCTTCC TCACCGAGGA GGCCCGCCGC CTGGCCGCCG ACCTGGGCCT GCGCGAACTG
GAGTGGTACA ACGTCCACCC CGGCGAGCGG GTCCGCGCCG TGCTGACCTC GGTCGCGGGG
CGCCCGGTCG CCGACCCGGC CGCGGCGGCG GAGCGCCTGC GGCGCGCGGC CGGGGTGGAC
CTGGCCGGGC GCACGCCCTA CTACCAGCTC GTGTACGCGC TCACCGCCCC CTCCGGGCGG
CGCAGCGTGA TGACCGCGCG CTTCTCCGAC AGCTACCGCA TGACCGGCCG CGTGGGCGCG
CAGGCCGCCG ACGCGGTGGC GCGCGGGCTG GTGCCCCGGG GCCTGCACCA CGCCGCCGAC
GTCCTCGACC CCGAGGCCGC GGTCACCGCC CTGTTCGACG ACCCCGAGGC CGCGAGCCTG
CGGGTGGAGG ACGCCGCGAG CGAGGACGCC GGGGTCGAGG AGGGCGCCCT GTGA
 
Protein sequence
MSGATDRAVG VVGASGAVGR AAARRLRALG HTRLLLGGRR TAPLEELAAE LGPGTAVRAV 
DADSPESLRA FCSGLDVVLN CAGPSYRIAD AVAVRALDAG ADYVDVTGDG PAHDRLSRTP
AARDHAIVLS AGVLPGLSAL LPRWFAARHG LERMSAHAGG LERCTEAAAG DLLLSLPGAD
DPTAVFGRPL AAWREGRVVE RALRAADGVR PPGFPGTAFV QPFLTEEARR LAADLGLREL
EWYNVHPGER VRAVLTSVAG RPVADPAAAA ERLRRAAGVD LAGRTPYYQL VYALTAPSGR
RSVMTARFSD SYRMTGRVGA QAADAVARGL VPRGLHHAAD VLDPEAAVTA LFDDPEAASL
RVEDAASEDA GVEEGAL