Gene Ndas_1760 details


Gene Information

Locus tag: Ndas_1760
Symbol:
ID: 9245610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2146590
End bp: 2147690
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 80%
IMG OID:
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_003679694
Protein GI: 297560720
COG category:
COG ID:
TIGRFAM ID:
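
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive at both ends and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record. Treating them as 1-based and
# inclusive on both ends is an assumption, not something the record states.
START_BP = 2146590
END_BP = 2147690


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by a CDS of length_bp that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1101
print(protein_length(length_bp))  # 366
```

Both values agree with the Gene Length and Protein Length fields.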


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.122533
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00990126
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCCGCG TGGGCGTACT CGGCGCCGCG GGGGCGGTCG GTTCCGCGCT GCTGGCGCGG 
CTGGCCGGTA CCGGGGCGCG CCTGACGGCG GGCGTGCGCG ACCCCGGCCG CCTGTCCGGC
CCTCCCCCCG GGGCGGCCGT GCGCGTGGTC GACGCCGAGG ACCCCGCGGG GCTGGCGGAG
TTCTGCGCCT CCCACGACGT GGTGGTCAAC TGCGCGGGCC CCTCCGCGCT CCTGGGGGAC
CGGGTCCTGC GGGCCGCGAC CGCGTCCGGC GCCCACTACG TCTCGGTGGG GGACGACGGA
CGCGACCACC TCTCCCCCGC GGGACCGGAC GACCCCGGCC CGGCGCCGGG CCGCTGCGCC
CTGCTGGGGG CGGGCCTGCT GCCGGGGCTG AGCACACTGC TGCCCCGCGT GCTGGCCGAC
GGCTTCGACC GGGTGACGGA CATGACAGTC CACTCCGGCG GCCTGGAGCG CTTCACCCCG
GCGGCCGCGC GCGACTACGT CGCGGGGCTG GCCTCCGGCG CGGACCGCTC GCTGGCGGCC
TGGCGCGGCC GCCGTGTCGC GGGCGCCCTG CGGCCCGAGG CGGACGCGCG GCTGCCCTTC
CTGCCCCGGC CCGTGTCCCT GCACCCCTTC CTGAGCCCCG AGGCCGAACG CCTGGCACGC
GCCCTGTCCC TGGAGCGCCT GGACTGGTGG CACGTCTTCG AGGGGACGCG CACCACCGAC
GCGCTCGCCG GGACGCGGGG CCGGGGCGTC ACCGACCCCG ACGCCCTGGC GGACCTGCTC
GTGCGCGCCT CCGGCCTGGA GGTGTTCGGC CGCACCCAGT ACCAGGCGCT GGTGCTGCGG
GCCGGGGGCC GGATCGGCGG CCGTGAGCGC ACCCGGGTCC TCGCGCTCAC CGGCGCGGGT
CCGGCCCTGA GCGCCGAGGC CGCGGCCCTG GCGGTGCGGT TCGCGGCCGG GGGCGGGGCG
GCGGACGGAA CGCACTGGGC GGGCGAGGCG CTGCCGACCG CCGGGGTCCT CGACGGCCTG
CGGGACGCGC CCGGCGTCGC GTTCCTGCGC CTCACCGACG ACGATGACGC GCACTCCGGA
GTCGAGGAGG GGGTCCTGTG A
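
The GC content of the gene sequence above can be recomputed directly from the bases. This is a rough sketch, not the database's own method: how the record derives or rounds its GC figure is not stated, and `clean` and `gc_fraction` are hypothetical helper names.

```python
def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First row of the gene sequence; paste the full sequence to check
# the figure reported for the whole gene.
first_row = "ATGACCCGCG TGGGCGTACT CGGCGCCGCG GGGGCGGTCG GTTCCGCGCT GCTGGCGCGG"
print(f"{gc_fraction(first_row):.1%}")
```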
 
Protein sequence
MTRVGVLGAA GAVGSALLAR LAGTGARLTA GVRDPGRLSG PPPGAAVRVV DAEDPAGLAE 
FCASHDVVVN CAGPSALLGD RVLRAATASG AHYVSVGDDG RDHLSPAGPD DPGPAPGRCA
LLGAGLLPGL STLLPRVLAD GFDRVTDMTV HSGGLERFTP AAARDYVAGL ASGADRSLAA
WRGRRVAGAL RPEADARLPF LPRPVSLHPF LSPEAERLAR ALSLERLDWW HVFEGTRTTD
ALAGTRGRGV TDPDALADLL VRASGLEVFG RTQYQALVLR AGGRIGGRER TRVLALTGAG
PALSAEAAAL AVRFAAGGGA ADGTHWAGEA LPTAGVLDGL RDAPGVAFLR LTDDDDAHSG
VEEGVL
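
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the gene sequence is given 5' to 3' on the coding strand. It uses the standard amino acid assignments, which translation table 11 shares with table 1; table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...);
# "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene: six complete codons.
print(translate("ATGACCCGCG TGGGCGTACT"))  # MTRVGV
# Last 12 bases of the gene, ending in the TGA stop codon.
print(translate("GGGGTCCTGT GA"))          # GVL
```

The two fragments match the start (MTRVGV) and end (GVL) of the listed protein sequence; passing the full gene sequence to `translate` allows the same comparison for the whole protein.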