Gene Ndas_1177 details


Gene Information

Locus tag: Ndas_1177
Symbol:
ID: 9245027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1432477
End bp: 1433850
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_003679124
Protein GI: 297560150
COG category:
COG ID:
TIGRFAM ID:
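
The start and end coordinates, gene length, and protein length listed above agree with one another, assuming the coordinates are 1-based and inclusive and the final codon is a stop codon. A minimal sketch of that arithmetic follows; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1432477
end_bp = 1433850

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1374
print(remainder)       # 0
print(protein_length)  # 457
```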


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.387852
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.54744
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCCG AGCAGAGCAC GACCACCGTG GACGACTGGC GTTCCCTACC CGCCGAACAG 
CAGCCCGACT GGCCCGACCC CGAAGCCCTC TCCGACGCGC TCACCGACCT GGCCTCCTAC
CCGCCGCTGG TCTTCGCGGG CGAGTGCGAC CAGCTCAGGG CCCGCCTGGG CGCCGTGGCG
CGCGGAGAGG AGTTCCTCCT CCAGGGGGGC GACTGCGCCG AGGCCCTCGA CGGCGTCTCC
GCCGACCAGA TCAGGAACAA GCTCAAGACG CTCTTCCAGA TGGGCGCCGT GCTCACCTAC
GCGGGCTCCG TCCCCGTGGT GAAGGTGGGC CGCATCGCCG GGCAGTACGG CAAGCCGCGT
TCCAGCCCCA CCGAGACCCG TGACGGGGTG AGCCTGCCCT CCTACCGCGG CGACGCCGTC
AACGGACGCG CGTTCACCGC CCGCGACCGG CGCCCCGACC CCGAGCGGCT CAAGCGCGCC
TACCACGCCT CCGCGGCGAC CCTGAACCTG GTGCGCGCCT TCACCACCGG CGGCTACGCC
GACCTGTCCC AGGTGCACGC CTGGAACCGC GACTTCGTCC GCGACTCCCC CGCCGGTCGG
CGCTACGAGC GCCTGGCCCG CGAGATCGAC AACGCGCTGG CGTTCATGCG GGCGTGCGGG
GTCACCGACG CCGAGGCGGT GCGCACCACC GAGTTCTACT CCTCCCACGA GGCCCTGCTG
CTGGACTACG AGACGGCGCT GACCCGCGTG GACTCGCGTA CCGGCGGACT CTACGCCGTG
TCCGGGCACA TGGTCTGGAT CGGCGAGCGC ACGCGGCGGT TGGACGGGGC GCACGTGGAG
TTCGCCTCGC GCATACGCAA CCCGGTCGGG GTCAAGCTCG GCCCGGGCGC CGAACCCGAC
GATGTGCTGG CGCTGGTGGA CAAGCTCGAC CCCGACCGCG AGCCCGGACG GCTGACCCTC
ATCACGAGGA TGGGCGCCGG GCGCGTCCGC GACCGGCTTC CCGCGCTGGT GGAGAAGGTC
ACGGCGTCCG GTGCGCGGGT CGCGTGGGTG TGCGACCCCA TGCACGGCAA CACCTTCACC
GCCGGCAGCG GGCACAAGAC CCGGCGCTTC GACGACGTCC TGGACGAGGT GCGGGGCTTC
TTCGAGGTCC ACCACGCCCT GGGCACCCAC CCCGGCGGCA TCCACGTGGA GCTGACCGGC
GACGACGTGA CCGAGTGCGT GGGCGGCGGG AGCCGCATCG GCGTGGACGA CCTGCGGCGC
CGGTACGAGA CCGCCTGCGA CCCGCGCCTG AACCGCAGCC AGTCCCTGGA CCTGGCCTTC
CTGGTCGCCG AAGTGCTGGG AGAGGCCCGC CGGGCGAAGG AGGCGCGCCG ATGA
 
Protein sequence
MNPEQSTTTV DDWRSLPAEQ QPDWPDPEAL SDALTDLASY PPLVFAGECD QLRARLGAVA 
RGEEFLLQGG DCAEALDGVS ADQIRNKLKT LFQMGAVLTY AGSVPVVKVG RIAGQYGKPR
SSPTETRDGV SLPSYRGDAV NGRAFTARDR RPDPERLKRA YHASAATLNL VRAFTTGGYA
DLSQVHAWNR DFVRDSPAGR RYERLAREID NALAFMRACG VTDAEAVRTT EFYSSHEALL
LDYETALTRV DSRTGGLYAV SGHMVWIGER TRRLDGAHVE FASRIRNPVG VKLGPGAEPD
DVLALVDKLD PDREPGRLTL ITRMGAGRVR DRLPALVEKV TASGARVAWV CDPMHGNTFT
AGSGHKTRRF DDVLDEVRGF FEVHHALGTH PGGIHVELTG DDVTECVGGG SRIGVDDLRR
RYETACDPRL NRSQSLDLAF LVAEVLGEAR RAKEARR
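
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing GC content. The sketch below is one hedged way to do that in plain Python. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here.

```python
# Illustrative helpers; all names are hypothetical and not part of the record.
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; trailing partial codons are ignored."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 nt and the final line of the gene sequence, copied from above.
first_block = "ATGAACCCCG AGCAGAGCAC GACCACCGTG"
last_line = "CTGGTCGCCG AAGTGCTGGG AGAGGCCCGC CGGGCGAAGG AGGCGCGCCG ATGA"

print(translate(first_block))  # MNPEQSTTTV
print(translate(last_line))    # LVAEVLGEARRAKEARR*
```

Passing the full gene sequence to `translate` should give the listed protein followed by `*`, and `gc_percent` on the same input can be compared with the reported GC content of 73%.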