Gene Ndas_3123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3123 
Symbol 
ID9246979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3738866 
End bp3739912 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content72% 
IMG OID 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_003681038 
Protein GI297562064 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCATCG TGATGGCGCC GGACGCCACC CCCGAGAACA TCGACGATCT GGTCGAGCTG 
GTCTCCTCCG CCGGAGGCGA GGCCTACGTG ACGCGCGGGG TGAGCCGCAC GATCATCGGC
CTCGTCGGTG ACGTCGAGCG CTTCCAGGAC CTGGGCCTGG CCGCCAAGCC GGGCGTCAGC
GACGTCCTGC GCATCTCCGT CCCCTACAAG CTCGTCAGCC GCGAGAACCA CGACAGCCGC
ACGGTCGTCA GCGTGCGCGG AGTGCCGATC GGCGGCGACA ACGTCACCGT CATCGCCGGT
CCCTGCGCCG TCGAGACCCC CGAGCAGACC CTGGCCGCGG CCCGGATGGC GCTGGAGGCG
GGGGCGTCCC TGCTGCGCGG CGGCGCCTAC AAGCCCCGCA CCTCCCCCTA CGCCTTCCAG
GGCCTGGGCG AGGAGGGCCT GAGGATCCTC GCCGACGTGC GCGAGGAGAC CGGCCTTCCC
ATCGTCACCG AGGTCGTGGA CGCCGCCGAC GTGGAGCTGG TCGCCTCCTA CGCGGACATG
CTCCAGGTCG GCACCCGCAA CATGCAGAAC TTCGCCCTCC TCCAGGCGGT GGGGGACGCG
GGCAAACCGG TCCTGCTCAA GCGCGGCATG AGCGCCACCA TCGAGGAGTG GCTGATGGCC
GCCGAGTACA TCGCCCAGCG CGGCAACCTC GACATCGTCC TGTGCGAGCG CGGCATCCGC
ACCTTCGAGA AGGCCACCCG CAACACCCTG GACATCAGCG CGGTCCCGGT CGCCCAGAAC
CTGTCCCACC TGCCGGTGAT CGTCGACCCG TCCCACTCGG GCGGCAAGCG CGAGCTGGTG
CTGCCGCTCT CGCGCGCGGC CGTGGCGGTC GGCGCGGACG GCGTCATCGT CGACGTGCAC
CCCCACCCGG AGAACGCCCT GTGCGACGGC CCGCAGGCCC TCCTGCACGA GGACCTGGCC
GAGCTGCGCG ATCTGGCGGG CACCCTGGCC GCCCTGACCG GCCGCACCCT GACCCTGGCG
CCGGGCCGCG AACTGGCCGG TCTGTGA
 
Protein sequence
MVIVMAPDAT PENIDDLVEL VSSAGGEAYV TRGVSRTIIG LVGDVERFQD LGLAAKPGVS 
DVLRISVPYK LVSRENHDSR TVVSVRGVPI GGDNVTVIAG PCAVETPEQT LAAARMALEA
GASLLRGGAY KPRTSPYAFQ GLGEEGLRIL ADVREETGLP IVTEVVDAAD VELVASYADM
LQVGTRNMQN FALLQAVGDA GKPVLLKRGM SATIEEWLMA AEYIAQRGNL DIVLCERGIR
TFEKATRNTL DISAVPVAQN LSHLPVIVDP SHSGGKRELV LPLSRAAVAV GADGVIVDVH
PHPENALCDG PQALLHEDLA ELRDLAGTLA ALTGRTLTLA PGRELAGL