Gene Ndas_0861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0861 
Symbol 
ID9244706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1057466 
End bp1058755 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content74% 
IMG OID 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_003678811 
Protein GI297559837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.451908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0935072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCA CCGAACCCAC CCCGGCCACG ACACCGATCG ACCAGGCCGT CGTCCTCGAC 
CCGATCGAGG AGGCCGTCGC CGAGTTCGCC GCGGGGCGGG CCATCGTGGT GGTGGACGAC
GAGGACCGCG AGAACGAGGG CGACATCATC TTCGCCGCCG AGCTCGCCAC GCCCGAACTC
CTGGCGTTCA TGATCCGCTA CACCTCCGGT GTGGTGTGCG TCCCCCTGGA GGGCGGGGAC
CTGGACCGCC TCGACCTGCC GCTGATGACC GCGCGCAACG AGGAGAGCCT GCGCACCGCC
TACACGGTCA CCGTGGACGC GCGCGAGGGC GTCACCACGG GGATCTCCGC CGCCGACCGG
GCCCGCACCA TCCGCCTGCT GGCCGCGGAG GGCAGCGGCC CCGCCGACTT CGTGCGCCCC
GGCCACGTCC TGCCGCTGCG CGCCCGCCCG GGCGGCGTGC TGGCCCGGCG CGGCCACACC
GAGGCCTCCG TCGACTTCGC CCGCCTGGCC GGTCTGCGTC CCGCGGGCGT GCTCGCCGAG
GTCGTCAACG ACGACGGCAC CATGGCCCGT CTGCCCCAGC TGCGCGCGTT CGCCGACGAG
CACGGCCTCA AGCTGGTCTC GGTCGAGCAG CTCGCCGCCT ACCGCGAGGC GCTGGGGGAG
GCGCTCACCG AGGCCGAGGC CCACCCGCCG CTGGTCTCCC GCGCGGTCCA GACGCGCCTG
CCCAACAGGT ACGGCCAGTG GCGCGCGGTC GGGTACCGGG GTACCGCCGA CGGCGCCGAG
CACGTCGCGC TGGTGTACGG GGACCTGACC GACGGCACCG ACGTCCTGGC GCGCCTGCAC
TCGGAGTGCC TCACCGGCGA CGCGTTCGGC TCCCACCGCT GCGACTGCGG CGCCCAGCTG
GACGCCGCCA TGGCCGACAT CGCCGAGGAG GGGCGCGGGG TGCTCGTCTA CCTTGGCGGC
CACGAGGGCC GGGGGATCGG TCTGCTGCAC AAGCTGAGCG CCTACAGCCT CCAGGACCAG
GGGGCGGACA CCGTGGACGC CAACCTGCGC CTGGGCCTGC CCGCCGACGC GCGCGAGTTC
GGCGCCGGGG CGCAGATCCT GGCCGACCTG GGGGTGTCGT CGGTGCGGCT GCTGACCAAC
AACCCCGCCA AGGCCGAGGG ACTGGAGCAG CACGGGGTGC GGGTCAAGGA GCGGGTGGCG
ATGCCCTCCT TCGTCACCGA GGACAACATC GACTACCTGC GCACCAAGCG CGACCGCATG
GGCCACGACC TGACCGGCGT CGTCCGCTGA
 
Protein sequence
MTVTEPTPAT TPIDQAVVLD PIEEAVAEFA AGRAIVVVDD EDRENEGDII FAAELATPEL 
LAFMIRYTSG VVCVPLEGGD LDRLDLPLMT ARNEESLRTA YTVTVDAREG VTTGISAADR
ARTIRLLAAE GSGPADFVRP GHVLPLRARP GGVLARRGHT EASVDFARLA GLRPAGVLAE
VVNDDGTMAR LPQLRAFADE HGLKLVSVEQ LAAYREALGE ALTEAEAHPP LVSRAVQTRL
PNRYGQWRAV GYRGTADGAE HVALVYGDLT DGTDVLARLH SECLTGDAFG SHRCDCGAQL
DAAMADIAEE GRGVLVYLGG HEGRGIGLLH KLSAYSLQDQ GADTVDANLR LGLPADAREF
GAGAQILADL GVSSVRLLTN NPAKAEGLEQ HGVRVKERVA MPSFVTEDNI DYLRTKRDRM
GHDLTGVVR