Gene Ndas_3852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3852 
Symbol 
ID9247723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4624092 
End bp4625453 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content72% 
IMG OID 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_003681755 
Protein GI297562781 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0199691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.463935 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATGCCG CAGCCTCCAA CCCAGCCCTT GTCTCCGAGC AGTCCGCCGG TGCGGTCAGC 
GACCTCGTCG TTCTCGGCCT CGGCTACGTC GGCCTGCCGC TGGCGGCCGA GGCCGTGTCC
GCCGGCCTGA AGGTCACCGG TCTGGACGTG AGCGACCGCG TCGTCGACGG GCTCAACAAC
GCCGTCTCGC ACGTGGACGA CCTCTCCGCG GAGGACGTGC GCCGGATGCT GGACCGCGGT
TTCACCGCGA CCACCGACCC GGCCTGCCTG GCCGCCGCGC GCACCATCGT GATCTGCGTG
CCCACGCCGC TGTCGGCCGA GGGCGGCCCC GACCTGGGCG CGGTGACGTC CGCGGCCGAG
GCGATCGCCG CCCAGCTGCA GCCCGGCACC CTGGTCATCC TGGAGTCGAC CACCTACCCC
GGCACCACCG AGGAGGTGGT CCGCCCGCTG CTGGAGAAGT CCGGCCTGGT CGCGGGCGCC
GACTTCCACC TGGCCTTCTC GCCCGAGCGC ATCGACCCCG GCAACCCGAC CTTCGGCGTG
GCCAACACGC CCAAGGTGGT CGGCGGCCTG ACGCAGGAGT GCGGCGAGGC GGCGGCGGAG
TTCTACGGCG CCTTCGTGAA CACGGTGGTG CGGGCGCGCG GCACCCGCGA GGCCGAGATG
GCCAAGCTGC TGGAGAACAC CTACCGCCAC GTCAACATCG CCCTGGTCAA CGAGATGGCC
ATCTTCTGCC AGGAGCTGGG CGTGGACCTG TGGGACTCCA TCGCCGCGGC GGCCACCAAG
CCGTTCGGCT TCCAGGCCTT CTACCCGGGC CCGGGCGTGG GCGGCCACTG CATCCCCATC
GACCCGAACT ACCTGTCGTA CAAGGTCAAG ACCCTCGGCT ACCCGTTCCG GTTCGTGGAG
CTGGCCCAGG AGATCAACGG CCGCATGCCC TCCTACGTCA TCCAGCGGGC GCAGGAGCTG
CTCAACGACT CCGGCCTGGC CCTGTCGCGC TCCAAGGTGC TGCTGCTGGG CGTCACCTAC
AAGGCCGACA TCGCCGACCA GCGCGAGTCC CCGGCCCGGC CGGTCGCGCG CAAGCTGGCC
GCCAAGGGCG CCACGCTGAC CTACCACGAC CCGCACGTGG AGTCCTGGCA GGTCGACGGC
GTGGACGTGC CCAGGTCCAC CGACCTGGAC CGCGCCCTGG CCGAGGCCGA CCTGACCATC
CTGCTCACCG ACCACGCCGA GTACCGGCCC AAGCGGCTGG AGGAGTACGC GCGGCTGCTC
CTGGACACCC GGGGCGTGCT GCGCCGCCCC GACCCCGAGG ACTCCGCGGT CCCCTCGCAG
GTGCGGCGCC ACGTCACACG GGAGGGCATC GAGGTCCTGT GA
 
Protein sequence
MDAAASNPAL VSEQSAGAVS DLVVLGLGYV GLPLAAEAVS AGLKVTGLDV SDRVVDGLNN 
AVSHVDDLSA EDVRRMLDRG FTATTDPACL AAARTIVICV PTPLSAEGGP DLGAVTSAAE
AIAAQLQPGT LVILESTTYP GTTEEVVRPL LEKSGLVAGA DFHLAFSPER IDPGNPTFGV
ANTPKVVGGL TQECGEAAAE FYGAFVNTVV RARGTREAEM AKLLENTYRH VNIALVNEMA
IFCQELGVDL WDSIAAAATK PFGFQAFYPG PGVGGHCIPI DPNYLSYKVK TLGYPFRFVE
LAQEINGRMP SYVIQRAQEL LNDSGLALSR SKVLLLGVTY KADIADQRES PARPVARKLA
AKGATLTYHD PHVESWQVDG VDVPRSTDLD RALAEADLTI LLTDHAEYRP KRLEEYARLL
LDTRGVLRRP DPEDSAVPSQ VRRHVTREGI EVL