Gene Ndas_3990 details


Gene Information

Locus tag: Ndas_3990
Symbol:
ID: 9247861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4772267
End bp: 4773727
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: UDP-N-acetylglucosamine pyrophosphorylase
Protein accession: YP_003681893
Protein GI: 297562919
COG category:
COG ID:
TIGRFAM ID:
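
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive, 1-based coordinates and that the protein length excludes the stop codon. Both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4772267, 4773727)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Both results match the Gene Length (1461 bp) and Protein Length (486 aa) fields listed for Ndas_3990.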


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.797377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGTGA ACCGTCCGGC TGCCGTCATC GTCCTCGCGG CGGGCGAGGG CACCCGAATG 
AAGTCGAAGC TTCCCAAGGT CCTCCACGAA CTCAACGGCC GCAGCATGCT CGGCCACGTG
CTCGCGGCGG CGCGGGAACT CGACCCCCAG CACGCGGTCG TGGTCGTCGG CCACGCGCGT
GAGCAGGTCA GATCCCATCT GGAGGAGATC GCCCCCCAGG CCGCCACCGC GGTCCAGGAG
GAGCAGAACG GCACCGGCCA CGCCGTGCGC ATGGCGATCG AGGACCTGGC CGCCAAGGGC
GTCAAGCTCA GCGGCACCGT GGTCCTGACC TGCGGTGACA CCCCCCTCCT GCGCGGTTCC
ACGCTCGCGG AGCTCGTCGC GGCCCACGAC GAGGAGGGCA ACGCCGTCAC GGTGCTCTCC
GCGCGCGTAC CCGACCCCCA CGGTTACGGC CGCATCGTCC GCGACGCCGA CGGCGACTTC
ACCGGCATCG TCGAGCACGC CGACGCCACC CCCGAGCAGC ACGCGATCGA CGAGATCAAC
TCGGGCATGT ACGCCTTCGA CGGCGCCCTG CTGTCCGAGG TCGTCCAGCG CCTGTCCACC
GACAACGCCA AGGGCGAGGA GTACGTGACC GACGCGGTCT CCCTGCTGCG CGGCGACGGC
CACCGGGTCG GCGCCTGGGC GGCGGACGAC TGGCACGAGG TCCAGGGCGT CAACAACCGC
GTCCAGCTCT CCGAGGCCCG CCGCGTCCTC AACGACCGGC TGGTCAACGA GCACATGCTC
GCCGGGGTCA CCGTCGTGGA CCCCGCCACC ACCTGGATCG ACGCCCAGGT CACCATCGGC
CGCGACACCG TGATCGAACC GGGGACCCGG CTGCTGGGCG CCACCTCCGT CGGCGAGGAC
GCCGTCGTCG GCCCCCGCGC GGACCTGAGG GACACGGTCG TCGGCGCGGG CGCCACGGTG
CGCGAGACCA CGGCGGACCG GGCCGAGATC GGCCCCGGGG CCTCCGTCGG CCCCTACACC
TACCTGCGGC CGGGCACCCG CCTGGCCGAG CGGTCCAAGG CCGGAGCCTT CGTCGAGGTC
AAGAACTCGA ACGTCGGCGC CGAGTCCAAG ATCCCGCACC TGACCTACGT GGGCGACGCG
GACATCGGCG TGGGCAGCAA CATCGGCTGC TCCTCGGTGT TCGTCAACTA CGACGGGGTC
AACAAGTCCC GGAGCGTCAT CGGCGACCAC GTCAGGATCG GCAGCGACAA CACCATCGTC
GCCCCGGTCC GCGTGGGCGA CGGCGCCTAC TCCGGGGCGG GGACCGTGGT CCGCGACGAC
GTGCCGCCCG GTGCCCTCGC CGTTTCCGAG GGGCACCGCC AGCGCAACGT CGAGGGCTGG
ACCCGGCGCA AGCGCCCGGG CACGCCCTCC GCCGAGGCGG CGGAGCAGGC CGATCGGCAC
AGAGCCGACG ACAAGCAGTG A
 
Protein sequence
MSVNRPAAVI VLAAGEGTRM KSKLPKVLHE LNGRSMLGHV LAAARELDPQ HAVVVVGHAR 
EQVRSHLEEI APQAATAVQE EQNGTGHAVR MAIEDLAAKG VKLSGTVVLT CGDTPLLRGS
TLAELVAAHD EEGNAVTVLS ARVPDPHGYG RIVRDADGDF TGIVEHADAT PEQHAIDEIN
SGMYAFDGAL LSEVVQRLST DNAKGEEYVT DAVSLLRGDG HRVGAWAADD WHEVQGVNNR
VQLSEARRVL NDRLVNEHML AGVTVVDPAT TWIDAQVTIG RDTVIEPGTR LLGATSVGED
AVVGPRADLR DTVVGAGATV RETTADRAEI GPGASVGPYT YLRPGTRLAE RSKAGAFVEV
KNSNVGAESK IPHLTYVGDA DIGVGSNIGC SSVFVNYDGV NKSRSVIGDH VRIGSDNTIV
APVRVGDGAY SGAGTVVRDD VPPGALAVSE GHRQRNVEGW TRRKRPGTPS AEAAEQADRH
RADDKQ
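
The sequences above are printed in blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how one might clean the blocks, estimate GC content, and check that the gene and protein lengths agree. The helper names are hypothetical, and the length check assumes an unspliced CDS whose final codon is a stop codon.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def lengths_agree(dna: str, protein: str) -> bool:
    """True if the DNA has one codon per residue plus one stop codon."""
    return len(dna) == 3 * (len(protein) + 1)


# Usage with the first two blocks of the gene sequence above.
fragment = clean_sequence("GTGAGCGTGA ACCGTCCGGC")
print(len(fragment))  # 20
print(f"{gc_fraction(fragment):.0%}")
```

Pasting the full gene and protein sequences into `clean_sequence` and passing the results to `gc_fraction` and `lengths_agree` gives a quick consistency check against the GC content and length fields of the record.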