Gene Ndas_4336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4336 
Symbol 
ID9248211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5171449 
End bp5172738 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content74% 
IMG OID 
ProductUDP-N-acetylglucosamine 
Protein accessionYP_003682231 
Protein GI297563257 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.913017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGGAA TCGTTGAGCA GACGGCGGTC CCGTCCCGGG TGGCTACCGT CAGTCTGCAC 
ACCTCCCCGC TCGACCAGCC CGGCACCGGC GACGCCGGAG GCATGAACGT GTACGTGGTC
GAGGTGGCCC GGCGCATGGC CGAACGCGGC GTGGCCGTGG ACGTGTTCAC CCGCGCGACC
CGGGCCGACC TTCCGCCGGT GGTGGAGCTG GCCCCGGGCG TGAACGTGCG GCACGTGCCC
GCGGGGCCCT ACGGCCGCCT GGACAAGAAC ACCCTCGCCG AGCACCTGTG TCCGTTCATC
TTCGGGATGC TGCGGGCCGA GGCGCAGAAC GAACCCGACC ACTACGACCT CGTGCACGGG
CACTACTGGC TGTCCGGCCA GGCCGGTGTG GTGGCCGCCC GCCGCTGGGG CGTGCCCCTG
GTGCAGTCCA TGCACACCAT GGCCCGGGTG AAGAACGCCT CCCTGGCCGA CGGCGACGAG
CCCGAGCCCG AGGCCCGGTT GCGCGGGGAG GACCAGCTGG TCCGCCAGGC CGACCGGCTC
ATCGCCAACA CCGACGACGA GGCCCGCCAG CTCCGGGAGC ACTACGGCGC CCGCGACGGC
CAGATCAGCG TCATCCCCCC GGGTGTGGAC CTGGAGGTGT TCAGCCCCGG TTCGCGCCGG
GACGCGCTGG CGCGGATCGG ACTGCCCGCG GGCACCGAAC TCCTGCTGTT CGTGGGGCGC
GTGCAGCGGT TGAAGGCCCC CGACGTGCTC ATCCGGGCCG CCGCCGCGCT GCTGGAGCGC
GACCCCTCAC TGCGTTCGCG CCTGGTGGTG GGCGTGGTCG GCGGCCTGTC GGGCGGCGGG
ATGCGCGAGC CCGGCCTGCT CACCGACCTG GCCCGCTCCC TGGGCGTGGC GGACGTGGTG
CGGATCGAGC CGCCGCAGAC CCGCGAGCGT CTGGCCGACT ACTACCGCGC GGCCGCGGTG
ACCGTGGTGC CGTCCTACTC CGAGTCGTTC GGCCTGGTGG CGGTGGAGTC CCAGGCGTGC
GGCACCCCGG TCCTGGCCGC GCGCGTGGGC GGCCTGACCA CGGCCGTGGC CGACGGGGTG
TCCGGTGTCC TGGTACGGGG GCACAACCCG GACGACTACG CGGCCGAACT GCACCGGATG
ATCGCCGAAC CGGCGTGGCG TGCCAAGCTG GCGATGGCGG CGCCCGAGCA CGCCGCCACC
CTGGGCTGGT CGCGGACCGT CGACGAACTG CTGGACGTGT ACCGGGCGTG CACCGCCCCC
CGGGCGCTGC CCCTGGCGGC GTGCCGGTGA
 
Protein sequence
MSGIVEQTAV PSRVATVSLH TSPLDQPGTG DAGGMNVYVV EVARRMAERG VAVDVFTRAT 
RADLPPVVEL APGVNVRHVP AGPYGRLDKN TLAEHLCPFI FGMLRAEAQN EPDHYDLVHG
HYWLSGQAGV VAARRWGVPL VQSMHTMARV KNASLADGDE PEPEARLRGE DQLVRQADRL
IANTDDEARQ LREHYGARDG QISVIPPGVD LEVFSPGSRR DALARIGLPA GTELLLFVGR
VQRLKAPDVL IRAAAALLER DPSLRSRLVV GVVGGLSGGG MREPGLLTDL ARSLGVADVV
RIEPPQTRER LADYYRAAAV TVVPSYSESF GLVAVESQAC GTPVLAARVG GLTTAVADGV
SGVLVRGHNP DDYAAELHRM IAEPAWRAKL AMAAPEHAAT LGWSRTVDEL LDVYRACTAP
RALPLAACR