Gene Ndas_1092 details

Gene Information

Locus tag: Ndas_1092
Symbol:
ID: 9244938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1339828
End bp: 1342098
Gene Length: 2271 bp
Protein Length: 756 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: exopolysaccharide biosynthesis protein-like N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase
Protein accession: YP_003679040
Protein GI: 297560066
COG category:
COG ID:
TIGRFAM ID:
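
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive (an assumption, although it agrees with the values in this record) and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop codon


length = gene_length_bp(1339828, 1342098)
print(length)                     # 2271
print(protein_length_aa(length))  # 756
```

Both results match the Gene Length and Protein Length fields of the record.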


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.830982
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCTGT CCCGCCCAAG GCCCCCGGGC CCCGTCCGCT CGGCGGCGCT CCCCGTCATC 
GCCGTGCTCG TCGCGCCCAT GGTCCTGGCC GCTCCGTCGT CCCGGGTGAC GGCCGACGTC
GCCGCGCCGG CGGGCGGGAT CGCCGGGGCC GTGGAGGAGC GGATCGCCGC CGGGGTGGAC
CTCGCCTCCG GTCCCGTCCC GGGCGCCGGG GGCGGCGGGG AGCAGTTCGC CAGCCTCCTC
ACCGTCGACC TCACCGAGGG TGTCGGGGTC GAGTACGTCG ACGGGGGCGG CCTCACCTCC
CCCGCGACGG TCGCGGACAT GGCCGCCGCC GTCGAGCCCC CGGAGGGGTC CACCGTGGTC
GCCGCGGTCA ACGGCGGCTA CTTCGACATC GGCGCGACCC AGGCCCCCCT GGGCGCCGGG
ATGAGCGACG GGCGCCTGCT CACCTCGCCC GACCCGGGGT TCGCCAACGC CGTGGTCATC
GACGCCGGGG GCAGGGGCAG CGTGCGCCAG GTGGCCTTCG AAGGGACCGC GTCCCTGCCG
TCGGGGGACC TCGACATCGA CGCCCTCAAC ACCTCCGCCG TCCCCGCGGA CGGCCTCGGT
CTGTACACGT CGGACTGGGG CGGCCACCCC CGCGCACACG TGGTGTACGA ACCCGGGACC
AGCCCCGGGG ACACGGCCGT CGCCGAGGCC GTGGTCTCCG AGGGCGTCGT CGAGCGGGTC
AGCGTCACCC CCGGCAGCGG CCCGATCGAG GAGGACGAAC AGGTCCTGGT GGCGCGCGGC
TCCGCGGCGG AGCGGATCGC CGACCTGTCC GAGGGCGACC CGGTCGAGGT GGAGCACACC
CTCACGGCCG AGGGCGCCGA ACCCCGCGTC GTCGTGGGCG GACGGCACGT GCTGGTGCGC
GACGGCGAAC CCGTCCCCGT CGAGGACGTC TCCCGCGCGC CGCGTACCGC GATCGGGTTC
TCCGAGGACG GCGAGGTCAT GCACGTGGTG ACCGCGGACG GGCGCAACCG CGGCCACGCC
GGATCCACGC TCGCGGAGGT CGCCGAACTG CTCGCCGCGT CCGGGGCCGA GCAGGCCCTG
GAGCTGGACG GCGGCGGATC CTCGACCCTG CTCGTGCGCG AACCCGGGGG CGTCTCCCCG
GTGCTGCGCA ACCGCGCCGG GGACCAACTC CGGGAGGTCC CCGACGGCCT TGTGATCACG
GCGACCGAGG GCTCGGCCCG GACCTCGGGC CTGTGGCTGC GGCCCGCCCT GGAACCCCGG
CCCGAACACG GCTCCCCCGT GCCGCCCCAG GCCGACCCCC GACGCGTGTT CACCGGCATG
CACCGCACCC TGGCCGCCAC CGGGCACGAC GAGGCCTTCG GGCCGTCCGG GCCCGGCGCG
CCCCGCTCCC CCGACGACCG CACCGATGAG CTGGAGCTGT CCGCGCCCAC CGGACACGGC
GACGGCCCCC GGTTCGTGGC GGGAGACCCC GGACCGGTGA CCGTCACGGG GCGGGCGGGC
CGGGTCAGCG ACACCGTCGA CCTGGAGGTG CTGCCCGCCC CGGACACGCT CCTCGCCGCA
CCCCGGCGCC TGGGCATGGC CTCCGCCGAG GACACCGCCT CCTTCGTGCT CACCGGGGCC
ACCGAGGACG GGCGGCGGGC GCCCGTCGAA CCGGTCGACG CACGGGTCGA GGCCGTCCCC
GACCTGGTCG AGGTCGTGGA CCGGGGCGAC GGAGGCTTCG AGGTGCGGCC GCGCGCCGGG
GAGGGGACGG GCGTCCTCAC GGTGACCGCG GGCGGGGTGA GCACGCGGAT ACCGTTCTCC
ATCGGTACGC GGACGACTCC CCTGGCGGAC TTCGAGGACG CGCGGGAGTG GACGGCGCGG
TCCGCGCGCG GCGGGGCAGA GGTGCGCCCC GTGGCCGGGC GCGACGGCCC CGGTCTGGCC
CTGGCCTACG ACTTCACCCG CGACATCCGC ACCCGGACCG CCTCCGCCCA CCCGCCCGAG
CCGCTCGCCC TGGACCGCCA GGCCTTCGCG TTCACGGTGG GCGTGCGCGG CGACGGCAAC
GGCGCCCGTC TCATGCTCAG CCTCACCGAC GCGCACGGTG TCGGCCACTC CCTGGAGGGC
CCCGCCGTGG ACTGGGAGGG CTGGCGCGAC GTGCGCCTGG AGGTGCCCGA GGACGTCGGG
CACCCGGTCA CGGTCTCGCG GGTGTACCTG CTGGAGGAGG ATCCGAGCCG GGCCTACGCG
GGTGAGGTGG TGCTGGACGG CCTCACCGCC CGGACCACGA CGGGGCCCTG A
 
Protein sequence
MNLSRPRPPG PVRSAALPVI AVLVAPMVLA APSSRVTADV AAPAGGIAGA VEERIAAGVD 
LASGPVPGAG GGGEQFASLL TVDLTEGVGV EYVDGGGLTS PATVADMAAA VEPPEGSTVV
AAVNGGYFDI GATQAPLGAG MSDGRLLTSP DPGFANAVVI DAGGRGSVRQ VAFEGTASLP
SGDLDIDALN TSAVPADGLG LYTSDWGGHP RAHVVYEPGT SPGDTAVAEA VVSEGVVERV
SVTPGSGPIE EDEQVLVARG SAAERIADLS EGDPVEVEHT LTAEGAEPRV VVGGRHVLVR
DGEPVPVEDV SRAPRTAIGF SEDGEVMHVV TADGRNRGHA GSTLAEVAEL LAASGAEQAL
ELDGGGSSTL LVREPGGVSP VLRNRAGDQL REVPDGLVIT ATEGSARTSG LWLRPALEPR
PEHGSPVPPQ ADPRRVFTGM HRTLAATGHD EAFGPSGPGA PRSPDDRTDE LELSAPTGHG
DGPRFVAGDP GPVTVTGRAG RVSDTVDLEV LPAPDTLLAA PRRLGMASAE DTASFVLTGA
TEDGRRAPVE PVDARVEAVP DLVEVVDRGD GGFEVRPRAG EGTGVLTVTA GGVSTRIPFS
IGTRTTPLAD FEDAREWTAR SARGGAEVRP VAGRDGPGLA LAYDFTRDIR TRTASAHPPE
PLALDRQAFA FTVGVRGDGN GARLMLSLTD AHGVGHSLEG PAVDWEGWRD VRLEVPEDVG
HPVTVSRVYL LEEDPSRAYA GEVVLDGLTA RTTTGP
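
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with GTG being used as an initiation codon under translation table 11. The following is a minimal sketch of how the protein could be derived from the gene sequence. It assumes the standard codon assignments (which table 11 shares with table 1) and treats only ATG, GTG and TTG as start codons, a simplification chosen for this example. The `translate` function and its `is_cds_start` parameter are hypothetical names.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption for this sketch: a subset of the initiators allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, with the original 10-base spacing.
first_block = "GTGAACCTGT CCCGCCCAAG GCCCCCGGGC CCCGTCCGCT CGGCGGCGCT CCCCGTCATC"
print(translate(first_block))  # MNLSRPRPPGPVRSAALPVI
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence should reproduce the whole protein, stopping at the terminal TGA.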