Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1092 |
Symbol | |
ID | 9244938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1339828 |
End bp | 1342098 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | exopolysaccharide biosynthesis protein-like N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
Protein accession | YP_003679040 |
Protein GI | 297560066 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.830982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCTGT CCCGCCCAAG GCCCCCGGGC CCCGTCCGCT CGGCGGCGCT CCCCGTCATC GCCGTGCTCG TCGCGCCCAT GGTCCTGGCC GCTCCGTCGT CCCGGGTGAC GGCCGACGTC GCCGCGCCGG CGGGCGGGAT CGCCGGGGCC GTGGAGGAGC GGATCGCCGC CGGGGTGGAC CTCGCCTCCG GTCCCGTCCC GGGCGCCGGG GGCGGCGGGG AGCAGTTCGC CAGCCTCCTC ACCGTCGACC TCACCGAGGG TGTCGGGGTC GAGTACGTCG ACGGGGGCGG CCTCACCTCC CCCGCGACGG TCGCGGACAT GGCCGCCGCC GTCGAGCCCC CGGAGGGGTC CACCGTGGTC GCCGCGGTCA ACGGCGGCTA CTTCGACATC GGCGCGACCC AGGCCCCCCT GGGCGCCGGG ATGAGCGACG GGCGCCTGCT CACCTCGCCC GACCCGGGGT TCGCCAACGC CGTGGTCATC GACGCCGGGG GCAGGGGCAG CGTGCGCCAG GTGGCCTTCG AAGGGACCGC GTCCCTGCCG TCGGGGGACC TCGACATCGA CGCCCTCAAC ACCTCCGCCG TCCCCGCGGA CGGCCTCGGT CTGTACACGT CGGACTGGGG CGGCCACCCC CGCGCACACG TGGTGTACGA ACCCGGGACC AGCCCCGGGG ACACGGCCGT CGCCGAGGCC GTGGTCTCCG AGGGCGTCGT CGAGCGGGTC AGCGTCACCC CCGGCAGCGG CCCGATCGAG GAGGACGAAC AGGTCCTGGT GGCGCGCGGC TCCGCGGCGG AGCGGATCGC CGACCTGTCC GAGGGCGACC CGGTCGAGGT GGAGCACACC CTCACGGCCG AGGGCGCCGA ACCCCGCGTC GTCGTGGGCG GACGGCACGT GCTGGTGCGC GACGGCGAAC CCGTCCCCGT CGAGGACGTC TCCCGCGCGC CGCGTACCGC GATCGGGTTC TCCGAGGACG GCGAGGTCAT GCACGTGGTG ACCGCGGACG GGCGCAACCG CGGCCACGCC GGATCCACGC TCGCGGAGGT CGCCGAACTG CTCGCCGCGT CCGGGGCCGA GCAGGCCCTG GAGCTGGACG GCGGCGGATC CTCGACCCTG CTCGTGCGCG AACCCGGGGG CGTCTCCCCG GTGCTGCGCA ACCGCGCCGG GGACCAACTC CGGGAGGTCC CCGACGGCCT TGTGATCACG GCGACCGAGG GCTCGGCCCG GACCTCGGGC CTGTGGCTGC GGCCCGCCCT GGAACCCCGG CCCGAACACG GCTCCCCCGT GCCGCCCCAG GCCGACCCCC GACGCGTGTT CACCGGCATG CACCGCACCC TGGCCGCCAC CGGGCACGAC GAGGCCTTCG GGCCGTCCGG GCCCGGCGCG CCCCGCTCCC CCGACGACCG CACCGATGAG CTGGAGCTGT CCGCGCCCAC CGGACACGGC GACGGCCCCC GGTTCGTGGC GGGAGACCCC GGACCGGTGA CCGTCACGGG GCGGGCGGGC CGGGTCAGCG ACACCGTCGA CCTGGAGGTG CTGCCCGCCC CGGACACGCT CCTCGCCGCA CCCCGGCGCC TGGGCATGGC CTCCGCCGAG GACACCGCCT CCTTCGTGCT CACCGGGGCC ACCGAGGACG GGCGGCGGGC GCCCGTCGAA CCGGTCGACG CACGGGTCGA GGCCGTCCCC GACCTGGTCG AGGTCGTGGA CCGGGGCGAC GGAGGCTTCG AGGTGCGGCC GCGCGCCGGG GAGGGGACGG GCGTCCTCAC GGTGACCGCG GGCGGGGTGA GCACGCGGAT ACCGTTCTCC ATCGGTACGC GGACGACTCC CCTGGCGGAC TTCGAGGACG CGCGGGAGTG GACGGCGCGG TCCGCGCGCG GCGGGGCAGA GGTGCGCCCC GTGGCCGGGC GCGACGGCCC CGGTCTGGCC CTGGCCTACG ACTTCACCCG CGACATCCGC ACCCGGACCG CCTCCGCCCA CCCGCCCGAG CCGCTCGCCC TGGACCGCCA GGCCTTCGCG TTCACGGTGG GCGTGCGCGG CGACGGCAAC GGCGCCCGTC TCATGCTCAG CCTCACCGAC GCGCACGGTG TCGGCCACTC CCTGGAGGGC CCCGCCGTGG ACTGGGAGGG CTGGCGCGAC GTGCGCCTGG AGGTGCCCGA GGACGTCGGG CACCCGGTCA CGGTCTCGCG GGTGTACCTG CTGGAGGAGG ATCCGAGCCG GGCCTACGCG GGTGAGGTGG TGCTGGACGG CCTCACCGCC CGGACCACGA CGGGGCCCTG A
|
Protein sequence | MNLSRPRPPG PVRSAALPVI AVLVAPMVLA APSSRVTADV AAPAGGIAGA VEERIAAGVD LASGPVPGAG GGGEQFASLL TVDLTEGVGV EYVDGGGLTS PATVADMAAA VEPPEGSTVV AAVNGGYFDI GATQAPLGAG MSDGRLLTSP DPGFANAVVI DAGGRGSVRQ VAFEGTASLP SGDLDIDALN TSAVPADGLG LYTSDWGGHP RAHVVYEPGT SPGDTAVAEA VVSEGVVERV SVTPGSGPIE EDEQVLVARG SAAERIADLS EGDPVEVEHT LTAEGAEPRV VVGGRHVLVR DGEPVPVEDV SRAPRTAIGF SEDGEVMHVV TADGRNRGHA GSTLAEVAEL LAASGAEQAL ELDGGGSSTL LVREPGGVSP VLRNRAGDQL REVPDGLVIT ATEGSARTSG LWLRPALEPR PEHGSPVPPQ ADPRRVFTGM HRTLAATGHD EAFGPSGPGA PRSPDDRTDE LELSAPTGHG DGPRFVAGDP GPVTVTGRAG RVSDTVDLEV LPAPDTLLAA PRRLGMASAE DTASFVLTGA TEDGRRAPVE PVDARVEAVP DLVEVVDRGD GGFEVRPRAG EGTGVLTVTA GGVSTRIPFS IGTRTTPLAD FEDAREWTAR SARGGAEVRP VAGRDGPGLA LAYDFTRDIR TRTASAHPPE PLALDRQAFA FTVGVRGDGN GARLMLSLTD AHGVGHSLEG PAVDWEGWRD VRLEVPEDVG HPVTVSRVYL LEEDPSRAYA GEVVLDGLTA RTTTGP
|
| |