Gene Information |
Locus tag | Ndas_4598 |
Symbol | |
ID | 9248479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5453194 |
End bp | 5454477 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | N-acylglucosamine 2-epimerase |
Protein accession | YP_003682490 |
Protein GI | 297563516 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
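The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
# Values copied from the "Start bp" and "End bp" fields of the record.
start_bp = 5453194
end_bp = 5454477


def gene_length_bp(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Number of residues for an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1284 427
```

The result agrees with the "Gene Length" (1284 bp) and "Protein Length" (427 aa) fields. Strand does not affect the calculation, since the length of the interval is the same on either strand.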
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00161711 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACACCAT CGACACGACC GGCCCCGGGC CTGCCCGAGT GGCTGCGCGA GGAGGAGCGG CGGCTGTACG GCTTCGCCGC GGGATCCGCC GTGTCCGACG GCTTCGGCTG GCTCGACGCG GACGGGCGTG TGGAGCGGGA CCGGCCCGCC GCCGCCTGGA TCACCGCGCG GATGACGCAC GTGTTCTCCC TGGCCCACCT GCGGGGGGAG GCCGACGCCG GACGGCTGGC CGACCACGGT GTGGCGTCCC TGGCGACGGG CCCCCTGCGC GACGCCCGGG ACGGCGGCTG GTTCGACCGG GTGCCGGACG ACACCGGGGG GAACACTCCC CGGGAGCGCT CCCATGAGCT GCCGCGCAAG TCGGCCTACG AGCACGCGTT CGTGGTCCTG GCCGCCTCCA GCGCCACCCT CGCCGGTCGC CCGGGAGCGC GTGAGCTGCT CGACGACGCG CTGCGGGTCG TCGAGGAGCG CTTCTGGGAG GAGTCCGCGG GCGCCCTGCG CGAGAGCTGG GACAGCGGGT GGACCGCCAC CGAGGACTAC CGGGGCGCCA ACAGCAACAT GCACGCCGTG GAGGCCTTCC TGGCCGCGGC CGACGCCACC GGCGACGCCG TGTGGGCCCG CCGCGCGCTC TCCGTCGCCG AACGCCTGAT CCACGGCGTG GCCGCCGAGC ACGACTGGCG CCTGCCCGAG CACTTCACCT CCGACTGGAA GCCCGTCCCG GACTACAACC GGGACCGGCC CGACCACCCC TTCCGGCCCT TCGGGAGCAC CACCGGCCAC CTGCTCGAAT GGGCGCGCCT GCTGGTCCAC CTGGAGGTCG CGCTGAGCCG TGCGGGCGAC CCCGTCCCCG CGTGGCTGCG CACCGACGCC GAGGCGCTGT TCGACCACGC CGTCCGGCGC GGCTGGGCCG TCGACGGCGC CGAGGGGTTC GTCTACACCC TCGACTGGCA GGACCGGCCG GTCGTGCGCG AGCGCATGCA CTGGGTGGTG GCCGAGGCCG CCATGGCAGC CTGGGCGCTG GGGGAGCACA CCGGCGTGGC CACCTACGCC GACCTGCACG AACGCTGGTG GGCCTACGCC GACCGGTACC ACGTGGACCG CGAACGCGGC AGCTGGCACC ACGAACTGGA CCCGGACAAC CGCCCGGCCG CGAGCGTGTG GCCGGGCAAA CCGGACGTCT ACCACGCCTA CCAGGCGGCA CTCCTGCCCC AGATCGGGCT GTCGGCCTCC ATCGCGGCCG CCCTCCTGCC CGGCACCGAC CACGACTCCG GAGGAAACGC GTGA
|
Protein sequence | MTPSTRPAPG LPEWLREEER RLYGFAAGSA VSDGFGWLDA DGRVERDRPA AAWITARMTH VFSLAHLRGE ADAGRLADHG VASLATGPLR DARDGGWFDR VPDDTGGNTP RERSHELPRK SAYEHAFVVL AASSATLAGR PGARELLDDA LRVVEERFWE ESAGALRESW DSGWTATEDY RGANSNMHAV EAFLAAADAT GDAVWARRAL SVAERLIHGV AAEHDWRLPE HFTSDWKPVP DYNRDRPDHP FRPFGSTTGH LLEWARLLVH LEVALSRAGD PVPAWLRTDA EALFDHAVRR GWAVDGAEGF VYTLDWQDRP VVRERMHWVV AEAAMAAWAL GEHTGVATYA DLHERWWAYA DRYHVDRERG SWHHELDPDN RPAASVWPGK PDVYHAYQAA LLPQIGLSAS IAAALLPGTD HDSGGNA
|
| |
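The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus two simple checks (GC fraction and the first and last codons); the helper names are hypothetical, and only the first two and last two blocks of the gene sequence are copied in to keep the example short. Under translation table 11, TTG is an accepted alternative start codon and is read as methionine at initiation, which is why the protein begins with M although the gene begins with TTG.

```python
def clean_sequence(raw):
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def codons(seq):
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two and last two blocks of the gene sequence, copied from the record.
head = clean_sequence("TTGACACCAT CGACACGACC")
tail = clean_sequence("GAGGAAACGC GTGA")

first_codons = codons(head)   # in frame, because head starts at position 1
stop_codon = tail[-3:]        # last three bases of the gene

print(first_codons[0], stop_codon)
print(round(gc_fraction(head), 2))
```

Applying `gc_fraction` to the full cleaned gene sequence gives the value to compare against the "GC content" field; the figure printed here covers only the first 20 bases.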