Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3967 |
Symbol | |
ID | 9247838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4743404 |
End bp | 4745962 |
Gene Length | 2559 bp |
Protein Length | 852 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_003681870 |
Protein GI | 297562896 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCG ACGCTCCCGG CCCGTCCGGA CCGGAACTGA GGGAGTACAC GGCACTGCTG CGCCGCCGCT GGCGGTTCGT CGGCGCCGGG GTGCTCGGCG GGCTGGCCCT GGCCTCCGCC GCGGTGGTCG CACTCCCCGC CGCCTACACC TCCGTCTCCG CCGTCCAGGT CCAGCCCAGC GGGATGGCCG AGTTCACCGG GGAGCGCTCC GGCCGCCTGG CCGGGGACGT CAACCTCGAC ACCGAGGCAC AGGTCCTGCT GTCGGAGCGC GTGTCGTCCG CCGTGGCCGA GGCCCTGGCG GAGGAGGGCG GTGCCGCGCC CTCCGTGGCG GACCTGCGCG AGCGGGTGGA CGTGAGCGTT CCCCCCAACA GCAGCGTCCT GGAGATCAGC TACTCCGCCG GGAGCCCCGA GGCCGCGCGG GCGGGCGCGC AGGCCTACGC CGACGCCTAC CTCGAACTGC GCCGCGAGCG GATCGACGGG CTGATCGAGA GCCACCTGGA GGCGCTGCGC GGTGAGCAGG AGGCCCGTTA CGAGGCCCTC GCCGAGACCG CCGGGGAGTC CGCCGCCCCC GGCGCGGACG CGCGGGTGGA GGCTCTGCGC GCGGAGATCA CCGAGCTGGG CAACGGCATC AGCCCGCTCA GCGCCCTGGC GGAGACGGTC GAGCCCGGCA GTGTCATCAC CCCGGCGGGG CTCCCCGAGC GCGCGAGCAG CCCGATACCC GCCCTGTGGC TGGTCGGCGG CGCCGCGCTC GGCCTGCTGA CGGGGCTGCT GGCGGCCGTG GTGCGCGACC GCCTCGACCC GAGGCTGCAC GACGCCGAGG AGACCGGGCG GATCGGGGCG GTGCCGGTGC TGCTCGACCT GTCCGAGCGG GTCGGGCGCG GCCAGCGCTC CCCCGGGCTG CTGCCGGACG GCGACCGGCG CGGGCAGCGG GTCAACGAGT TCGCGCACCT GGTGCGCGCC CGCCTCGCCG CGGTCCCCGT TCCCGCGGCG GCCGGCGGAC CCTCGGAGGT CGCTGGGCCG ACCGGGGCGG GTGGGACGGG GGTGTCCGAC CGCGCCGGAG CGGGCGCGGG CGGCGCCCGG GGGGAGGGCG AACTGGCCGT GCTCCTCGGG GACGAGGAGG CGGTCCTCGG CCGCGTGCTG CTCGTCACCG CGACCACCCC GGGCCGCGCC GGTGCGGCGA CCGCGGTGAA CCTGGCCGCC TCCCTGGCCC GCACGGGCTC GGAGACGCTG CTGGTGTGCG CCGACCCCCG CACCGACGCG GTCGGCGAGC TGCTGGGGCT GCCGGAGGGC CCCGGCCTGG CCGAGGCGCT GCTGGAGGGT GAGGACCCCG CCGACCTGGA GGTGCGGCCC GACGCGGTGC CCCGTCTGCG GGTGCTGCGC TACGGGTCGC CGGGTATGGA CGCGCCGGTG CAGGGCACCG CCATGCCGGA ACTGGTGCGG CTGCTGCGGG CGGGGGCGGA GTACGTGGTG GTGGCGGTGG CGCCGGTGAG CGAGCGGGCC GACGCGCACG CGCTGGCCGG TTCGGCCGAC CTGATGCTGC CGGTGGTCGA ACTGGACCGC ACGCGCCGCG CCGAACTGGG GGAGCTGCTC GTCCTCGCCG ACCGGTTCGG GGTTCCCGTT CCGGGCACGG CCGTGCTGCC GCGCCAGCCG CTGGCCGGAC CCGCGCCGGT GGCGTCGCCG TCCGCGGCCG AGCCGGTCGC CGGGGAACAG ACCACCGGGC CCGACCGCAC GCCCGGAAGG GACGGAGCGG CCGGGGCGGA CGAGGCGGCC GGGGCCGCGA AGGGCCGCGC CGGTGCGGGG ATCACGCTGA CCGGCATCGT CAGCGAGCTG CCCGCCACCG CCGGGACCGC GGGGACGCGG GGGCGCGGCG GGGCGGCCCG GCCCCCGGCT CCCAGGGACG CCGAGGGTGT CCCCGGGGTT CCCGGTCCCC GGCGGGCGGA GGCCGCGGAG GAGCCCGAGG CCGCGGAGAG CGGGAGGAGC ACGGCCGACG GTGCCGCCGG CACGCCTTCG CGGGCTCGCG ACGGGGACGC GGAGTCCGCG GTCCCGGGCG GCGGCGAGCG CGCCGGTGAC GAGCGCGCCG GAGGCGACGA CACCGCCCAG GAGCGCTCCG GGGTGTCCGA AGCCGAGACG GCCGAGCGGA TCGCGGAGGC GGCCGGGGCG GACGACGGCA CCGCGCAGGA CGCGGCGGCC CCCCGCGACG GCGGGATACC CGCGGGCCTG GAGGTCCCCC GCGTGCTGGG AGGCGACGGC GCCACCCAGA TCCCGGTGAC GCCCGAGGCC GCCGGGACCG GCCGGACCGA GGAGGACGCC GCGGCCCCGG GTACGGCCTC GGAGGCGGAG ATGGCCTTCG GGCTGGCCCG GACCGCCGAG GCGCGGGAGC CCCACGACCC CGAGGCGACC GTCAGCGGCG CCGAGGCCAC GGAGGCCCTC GTGGCCTTCG CGGCCGAGGG GGCCGAGGGG GCCGAGGAGT CCCGGCCGAC CGGAGCCGCC GGGGACTCCG ACTCCCCGGA CAGCGCCCCT GACACCGCCC CGGACACGGC CCCGGGGACA CGGAACTGA
|
Protein sequence | MDTDAPGPSG PELREYTALL RRRWRFVGAG VLGGLALASA AVVALPAAYT SVSAVQVQPS GMAEFTGERS GRLAGDVNLD TEAQVLLSER VSSAVAEALA EEGGAAPSVA DLRERVDVSV PPNSSVLEIS YSAGSPEAAR AGAQAYADAY LELRRERIDG LIESHLEALR GEQEARYEAL AETAGESAAP GADARVEALR AEITELGNGI SPLSALAETV EPGSVITPAG LPERASSPIP ALWLVGGAAL GLLTGLLAAV VRDRLDPRLH DAEETGRIGA VPVLLDLSER VGRGQRSPGL LPDGDRRGQR VNEFAHLVRA RLAAVPVPAA AGGPSEVAGP TGAGGTGVSD RAGAGAGGAR GEGELAVLLG DEEAVLGRVL LVTATTPGRA GAATAVNLAA SLARTGSETL LVCADPRTDA VGELLGLPEG PGLAEALLEG EDPADLEVRP DAVPRLRVLR YGSPGMDAPV QGTAMPELVR LLRAGAEYVV VAVAPVSERA DAHALAGSAD LMLPVVELDR TRRAELGELL VLADRFGVPV PGTAVLPRQP LAGPAPVASP SAAEPVAGEQ TTGPDRTPGR DGAAGADEAA GAAKGRAGAG ITLTGIVSEL PATAGTAGTR GRGGAARPPA PRDAEGVPGV PGPRRAEAAE EPEAAESGRS TADGAAGTPS RARDGDAESA VPGGGERAGD ERAGGDDTAQ ERSGVSEAET AERIAEAAGA DDGTAQDAAA PRDGGIPAGL EVPRVLGGDG ATQIPVTPEA AGTGRTEEDA AAPGTASEAE MAFGLARTAE AREPHDPEAT VSGAEATEAL VAFAAEGAEG AEESRPTGAA GDSDSPDSAP DTAPDTAPGT RN
|
| |