Gene Information |
Locus tag | Ndas_1320 |
Symbol | |
ID | 9245170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1625954 |
End bp | 1627648 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | undecaprenyl diphosphate synthase |
Protein accession | YP_003679260 |
Protein GI | 297560286 |
COG category | |
COG ID | |
TIGRFAM ID | |
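
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed 1695 bp), the gene and protein lengths can be cross-checked as follows. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1625954
END_BP = 1627648


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1695
print(protein_length(length))  # 564
```

Because the gene is not spliced, the 1695 bp span corresponds to 565 codons: 564 residues plus one stop codon.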
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.165638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGTT ACACGAGGAC GGGTCCCCCG CTGATCGCGG AGCGGGGCGC CGACACCGGC CTGGCGCGCG CCTTCGAGGT CTGCCGCAGG ATCCACACGG GTGCGGACCA CGTCTCCCCC CGGGTGGTCG ATCTGCTGCC CGCCCACAAG CGCCCCTACG CGCACGCACT GGTCGCCTTC GGGATATGGG CCGATCGGCT GGCCGACGAG GGGGAGGTCT CCGAGCGCGG GCCGGCCCTG GCCCGGTTCC GCGCCGAGAC CCTGGCCGCG CTGGCCGACG GACCCGGCGC GCCCGTCCGC CTGCCGCCGG TCCAGCGCGC CATGGCCCAC ACGGTGCGGG CGTGGGACAT GCCCGTACCG GTGCTGGAGG AGCTCCTCAC CACGCTCGAA CAGGACAGCC GCCGAACGCC CGACTTTCCC GGCTTCGCCG ACCTGCGCGG CTATCTGCGC GGTATGAGCG GCACCGTCGC GGAGCTGCTC GGCACCGTCT TGGAGCCGGT CCGGGAGGAC ACCCCGGAAC TCATGTCGCT GCTGGGCGAA GTCCTCCAGT ACATCGATAT CCTCACCGAC CTGCCCGAGG ACCTCGAACA GGGCCGCTGC TACCTGCCCC GCCAGGACCT GGAGCGGTTC GGCCTGGACG CGGACGGCCT GAACGGCGCG CTCGGCACGG ACGCCTGTCG GGAACTGATC GCCCTGCAGG TGCGACGGGC ACGCGGACTG CTGGACCGGG GGCAGGAGGT GGTCGATGCC GTGCACCCCT CCAGCCGCCC CTTCCTCGCC TCCTTGCTGG CCGGGCTGCG GACGGGCCTG GACGAGTGCG AGTACCTTCC GGCGAACCGG CCGGACGCTC CGCCGCGAAC AGCCGTTCCC GCCCGGTTGT CGCAAACCCG GGAGACACCC GCCGAAGTCC TGCCCGTCGA CTCCGTGCCG CGGCAACAGC GGTCGCCCGT CCCGTCCCCG GACTCCGAGG ACCCTCCCGC CGCGGTCCCC GAGCACGTGG CCGTGATCAT GGACGGTAAC CGGCGGTGGG CCTTAGCACT CGGACTGGCC GCGGTGGAAG GGCACATGGC CGGAGAGGAG GCGATGTACC GTCTGGTGGA CGCCGCCGGG GACCTGGGCA TCAAGTACGT GACCACCTTC GCCTTCTCCA CCGAGAACTG GTCGCGCTCT CCCGAAGAAG TGTCCTCCTT GTTCAAGGTG TTCGCCCGGC GCGTCACGGG GGTCACCGGG CGCCTGCACG CCCGGGGCGT CCGGATGCGC TGGTACGGCC GCCGCACCAG GATCGAGGCG GCGCTCCGCG AACGGCTGGA GTGGGCCGAG GAGCTGACGT CGGGCAACAG CGGGGTGACG TTCACGTTCT GCCTGGACTA CGGAGGCCGC CAGGAGATGG TGGACGCGGT CAAACACGCG GCGGCCGAGG CGTTCTCGGG GCGGCTGGAC CCGACGCGCA TGGCGGAGTC CGACCTCGCC GGGTACCTCT ACGATCCCAC CCTGCCCGAC GTGGACCTCC TGATCAGGAC CGCTGGCGAG CAGCGCACCA GCAACTTCCT GCCCTGGCAC ACCGCCTACG CCGAGATCGT CTTCGATGAC GCTCTCTGGC CCGACTTCGA CCGCTCCCAC CTGGTGCGTG CCGTGAACGC CTACGCGGAA CGGCGCAGGA GCTTCGGCGG CACCCTGAAC GAGAAGAGCG CCTGA
|
Protein sequence | MSGYTRTGPP LIAERGADTG LARAFEVCRR IHTGADHVSP RVVDLLPAHK RPYAHALVAF GIWADRLADE GEVSERGPAL ARFRAETLAA LADGPGAPVR LPPVQRAMAH TVRAWDMPVP VLEELLTTLE QDSRRTPDFP GFADLRGYLR GMSGTVAELL GTVLEPVRED TPELMSLLGE VLQYIDILTD LPEDLEQGRC YLPRQDLERF GLDADGLNGA LGTDACRELI ALQVRRARGL LDRGQEVVDA VHPSSRPFLA SLLAGLRTGL DECEYLPANR PDAPPRTAVP ARLSQTRETP AEVLPVDSVP RQQRSPVPSP DSEDPPAAVP EHVAVIMDGN RRWALALGLA AVEGHMAGEE AMYRLVDAAG DLGIKYVTTF AFSTENWSRS PEEVSSLFKV FARRVTGVTG RLHARGVRMR WYGRRTRIEA ALRERLEWAE ELTSGNSGVT FTFCLDYGGR QEMVDAVKHA AAEAFSGRLD PTRMAESDLA GYLYDPTLPD VDLLIRTAGE QRTSNFLPWH TAYAEIVFDD ALWPDFDRSH LVRAVNAYAE RRRSFGGTLN EKSA
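
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns codons to residues in the same way as the standard table. Alternative start codons, which table 11 permits, are not handled because this gene starts with ATG. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these codon assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence above.
print(translate("ATGAGCGGTT ACACGAGGAC"))  # MSGYTR
print(translate("GAGAAGAGCG CCTGA"))       # EKSA
```

The two fragments match the first six and last four residues of the protein sequence listed above. Passing the full gene sequence to `translate` should give the complete protein under the same assumptions.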