Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2848 |
Symbol | |
ID | 9246699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3399176 |
End bp | 3401104 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | deoxyxylulose-5-phosphate synthase |
Protein accession | YP_003680765 |
Protein GI | 297561791 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCACTGT TCGAGTCGAT ACACAATCCG GAAGCGCTCA AGAGGCTTCC GGCCGACCAG CTCCCTGCGC TGGCACAGGA GATCCGGGAC TTCCTGGTCG ACTCCTGCTC CCAGACCGGC GGGCACCTGG GCCCCAGCCT CGGCGTCGTC GAGCTGACCA TCGCGCTGCA CCGGGTCTTC GACTCCCCGA AGGACCCCAT CATCTTCGAC ACCGGGCACC AGGCCTACGC GCACAAGGTG CTCACCGGCC GCCACGACTT CACCAACCTC AAGTCCGAGG GCGGCCTGTC CGGATACCCC TCCCGCGCAG AGTCCGAGCA CGACTTCGTC GAGAACTCCC ACGCCTCCAC CGCGCTGTCC TACGCCGACG GCATGGCCAA GGCCAACGAG GTGCGGGGCC GCACGGACCG CACCGTGGTC GCCGTCATCG GCGACGGCGC CCTCACCGGC GGCATGGCCT GGGAGGCGCT CAACAACATC GCCGAGCGCA GGGACCGGCG CCTGGTCATC GTGGTCAACG ACAACGGCCG CTCCTACTCG CCCACCATGG GCGGTCTGGC CGACCACCTC GCCTCCCTGC GCATGGCCCC CGGCTACGAG CAGGCCCTGG ACCTGGCCAA GACCACCCTC AACCGCACGC CCGTGGTCGG CCAGCCCCTC TACGAGGCGC TGCACGGCAT CAAGAAGGGC CTCAAGGACG CCATCCAGCC GCAGATGATG TTCGAGGACC TGGGCCTGAA GTACCTCGGT CCCATCGACG GCCACGACGA GCAGGCCCTG GAGAAGGCGC TGCGCCGCGC CCGCGACTTC GGCGGCCCGG CGATCGTCCA CTGCATCACC CAGAAGGGCA AGGGCTACGC CCCGGCCGAG AACCACGACG AGGACCAGTT CCACGCCCCC GGCCCCTTCG ACGTGGCCAC CGGCCAGGCC AAGCCCGGCC CCAAGGCCGT CAAGTGGACC TCGGTCTTCG CCGACACCAT GGTGGAGATC GGCGCCGAGC GCCCCGACGT GGTGGCCATC ACCGCCGCCA TGCTCCACCC CACCGGCCTC AACAAGTTCG CCGAGGCCTA CCCCGAGCGC ACCTTCGACG TGGGCATCGC CGAGCAGCAC GCCGCCACCG CGGCCACCGG CATGGCCATG AACGGCCTGC ACCCGGTCGT GGCCGTCTAC GCCACCTTCC TCAACCGCTG TTTCGACCAG GTGCTCATGG ACGCCGCCCT GCACCGCCAG GGCGTCACCT TCTGCCTGGA CCGCGCCGGG ATCACCGGCA ACGACGGCGC CAGCCACAAC GGCATGTGGG ACATGTCCAT CCTCCAGGTG GTCCCGGGCC TGCGCCTGGC CGCGCCCCGC GACGCCGAGC GCCTGCGCGC CCTGGTGCGC GAGGCCGTCG CCGTCGAGGA CGCGCCCACC GTCGTGCGCT ACCCCAAGGG CGCGGTCGCC GAGCAGCTGG ACGCCGTGGG CAGGGTCGGC TCCATGGACC TGCTCCGCCG CGCCGCCGAC GGCGGCAACT CCAAGGACCT GCTCATCGTG GGCGTGGGCA CCATGGCCCA GGTCGCCTGC GAGGTCGCCG ACCGGATGGC CGCCCAGGGG CTGGGCGTCA CCGTCATCGA CCCCCTGTGG GTCAAGCCGC TGGACGAGGC CCTGGTCGCC GAGGCCGCCG AGCACAGCGT CGTCGCGGTC ATCGAGGACA ACGGCCGCAC CGGCGCGGTC GGCGACGCCG TGGCCCGCCT GCTGCGCGAC CACGACGTGG ACATGCCGGT GCGCACGTTC GGCATCGCGC AGGAGTTCCT GGACCACGCC AAGCGCGACG ACATCCTGCG CGAGCAGGGC CTGACCCCCC AGGAGCTGTC CCGCAGCCTC ACCGAGACGG TCTCCCGGCA CACCGACGGC GCCAAGTCCG CCGGCGAACT GGCCGACGAG ACCGCCTGA
|
Protein sequence | MALFESIHNP EALKRLPADQ LPALAQEIRD FLVDSCSQTG GHLGPSLGVV ELTIALHRVF DSPKDPIIFD TGHQAYAHKV LTGRHDFTNL KSEGGLSGYP SRAESEHDFV ENSHASTALS YADGMAKANE VRGRTDRTVV AVIGDGALTG GMAWEALNNI AERRDRRLVI VVNDNGRSYS PTMGGLADHL ASLRMAPGYE QALDLAKTTL NRTPVVGQPL YEALHGIKKG LKDAIQPQMM FEDLGLKYLG PIDGHDEQAL EKALRRARDF GGPAIVHCIT QKGKGYAPAE NHDEDQFHAP GPFDVATGQA KPGPKAVKWT SVFADTMVEI GAERPDVVAI TAAMLHPTGL NKFAEAYPER TFDVGIAEQH AATAATGMAM NGLHPVVAVY ATFLNRCFDQ VLMDAALHRQ GVTFCLDRAG ITGNDGASHN GMWDMSILQV VPGLRLAAPR DAERLRALVR EAVAVEDAPT VVRYPKGAVA EQLDAVGRVG SMDLLRRAAD GGNSKDLLIV GVGTMAQVAC EVADRMAAQG LGVTVIDPLW VKPLDEALVA EAAEHSVVAV IEDNGRTGAV GDAVARLLRD HDVDMPVRTF GIAQEFLDHA KRDDILREQG LTPQELSRSL TETVSRHTDG AKSAGELADE TA
|
| |