Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4846 |
Symbol | |
ID | 9248732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5742810 |
End bp | 5743901 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | inositol 1-phosphate synthase |
Protein accession | YP_003682735 |
Protein GI | 297563761 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTCGG TACGTGTAGC CGTCGTGGGC GTCGGCAACT GTGCGGCGTC GCTCGTCCAG GGCGTGCACT ACTACAAGGA CGCCAACCCC GAGTCCCGGG TACCAGGCCT GATGCACGTG CAGTTCGGCC CGTACCACGT GCGCGACATC GAGTTCGTCG CAGCCTTCGA CGTGGACGCC AAGAAGGTCG GCCACGACCT CGCCGACGCC ATCACGGCCA GCGAGAACAA CACCGTCAAG ATCTGCGACG TGCCGCCGAC CGGTGTCACC GTCATGCGCG GACCGACCTA CGACGGGCTC GGCAAGTACT ACCGCGAGGT CATCCAGGAG TCCCCCGAGG ACGCGGTGGA CGTGGTCGCG GCGCTCAAGG CCAGCAAGGC CGACGTGCTC GTGTCCTACC TCCCGGTGGG CTCGGAGGAG GCGGGCAAGT TCTACGCCCA GTGCGCGATC GACGCGGGCG TGGCCTTCGT CAACGCCCTG CCGGTGTTCA TCGCCTCCGA CCCCGAGTGG GCCGAGAAGT TCACCAGAGC GGGTGTGCCG ATCATCGGCG ACGACATCAA GTCGCAGATC GGCGCGACCA TCACCCACCG CGTGCTGTCC AAGCTGTTCG AGGACCGCGG CGTGATCGTG GACCGCACGT ACCAGCTCAA CTTCGGCGGC AACATGGACT TCAAGAACAT GTTGGAGCGC GACCGCCTGG AGTCCAAGAA GATCTCCAAG ACCCAGTCCG TCACCTCCCA GATCCCGCAC GAGCTGAAGG CAGGCTCGGT GCACATCGGC CCGTCGGACC ACGTGCCGTG GCTGGACGAC CGCAAGTGGG CCTACATCCG CCTTGAGGGG CGCGCGTTCG GCGACGTGCC GCTGAACCTG GAGTACAAGC TGGAGGTCTG GGACTCCCCC AACTCCGCGG GCATCATCAT CGACGCGGTC CGCGCCGCCA AGATCGCCAA GGACCGCGGC ATGGGCGGCC CGATCCTGGC CCCGTCCTCC TACTTCATGA AGTCCCCGCC CGAGCAGTAC AGCGACGCCG AGGCGCACGA GAAGGTCGAG CAGTTCATCG CCTCGGGCCA GCACGACGGC GCCGACGAGT AG
|
Protein sequence | MGSVRVAVVG VGNCAASLVQ GVHYYKDANP ESRVPGLMHV QFGPYHVRDI EFVAAFDVDA KKVGHDLADA ITASENNTVK ICDVPPTGVT VMRGPTYDGL GKYYREVIQE SPEDAVDVVA ALKASKADVL VSYLPVGSEE AGKFYAQCAI DAGVAFVNAL PVFIASDPEW AEKFTRAGVP IIGDDIKSQI GATITHRVLS KLFEDRGVIV DRTYQLNFGG NMDFKNMLER DRLESKKISK TQSVTSQIPH ELKAGSVHIG PSDHVPWLDD RKWAYIRLEG RAFGDVPLNL EYKLEVWDSP NSAGIIIDAV RAAKIAKDRG MGGPILAPSS YFMKSPPEQY SDAEAHEKVE QFIASGQHDG ADE
|
| |