Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2324 |
Symbol | |
ID | 9246174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2774355 |
End bp | 2775515 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Inositol-3-phosphate synthase |
Protein accession | YP_003680252 |
Protein GI | 297561278 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0365735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGCGAG TCGGAGTATG GCTGATCGGG GCGCGGGGGT CGGTGGCGAC CACCGCCGTC CTGGGAGCCC TGGCGGTGCG GTCCGGGGCG TCGGGGCGGA CGGGGTGCGT GACCGAGCGC CCCGAGTTCG CCCCGGCGCG CCTGCCCGGG ATCGGCGACC TGGTCTTCGG CGGCCACGAC GTCTCCGAGA TACGCCTGTG CAAGAACGCC GAGCGGCTTG CCCGGGCGGG GGTGCTGCCC GCGCACCTGC CGCCGGCCTT CGCCGACGAA CTGGACCGGA CCGAGCACCG TCTGCGCCGG GGCCTGCACG CCGCCGCCAC CACCGGGGTG GGGCGCCGCG ACGTGGACCG CCTGGAGGCG GACCTGCGCG AGTTCGCCGA ACGGGAGGCG CTCGAACGCG TCGTGGTGGT GGACCTGTCC AGCACCGAGC CCCGGCCCGA ACCGCACGCC GAGCACACCG ACGCCGACGC CCTGGAGGCG GCCCTGGACG CCGGGACCGC GCGGATCCCG GCCAGCTCCG CGTACGCCTA CGCGGCCCTG CGGGCGGGCT GCGCGTTCGT GGAGTTCACG CCCAATACCG GGCCCCGGCT GCCCGCCCTG GCGCGGCTCG TCGAGCGCTC GGTCGTGCCG TGGGCCGGGT GCGACGGCAA GACCGGGGAG ACCCTGGTCA AGAGCGCGCT CGCGCCGATG TTCGCCGCCC GGGCGCTGCA CGTGCGCTCG TGGTCCTCGC TCAACCTGCT CGGCGGCGGC GACGGCGCCA CCCTGGCCGA CCCGGCCAAC GCCGAGAGCA AACTCGCGTC CAAGGCGCGG GGTCTGGAGC ACATGCTGGG CCACGGCCCG GACGGGCCCC TGCACATCGA CTACGTCCCG GACCTGGGCG ACGCCAAGGT CGCCTGGGAC CACGTCTCGT TCGAGGGGTT CCTGGGGGCG CGGATGACCC TGCAGTTCAC CTGGTCGGGC TACGACTCCG CGCTGGCCGC GCCCCTGGTG CTGGACCTGG CGCGGCTGAC CGCCCACGCC CACCGGCGGG GCCGGGTGGG CCCCGTGCCG GAGCTGGCGT TCTTCTTCAA GGACCCCGTG GGCACCCGCG AACACGGCCT CGCCGAACAG TGGCGGGCCC TGACGTCCTG GTGCGCGGCA CCGGAGGAGG AGGCCGAGTG A
|
Protein sequence | MTRVGVWLIG ARGSVATTAV LGALAVRSGA SGRTGCVTER PEFAPARLPG IGDLVFGGHD VSEIRLCKNA ERLARAGVLP AHLPPAFADE LDRTEHRLRR GLHAAATTGV GRRDVDRLEA DLREFAEREA LERVVVVDLS STEPRPEPHA EHTDADALEA ALDAGTARIP ASSAYAYAAL RAGCAFVEFT PNTGPRLPAL ARLVERSVVP WAGCDGKTGE TLVKSALAPM FAARALHVRS WSSLNLLGGG DGATLADPAN AESKLASKAR GLEHMLGHGP DGPLHIDYVP DLGDAKVAWD HVSFEGFLGA RMTLQFTWSG YDSALAAPLV LDLARLTAHA HRRGRVGPVP ELAFFFKDPV GTREHGLAEQ WRALTSWCAA PEEEAE
|
| |