Gene Information |
Locus tag | Ndas_1696 |
Symbol | |
ID | 9245546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2067747 |
End bp | 2068787 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | SMP-30/Gluconolactonase/LRE domain protein |
Protein accession | YP_003679631 |
Protein GI | 297560657 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
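The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    gene_length = span_length(2067747, 2068787)
    print(gene_length)                           # 1041
    print(expected_protein_length(gene_length))  # 346
```

Under these assumptions both values agree with the Gene Length (1041 bp) and Protein Length (346 aa) rows of the record.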
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.263478 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGC ACTGCCCCAG GCCGCGACGA ACGGTGGCCG CCCGCGCACT CGCCGCACTG GCCCTGACCG GCGCGGCGGC CGGATGCGCC GCCCCCGCCG CCACCGGCGA GGACGGCCAG GACGGCGGAA CCGAGCGCAC CGCCGAACTC CTGGTCCAGG TGACCTCCGT CCACGAGGAG ACGGGGATGA CCCTGCTCGA AGGCCCGACC TTCGACGCCG ACGGACGCCT GCTCGTCGTG GACGTCACCG CCCCCGCCGG GGAGCCCAAG GTGCTCCGGG TGGACACCGG AACCCGGGAG GTGACCCCGG TGTTCACCGA CGAGACCGGC GCCTACACCT CCGCGCAGTT CAGTCCGCAC GACGACCGGC TCTACCTGAC CGACTTCGCC GGTGGGAAGA TCGACAGCAT CACCCCCGAG GGCGAGGACC ACACCACGTT CTTCTCCGGG GAGGTCGACG GAGCGCCGAT GAACCCCGAC GACCTGGCCT TCGACGAGGC CGGGAACATG TACGTCAGCG ACTCGGCCGG GTTCGACGGT CCGGCGTGGG AGGCCCGGGG CAGGGTCGTG CGCGTCGACC GCGACACCGC GGAGGCGACC GTCCTGGCCG AGGAGCTGCC CGCGCCCAAC GGCATCTCGT TCACCGCGGA CTTCTCGGGG CTGTGGGTCG GCCAGTACGG CGCCAACCGC GTCGACCACT ACGCGCTGAA CGAGGACGGC ACCGAGGTGG AGACCTCCCA CGCCGCCCTG TACTTCGACG GAGGCACGAG CCGGATCGAC TCCATCGCGG TGGACGCCGA CGGCAACCTC TACCAGGCCG TCCACGGCCA GCCGCGCATC TTCGTGTACA GCCCGCTCGG TGAGCACCTG GCGACGGTCG GCGTCCCGGC CGACGCCGCC GAGGGGCTGT ACTCGGCCAC CAACGTGGCC ATCGCACCGG GGACGACCGA CGCCTACATG ACCGTCAGCG GGGACGACGG CGGGTTCGTC TACTCCTTCG ACGCGCTCGC CGAGGGGATC CGCCAGTCCA ACGGCGGCTG A
|
Protein sequence | MEQHCPRPRR TVAARALAAL ALTGAAAGCA APAATGEDGQ DGGTERTAEL LVQVTSVHEE TGMTLLEGPT FDADGRLLVV DVTAPAGEPK VLRVDTGTRE VTPVFTDETG AYTSAQFSPH DDRLYLTDFA GGKIDSITPE GEDHTTFFSG EVDGAPMNPD DLAFDEAGNM YVSDSAGFDG PAWEARGRVV RVDRDTAEAT VLAEELPAPN GISFTADFSG LWVGQYGANR VDHYALNEDG TEVETSHAAL YFDGGTSRID SIAVDADGNL YQAVHGQPRI FVYSPLGEHL ATVGVPADAA EGLYSATNVA IAPGTTDAYM TVSGDDGGFV YSFDALAEGI RQSNGG
|
| |
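The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch that strips the block separators, computes GC content, and translates a coding sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs only in the start codons it permits, and this gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between the 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(grouped: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean_sequence(grouped)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGGAACAGC ACTGCCCCAG GCCGCGACGA"))  # MEQHCPRPRR
```

The first 30 bases translate to MEQHCPRPRR, which matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content row.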