Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2129 |
Symbol | |
ID | 9245979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2548558 |
End bp | 2549565 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Exonuclease RNase T and DNA polymerase III |
Protein accession | YP_003680059 |
Protein GI | 297561085 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCGGA CCTGGACCGC GATCGACTTC GAGACCGCCA ACCACGACCG CGGCAGCGCG TGCGCGGTCG GCCTGGTGCG GGTGCGCGAC GGCGCGGTCG TGGACCGGTA CACCACGCTG ATCCGGCCGC CGAGGCAGGT GGACTTCTTC TCCCGCCACA ACATCGCCGT GCACGGCATC ACCGCGGCCG ACGTCGCCGA CGCCCCCTCC TGGGAACAGG CGCACGCGCG CATCGTGGAG TTCGCCGACG GCGGCCCCCT GGTGGCGCAC AACGCCGCCT TCGACATGGG GGTGCTGCGC CAGGCCTGCG GACACACCGG GCTGTCCCAC CCGGCGTGGG AGTACGCGTG CACGCTGGCG CTGTCCCGGC GCACCTGGAG CGGGCTGCCC GACCACAAGC TGCCGACCGT GTGCGCCCAC ATCGGGCACC GGGTGACGCA CCACCACCGA GCCGACGCCG ACGCCGAGGC CGCCGCGCGC ATCGTGATCG CCGCCATGGA GCGCTACGGC ACCCCGTCGC TGGCCGACCT GACCCGTGCC GCGCGGACGG ACCTGCGCCG GGTGGAGGCC TTGCCCGGGA GCGCGGTGCC CGCCCCGGTC CCGGCCGCCG CCGCGCCCGT CCAGCCCGCG CTGCTCGCCG CGTCCGCCGC GGCTGCCGCG CCCGAGGACC GGTTCGGACG GTGGCAGCGC GACGCCCGGA CGCCGCTGCC CGAGCCCTCC CCGGACGCCG ACCCCACCGG GCCGCTGTAC GGGCGCACCG TGTGCGTCTC CGGCGACCTG GAGTCCATGG ACAAGCCCGA GGTGTGGCGG CGCGTGGCCG AGGCGGGCGG GCGGCCCGCC AAGAACGTCA CCAGGAAGAC CGACATGCTC GTCCTCGGGG GCCACGGCGG CCCCGGAAAG ACCGCCAAGC ACCTGCGGGC CGAGATCTAC CGCGAGCGAG GCCAGCGCAT CGACCTGGTC ACCGAGGCCG AACTGCTGGC CCTGCTGGGG ATGACCACCC GCCCCTGA
|
Protein sequence | MSRTWTAIDF ETANHDRGSA CAVGLVRVRD GAVVDRYTTL IRPPRQVDFF SRHNIAVHGI TAADVADAPS WEQAHARIVE FADGGPLVAH NAAFDMGVLR QACGHTGLSH PAWEYACTLA LSRRTWSGLP DHKLPTVCAH IGHRVTHHHR ADADAEAAAR IVIAAMERYG TPSLADLTRA ARTDLRRVEA LPGSAVPAPV PAAAAPVQPA LLAASAAAAA PEDRFGRWQR DARTPLPEPS PDADPTGPLY GRTVCVSGDL ESMDKPEVWR RVAEAGGRPA KNVTRKTDML VLGGHGGPGK TAKHLRAEIY RERGQRIDLV TEAELLALLG MTTRP
|
| |