Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3432 |
Symbol | |
ID | 9247299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4106617 |
End bp | 4109565 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | nicotinate-nucleotide/dimethylbenzimidazole phosphoribosyltransferase |
Protein accession | YP_003681343 |
Protein GI | 297562369 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.368499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.260394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAG ACGACCGCGA GGACGCCCGG CGCGGCGAGG AGGGCGGAGC CGAGCCGTCC CGCTGGAAGG GCGGCAGACC CGCCTCGGGA ACCGGTCCCC GCCTCGGCGG CCTGTTCGAC GGGCTCCCCG AGAGCGGGCG CGGCGCCCGG GCCCAACCCC CGCGCGGCTC CTCGGGCACG GCCAACCCGT TCCGGCAGCG GCCCCGGCCC CCCGCGCCGG CCGACCCCGC GCCACGGGTC CCCTCCGCCC CGGTCTCGCC CTACGGCGCG CTTCCCCTGG ACATCCCCCC GGCCCCGCGA CCGGAGGCCG AGCCGACGGT CGAGCTCGCG GCGGACGCGC CGTCGCCGAC CGGGTCGCCG CACCGGCCCG ACGTCATCCC CTTCCGCGGC GGCGAGCGCC CCGCCGGGAG CGATGCGGCC AGCCCGGGAG CCGTGCCGAG TCCGGTTCCC GGGAGCCCCG AGAGCCAGGC GGGCCCGGAG TGGACCGTTC CCTCCGCCGA GTCCGAACAT CCGGCGGAGC CCAGCGGTCC GGCCGAGCCG GAGCCCCTGG CCGAGCCGGA GCCCTCGACG GCGCTCGCGG ACGCTGCCGC GCTCGCGAGG TCGGCGGTGC TCGCGAAGGC GGCGCGCCCA CAGGAGGCGG TGCCCGCGCA CGGCACGACG ACCGCCCGAG CCGCCGCACC CGCCCGCCCG GTCACGCCCG CACGTCCGGC CGTACCCGTC CGTCCAGCGG CGCCCGCCCG CCAGGGCGCC CCCGCCCAAC CGCCGGCGCC CCCGCGCCCC GCCGAACCCG TGCACACCGC GGCGTCCGCG CACCCCGCCG CGCCCCCGAA CACCGAGGTA CCCGCCCGCC CGGACACGCC CGCACCCCCT GTGGCTTCCG CCCGCCCGGC CACGTCCGCG CCCCCCACGG GGCCTGCCCG CCCGGCTGCT CCCACCCGTC CGGCGGTACC CGCACGCCCG GCTACGCACC CCGCCCGGAC CGTCCGCCCG CCCGGGACCA CCACCGACCC GGAGGAAGCG GGGGCCGGGG CCGCCGGAGC ACCGACCACA GACCACGGAC GCGCGGAACA GCACTCCGCG ACCACCGAGC CGGAGCAGAG CATGCACGCA CACGAACCAT CCGAGCCCCT GCCCGGGCCC GCCGGGCCTC CTGTCCAGAC CGCCCCCGAG CCCGCGGCCG CGTCCACCGC CCAGGCTCCC GAGCGGGCCG GCAACGGCTC CGTCCACGCC TACGGCGAGG CCGAGCGCGC CGCCGTCTAC CGGGCCATCC GCGAACGCCG CGACGTCCGG ACGGGCTTTC GGCCCGACCC CGTGCCCCAC GACGTGCTCA TCCGCGTCCT GGAGGCCGCC CACCAGGCCC CAAGCGTCGG CGACTCCCAG CCCTGGGACT TCCTGGTCAT CGAGGACCCC GGCCTGCGCG CCCGCGTGGG CGACCTCGCC GCGGCCGAGC GCGAGGACCA CGCCCACGCA CCGCCGGGCG TCCGCGCCCG CGCCTTCGCC GGACTCAAGG CCGAGGCGGT CCTCGACGCG CCCCTCAACA TCGCCGTCAC CGTCGACCCC ACACGGGGAG GACGGCACGC CCGGGGCCGC CACGCCCGCC CCCTGAGCGC CGACCACGCC GCGGCGCTCG CCGTGGAGAA CCTCTGGATC GCCGCGCGCG CCGAGGGGCT GGGCGTCGGC TGGCTCACCT TCGTCGACGA GCGCGACGTC GCCCGCGCCC TCGAACTCCC CGCCCACCTG GACGTGGCCG CCTACCTGTG CGTCGGCTAC GTGGAGGAGT TCCCCGCCGA GTCCGAGCTC AGCCTCTCCG GCTGGGCCAG GGAGCGCCCC CTGTCCTGGG CGGTGCACCA CGACCGCTAC GGCCGCCGCG GTCTGCCCGG CCGGGAACCC ACGAGCCTGC TGGAGGAGAC CATCACCGCC GTCGGAGCGC TGGACACCCG CGCGATGGAG GAGGCCAGGG ACCGCCAGGA CCGGATGACC AAGCCGCCCG GCTCCCTCGG CTTCCTGGAG GAGGTCTCGG TCCAGCTGGC CGGGATCTCC GGGCAGTGCC CGCCGCCGAT CCCCGACCCG GCCGCCGTCG CCGTGTTCGC GGGCGACCAC GGCGTGCACG CCCAGGGGGT GACCCACTGG CCCCAGGAGG TCACCGCCCA GATGGTGCAC AACTTCCTGG AGGGCGGCGC CGTCGTCAAC GCCTTCGCCG CCCAGGTCGG CGCCGAGGTC ACCGTCGTGG ACGTCGGCGT GGCCGCGGAC CTGCCCCGTG CCCCCGGCCT GCTGGCCCGC AAGGTCGCGC GCGGCACCGC CGACCTCACC CAGGGGCCAG CGCTCACCCG CGAGCAGACC CTCCAGGCCC TGGAGTGCGG CATCGAGGTC GCCCGCGACC TGGTCTCCGC GGGCAACCGC TGCCTGGTCA CCGGCGACAT GGGCATCGCC AACACCACCC CGGCCGCCGC CCTGGTGTGC GCCTTCACCG GAGCCGACCC CGCGCACGCC ACGGGACGGG GCACCGGTGT GGACGACGCC GTGTACGCCC ACAAGGTCGA CGTGGTGCGC CGTGCCCTGG CCGAGCACCC CGTCGACCCG GCCGACCCCA TCGGCACCCT GGCCGCCCTG GGCGGCCTGG AGCACGCCGC CCTGGCGGGG TTCGTGCTCG GCGGCGCGGC CCTGCGCGTC CCGGTGCTGC TGGACGGGGT CATCGCCGGA TCGGCCGCCC TGGCCGCCGC CGCGATCTCC CCGGAGGCCC TCAGCGCCTG CTTCGCCGGG CACCGCTCCA GCGAGCCCGG GCACAGCCTG GCCCTGGAGC ACCTGGGCCT GCGCCCCCTG GTCGACCTGG AGATGCGCCT GGGCGAGGGC TCCGGAGCGC TGCTGGCACT GCCGCTGCTC CAGAGCGCGG CGCGTGTCCT GCACGACGTC GCCACCTTCG ACGACGCGGG CGTCTCCACC GCCCCCTGA
|
Protein sequence | MTRDDREDAR RGEEGGAEPS RWKGGRPASG TGPRLGGLFD GLPESGRGAR AQPPRGSSGT ANPFRQRPRP PAPADPAPRV PSAPVSPYGA LPLDIPPAPR PEAEPTVELA ADAPSPTGSP HRPDVIPFRG GERPAGSDAA SPGAVPSPVP GSPESQAGPE WTVPSAESEH PAEPSGPAEP EPLAEPEPST ALADAAALAR SAVLAKAARP QEAVPAHGTT TARAAAPARP VTPARPAVPV RPAAPARQGA PAQPPAPPRP AEPVHTAASA HPAAPPNTEV PARPDTPAPP VASARPATSA PPTGPARPAA PTRPAVPARP ATHPARTVRP PGTTTDPEEA GAGAAGAPTT DHGRAEQHSA TTEPEQSMHA HEPSEPLPGP AGPPVQTAPE PAAASTAQAP ERAGNGSVHA YGEAERAAVY RAIRERRDVR TGFRPDPVPH DVLIRVLEAA HQAPSVGDSQ PWDFLVIEDP GLRARVGDLA AAEREDHAHA PPGVRARAFA GLKAEAVLDA PLNIAVTVDP TRGGRHARGR HARPLSADHA AALAVENLWI AARAEGLGVG WLTFVDERDV ARALELPAHL DVAAYLCVGY VEEFPAESEL SLSGWARERP LSWAVHHDRY GRRGLPGREP TSLLEETITA VGALDTRAME EARDRQDRMT KPPGSLGFLE EVSVQLAGIS GQCPPPIPDP AAVAVFAGDH GVHAQGVTHW PQEVTAQMVH NFLEGGAVVN AFAAQVGAEV TVVDVGVAAD LPRAPGLLAR KVARGTADLT QGPALTREQT LQALECGIEV ARDLVSAGNR CLVTGDMGIA NTTPAAALVC AFTGADPAHA TGRGTGVDDA VYAHKVDVVR RALAEHPVDP ADPIGTLAAL GGLEHAALAG FVLGGAALRV PVLLDGVIAG SAALAAAAIS PEALSACFAG HRSSEPGHSL ALEHLGLRPL VDLEMRLGEG SGALLALPLL QSAARVLHDV ATFDDAGVST AP
|
| |