Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3707 |
Symbol | |
ID | 9247576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4450272 |
End bp | 4451678 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003681611 |
Protein GI | 297562637 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.271047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCTTCC AGAAGGTCCA CCGGATCCGC CAGGAGGCGG AACCGGGCCT GCCCGCGGTC ACCGAGGCCC CGGCCGCCGT CATCCCCGTC CAGGTCCCGA CCGGCACGGG CGCACCCGCC GCCCCGGACG TCCCCGCCGA CCTCGGGCCG CGCCTGGACG GGGTCCGCCT GGTCATGGTC AACTGGCGCG ACCCCTGGCA GTCCACCGCG GGGGGAGCCG AGGAGTACTC CTGGCGGATC AGCCGCCACC TGGCCGAACG CGGTGCCATC GTCACCTTCC TCACCAGCCG CGAGCCCCAC CAGGCGCGCG TGGAGACCAG GGACGGGATC GTCATCCGCC GCATGGGCGG CAGGTTCACC GTCTACCCGC GCGTCATGGC GTGGCTGGCC CTGTGGCGCC GCGAGTACCA GCTCGCCTTC GACTGCATGA ACGGCGTCCC CTTCCTGTGC CGCCTCGTCC TGCGCCGCAG CACGCGGGTG GTCAGCGTCG TCCACCACGT CCACGACCTC CAGTTCAACG CCTACTTCCC GGCGCCCGTC GCCTGGCTGG GCCGCACGCT GGAGTCGGTC GTGGCCTCGC GCGTCTACCG CCGCTGCACC ACCGTCACCG TCTCGGAGTC CTCCCGCCGG GCCATGCGCG AGAAGCTGGG CTGGCGCGCG CCGATCGAGA TCATCCACAA CGGCGGACTC CCCGGGCCCC AGAAGCCCCT CGACGACGCG CCCGCCCCCG CCGACATGGG CCACCCGGCC GTGGTCAGCC TGGGCCGCCT GGTCGTCCAG AAGCGGGTCT CGCGGGTGGT CGACCTCGCC CGCGCGCTGC GCGAGGAGCA CCCCGACCTG AAGGTGCACA TCATCGGCCG CGGCCCCGAG GGCGAACCCC TGGCGGAGCA GGTCGCCCGT GACGGCACGG GCGACCGCGT GCGCCTGCAC GGCTTCCTGC CCGAGGAGGA CAAGAACAGC GTCCTGGCCT CGTGCCACCT CCATGTCACC GCCTCCGAGT TCGAGGGCTG GGGCCTGACC GTCATCGAGG CGGCCCGCCT CGGCGTGCCC ACCGTGGCCT ACGATGTGGA CGGACTCCGC GACTCGGTCC GCGACGGCGA GACCGGGTGG CTCGTGCGCG AGGGCGAGGA ACTCGCCGAC GTGGTCGCGC GCGCCCTGGA GGAGCTGTCC GACCCCCGCC GGGCCGAAGC CGTCCGCCGC GCCTGCCGCG CGTGGGCGTC CCGGTTCACG TGGGAGGCCA GCGGCGCGCG GATGACCCGG CTCGTCGCGA GGGAGCTGGG CCTGCCCGGC GCCGCGGAAC CGGACGCCCC CGCCACCGAC GCCCCCGTCC CCGACCCCCT CACCTCCGAC GACACGGGCG CCCCCGCCGC CCGGAACACG GCAGCAGGCA GGAAAGCGAA GACGTGA
|
Protein sequence | MVFQKVHRIR QEAEPGLPAV TEAPAAVIPV QVPTGTGAPA APDVPADLGP RLDGVRLVMV NWRDPWQSTA GGAEEYSWRI SRHLAERGAI VTFLTSREPH QARVETRDGI VIRRMGGRFT VYPRVMAWLA LWRREYQLAF DCMNGVPFLC RLVLRRSTRV VSVVHHVHDL QFNAYFPAPV AWLGRTLESV VASRVYRRCT TVTVSESSRR AMREKLGWRA PIEIIHNGGL PGPQKPLDDA PAPADMGHPA VVSLGRLVVQ KRVSRVVDLA RALREEHPDL KVHIIGRGPE GEPLAEQVAR DGTGDRVRLH GFLPEEDKNS VLASCHLHVT ASEFEGWGLT VIEAARLGVP TVAYDVDGLR DSVRDGETGW LVREGEELAD VVARALEELS DPRRAEAVRR ACRAWASRFT WEASGARMTR LVARELGLPG AAEPDAPATD APVPDPLTSD DTGAPAARNT AAGRKAKT
|
| |