Gene Information |
Locus tag | Ndas_1572 |
Symbol | |
ID | 9245422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1923850 |
End bp | 1924902 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003679507 |
Protein GI | 297560533 |
COG category | |
COG ID | |
TIGRFAM ID | |
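
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions (the record does not state the convention) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1923850
END_BP = 1924902


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1053 350
```

Under these assumptions the result agrees with the Gene Length (1053 bp) and Protein Length (350 aa) fields.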
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.999019 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0163981 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCC ACCACCCGAC GCCCACGATC CCGGCGCACG CACCCCGGGG CATGGCGCAG GGCCGGGCCC GGACGCCCGC GCGCGTGCTG CGGATCCTGC TCTGGCACGT CCACGGCTCG TGGACGACCG CGTTCGTCCA CGGCGGGCAC ACCTGCCTGC TGCCGGTGAC CCCGGACCGG GGGCCCGACG GGCGGGGCCG CGCCCGCACC TGGGACTGGC CCGCCAACGC CGAGGAGCTG CCCCTTGGGG CGGTGCGCGA CAGCGAACCC GACCTGGTGG TCCTCCAGCG ACCGCACGAG ATCGCCCTGG CCGAGGAGCT CCTGGGCCGC GTGCCGGGCC GGGACGTGCC CGCCGTCTAC GTCGAGCACA ACACCCCGCG CCGGGACGTG CCGGTCACCC GGCACCCCGT CGCCGACCGC GACGACATCC CCGTCGTCCA CGTCACCCAC TTCAACGACC TGTTCTGGGA CTGCGGGCGC GCGCCCACCC GCGTGGTCGA ACACGGCGTG CCCGACCCCG GGTACCGCTA CACCGGCGAG GTCCTCCGCA CGGGCGTGGT GCTCAACGAG CCCCTGCGCC GCTGGCGCTT CACCGGCACC GACCTGCTGC CCGCGCTCGC CGGGAGCGTG CCCCTGGACC TGTTCGGCAT GGGCGTGGCC GGGATCTCCG AGCACCTCGG GCTGGCGCCC TCGCGGCTGC GCGCCCACGA GGACCTGCCC CAGGACGCCA TGCACGACGA ACTGGCCCGC CGCCGCGCCT ACGCCCACCC GCTGCGGTGG ACCTCGCTCG GCCTGTCCCT CATCGAGGCG ATGATGCTGG GCCTGCCCGT GGCCGCCCTG GGCACGACCG AGGCCTACGA GGCCGTACCG CCCGAGGCGG GCACCGTCTC GACCGACCCC CGAGTACTCG CGGAGGCGCT GCGCGCGTTC CATGAGGATC GCGACCTCGC CCTTCGCACC GGCAAGGCCG CCCGTGCGGC GGCACTGCGC CGCTACGGGC TCACCAGGTT CCTCGACGAC TGGGACCGGG TGCTACAGGA GGTGACGTCA TGA
|
Protein sequence | MTIHHPTPTI PAHAPRGMAQ GRARTPARVL RILLWHVHGS WTTAFVHGGH TCLLPVTPDR GPDGRGRART WDWPANAEEL PLGAVRDSEP DLVVLQRPHE IALAEELLGR VPGRDVPAVY VEHNTPRRDV PVTRHPVADR DDIPVVHVTH FNDLFWDCGR APTRVVEHGV PDPGYRYTGE VLRTGVVLNE PLRRWRFTGT DLLPALAGSV PLDLFGMGVA GISEHLGLAP SRLRAHEDLP QDAMHDELAR RRAYAHPLRW TSLGLSLIEA MMLGLPVAAL GTTEAYEAVP PEAGTVSTDP RVLAEALRAF HEDRDLALRT GKAARAAALR RYGLTRFLDD WDRVLQEVTS
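
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative sketch rather than the method the database used. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling over TCAG).
# Table 11 shares these assignments with the standard code (table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG) are not remapped to M in this sketch.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGACCATCC ACCACCCGAC GCCCACGATC"))  # MTIHHPTPTI
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of this record.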