Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1571 |
Symbol | |
ID | 9245421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1922837 |
End bp | 1923853 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | glycosyl transferase family 9 |
Protein accession | YP_003679506 |
Protein GI | 297560532 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0153093 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGGG GAACCGTGAT CGTGGCGCGC CTGGACAGCA TGGGGGACGT CCTGCTGTCC GGACCGGCCG TGCGCGCCGT CGCCCACGGA GCCGACCGGG TCGTCTACCT CGCCGGACCC CGTGGCGCGG AGACCGCCGC GACGCTGCCG GGTGTGGACG GCGTCCTGAC CTGGTGCGCG CCCTGGATCG CCGCCGACCC GCCGCCGGTG GACGCCGCCG ACGTCACCCG CCTCGTGGAA CGCCTGTCCG GCCTCGGCGC GGACGCCGCC GTGATCCTGA CCTCCTTCCA CCAGTCGCCG CTGCCGCTGG CCCTGCTCCT GCGCATGGCC GGGGTGCCCC GCGTCTCGGC CGTCAGCGAG GACTACCCGG GCAGCCTGCT CGACGTGCGC CACCACGTGG CCCACGCGAT CCCCGAGGCC GAGCGGATGC TGTCCCTGGC ACGCGCGGCC GGGTACCCGC CGCCCCCCGG CGACGACGGA CGCCTGGCCG TGCGCCGCCC GCTGCCCGAC ACCCGGGAGC TGACCGGGCC CGACGGGTAC GCGGTGGTCC ACGTCGGCGC GGACGCCCCC TCCCGGGAGC TGCCGCCCGC CCTGGCCGCC AAGACCGTCG CCGCGCTGGC CGAACGCGGC CACCGGGTCC TGGTCACCGG CACCGCCGGG GAGGCCGAGA TGGCCCGCGA GGTCGCCGCG CACGGCGCGG CGGACCTGGC CGGGCGCACC ACCGTCACCG AACTGGCCGA CGTGCTCGAC CGCGCCGCCG TCCTGGTCTC GGGCAACACC GGCCCCGCCC ACCTGGCCGC CGCCGTCGGC ACACCCGTGG TCTCGCTCTT CTCCCCGGTC GTGCCCGCCT CCGCGTGGGC GCCCCACGGC ACCGCGGTCC GCGTCCTGGG TGACCAGCTC GCGCCCTGCG CCGACACCCG GGCCCGGGTC TGCCCCGTAC CCGGCCATCC CTGCCTGAAC TCCGTCTCCT CCGACGACGT CCTGACCGCC GTCGACGAAC TCCTGGGGGC ACGATGA
|
Protein sequence | MSGGTVIVAR LDSMGDVLLS GPAVRAVAHG ADRVVYLAGP RGAETAATLP GVDGVLTWCA PWIAADPPPV DAADVTRLVE RLSGLGADAA VILTSFHQSP LPLALLLRMA GVPRVSAVSE DYPGSLLDVR HHVAHAIPEA ERMLSLARAA GYPPPPGDDG RLAVRRPLPD TRELTGPDGY AVVHVGADAP SRELPPALAA KTVAALAERG HRVLVTGTAG EAEMAREVAA HGAADLAGRT TVTELADVLD RAAVLVSGNT GPAHLAAAVG TPVVSLFSPV VPASAWAPHG TAVRVLGDQL APCADTRARV CPVPGHPCLN SVSSDDVLTA VDELLGAR
|
| |