Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3875 |
Symbol | |
ID | 9247746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4645973 |
End bp | 4646968 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003681778 |
Protein GI | 297562804 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCTGGC CGCCTGTCTC CGTCGTCATG CCCGTACTCA ACGAAGAGCG CCACCTCGCC GCCGCGGTGG AGCACGTCCT CGCCCAGGAC TACCCGGGCG ATCTGGAGGT CGTGCTCGGG GTCGGGCCCT CCTCGGACCG CACCCGCGAG GTCGCCGACG GGATCGCCGC CGCCGACCCG CGCGTGAAGG TGGTGGACAA CCCGACGGGA AGGACCCCCT CCGGGCTCAA CGCCGCCATC GGGGCCTCCT CGCACGACAT CGTCGCCCGC ATCGACGGGC ACGCCATGAT GCCCTCGGAC TACCTGCGGG TGGCCGTGGA GACGCTCCGC GAGACCGGCG CCGACAACGT CGGCGGGATC ATGGCCGCCG AGGGCACGAC CCCCTGGGAG AAGGCGGTCG CCGCCGCCAT GACCTCCAAG GTGGGCGTGG GCAACGCGCG CTTCCACACC GGCGGCGAGG GCGGTCCGGC CGACACCGTG TACCTGGGCG TCTTCCGGCG CTCGGCGCTG GAGCGGGTCG GCGGCTACGA CGAGGCCTTC CTGCGGGCCC AGGACTGGGA GATGAACCAC CGCATCCGCA CCACCGGCGG CACGGTGTGG TTCCAGCCGC GCATGCGGGT CTCCTACCGC CCCCGGCGCA ACGTGCGCCT GCTCGCCAAG CAGTACTTCC ACTACGGCCG CTGGCGCAGG GTGGTCTCCC GCCAGCACAA GGGCACCATC AACCTGCGCT ACCTGGCCCC GCCCGTCGCC CTGGCGGGTG TGGTCGCCGG GCTGGTCGGC GGCTTCTTCC TCTGGCCGCT GTTCCTGGTC CCGGCCGCCT ACCTGGTCCT GGTCACCGCC GCGTCGGTGC CGCTCGGCCG CGGCCTTCCC GCCGCCAGCC TGCCGATGAT CCCGCTCGCG CTGGCCACCA TGCACATGTC GTGGGGCGCC GGGTTCATCA CCAGCCCGCC CGGACTCGGC TCGGACTCGC GCACGGCGCA GGCCGCCCGG GCCTGA
|
Protein sequence | MSWPPVSVVM PVLNEERHLA AAVEHVLAQD YPGDLEVVLG VGPSSDRTRE VADGIAAADP RVKVVDNPTG RTPSGLNAAI GASSHDIVAR IDGHAMMPSD YLRVAVETLR ETGADNVGGI MAAEGTTPWE KAVAAAMTSK VGVGNARFHT GGEGGPADTV YLGVFRRSAL ERVGGYDEAF LRAQDWEMNH RIRTTGGTVW FQPRMRVSYR PRRNVRLLAK QYFHYGRWRR VVSRQHKGTI NLRYLAPPVA LAGVVAGLVG GFFLWPLFLV PAAYLVLVTA ASVPLGRGLP AASLPMIPLA LATMHMSWGA GFITSPPGLG SDSRTAQAAR A
|
| |