Gene Information |
Locus tag | Ndas_3020 |
Symbol | |
ID | 9246873 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3607490 |
End bp | 3608572 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003680936 |
Protein GI | 297561962 |
COG category | |
COG ID | |
TIGRFAM ID | |
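The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3607490, 3608572)
print(length, protein_length(length))  # prints "1083 360"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.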
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.506245 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACGCCA GGATCGCGCT GGTCATCGGT ACCAGCAGTG GAGGGGTGGG CCGCCATGTG CGCTCGCTGG GCGCGGGCCT GGCCGCGCGC GGCCACCGGG TGGCGGTGCT GGGCCCGGCC TCCGCCGAGC GCGAGTTCGG CTTCACCTCG GAGGGCATGC GCTTCTCCCC GGTCGGCATC GGAGCGGCGC CCTCCGCGGG CGATCCCGGC GCGGTGCTGC GCCTGCGCGC CCTGACCCGG GGCGCGGACG TGGTCCACGC GCACGGGCTG CGGGCCGGGG CCCTGTGCGC GCTGGCCGGC GCCTCTCCGC TGGTGGTGAC CGCGCACAAC GCGCCGCCGC TGGTGCGGGG GGCGCTGTCG GCGGCCTACC CGGTGCTGGA GCGGATCGTG GCGCACCGGG CGGACGTGGT GCTGGGGGTG TCGGGCGACC TGGTGCGCAG GCTGCGCTCG GTCGGGGCGC GCGACGCGCG GCTGGCGGTG GTGGCCGCTC CCGAGACCGG CGCTCCGGTG AACGGGCGCG AGGCCACCCG GGCCGACCTG GCGGTGCTGC CGGAGCGGCC GCTGCTGCTG ACCGTCGCCC GCCTGGCCGA GCAGAAGGGG TTGGACATGC TCCTGGCGGC GGCGCCTGCC ATCGCCGACC GCCGCCCCGA ACCGGTGGTG GCGATCGCCG GGGACGGGCC CCTGTGGGGG CAGCTGCACG ACACGGCCGC GGAGATGCGC GCGGACGTGC GCATGCTGGG GCACCGCGCG GACGTGGCGG ACCTGCTGGC GGCGGCGGAC GTGTTCTGCC TGACCAGCCA GTGGGAGGGG CCCTCACTGG TGATCATGGA GGCGTTGCGC GCGGGGCTGC CGGTGGTCTC CACGCGGGTC GGCGGCATCC CGGACCTGTA CTCGGGGACG GTGCTGATGG TGCCGCCGGG GGATCCCGCG GCCTTCGCCG CCGCCGTGGG CCGGGTGCTG GACGACCCGG CGCTGGCCGA GGACCTGCGG GCGCGCTCGC GCGAGGCGGC CAAGGCGCTG CCGAGCGAGG AGGACGCGGT GGAGGCCGCC GCGGGCGTGT ACAAGACGGT GCTGCGGCGG TGA
|
Protein sequence | MNARIALVIG TSSGGVGRHV RSLGAGLAAR GHRVAVLGPA SAEREFGFTS EGMRFSPVGI GAAPSAGDPG AVLRLRALTR GADVVHAHGL RAGALCALAG ASPLVVTAHN APPLVRGALS AAYPVLERIV AHRADVVLGV SGDLVRRLRS VGARDARLAV VAAPETGAPV NGREATRADL AVLPERPLLL TVARLAEQKG LDMLLAAAPA IADRRPEPVV AIAGDGPLWG QLHDTAAEMR ADVRMLGHRA DVADLLAAAD VFCLTSQWEG PSLVIMEALR AGLPVVSTRV GGIPDLYSGT VLMVPPGDPA AFAAAVGRVL DDPALAEDLR ARSREAAKAL PSEEDAVEAA AGVYKTVLRR
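The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing and run basic sanity checks on a gene sequence: GC percentage, and whether it starts with a start codon, ends with a stop codon, and has a length divisible by three. The helper names are hypothetical, and the start-codon set is an assumed subset of the initiators allowed by translation table 11 (it includes TTG, the first codon of this gene).

```python
# Assumed subset of table 11 start codons; the full table allows more.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def looks_like_cds(seq: str) -> bool:
    """True if the sequence has whole codons, a start codon, and a stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in START_CODONS and s[-3:] in STOP_CODONS


# The first two blocks of the gene sequence, used here only as a short demo.
fragment = "TTGAACGCCA GGATCGCGCT"
print(len(clean(fragment)), gc_percent(fragment))  # prints "20 60.0"
```

Passing the full Gene sequence value to `gc_percent` and `looks_like_cds` would apply the same checks to the whole record; the result is not asserted here.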