Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5533 |
Symbol | |
ID | 9249436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 724875 |
End bp | 726002 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF58 |
Protein accession | YP_003683418 |
Protein GI | 297564445 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0610248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACGA CACGGGGATG GCTGGTCGCC GGCTCGGGCG TGCTCCTGCT CGCGGTCGGC GTCCTGTCCC AGTACCAGGA GATCGCGCTG CTCGGCGGTG TCGCCGTGAC CGTGGTCGCC GTGGCGGTGC TGCTGGTGGG GCGCCCCGCC GGGGTGCGCG TCCAGCGCTC GGCCTCCACC ACCCGGACCT CTCCCGGCAC CACCGTGCGC GTGCGGGTCG AGGCGAGCAA CACCGGACGG CGCTCCGTGC AGGTCAGTGA GCGCGTCCTC GGCTCCGACG GCGAGCGCGC GGTGCCGCTG CGCCCCCTGG CCGCACGCGC CACCGGCGGC TCCGACTACC GGATCGGGGC CCTGCGCCGC GGGGTGGTCG AGCTGGGTCC CCTGCGGGCC GGGCGCTCCG ACCCGTTGGG ACTGGCCTCG CTGCACCGCG ACCACGGCGG TACCGAACGG GTCTGGGTGC ACCCCCGCTG GGAGCACCTG CGCGCCGTAC CGATCGGCCG GGTGGCCGAC CCCGACGGCG CGGCGGACGG CGCGCCCGCG GGCACCCTGA CCTTCCACGC CCTGCGCGAC TACGTGCCCG GCGACGACCT GCGCCACGTC CACTGGCGCA GCTCCGCGCG GCTGGACAGG CTCGTCGTGC GCGAGTACAT CGACACCTCC CAGACCCGGA TCTGCGTCAT CGTCGACGAC CGCCCCACAC CCGGCGGCGA GGCCCGCCTG GACGAGGTGG CCGGCGCGGC GGCCTCCATC GCGGCCACCG CCGTCCGCTC GTCCCTGCAC TGCGAACTGC GCCTGGCCAG CGGCAGGGGC AGGGAGAGCA CGGGCGGCCT GCCCCCGCTG CTCGACCTGC TCTCCGAGGC CCGGAGCACT CCGGGGGCAG ACCTGCACCG CGCCCTGCTC CTGGCCCGCA CCCGTCCCGC CGGTGACACC GCGGTCCTGG TCAGCGGCGC GCTCACCGCC GAGGACCTGC GGTCGTTCGG GCGGCTCGGC GACCGCTACG CGGGCCTGAT CGCCGTCGTC GTCGGATCGG AGGAGCACCC GACCGCGCCC CCGGACGTCA CCCTGCTCAC CGCCGGTGAC ACCGCCGGGT TCGCCGACCG ATGGAACGAG GCGCCGTGGT CACGCTGA
|
Protein sequence | MPTTRGWLVA GSGVLLLAVG VLSQYQEIAL LGGVAVTVVA VAVLLVGRPA GVRVQRSAST TRTSPGTTVR VRVEASNTGR RSVQVSERVL GSDGERAVPL RPLAARATGG SDYRIGALRR GVVELGPLRA GRSDPLGLAS LHRDHGGTER VWVHPRWEHL RAVPIGRVAD PDGAADGAPA GTLTFHALRD YVPGDDLRHV HWRSSARLDR LVVREYIDTS QTRICVIVDD RPTPGGEARL DEVAGAAASI AATAVRSSLH CELRLASGRG RESTGGLPPL LDLLSEARST PGADLHRALL LARTRPAGDT AVLVSGALTA EDLRSFGRLG DRYAGLIAVV VGSEEHPTAP PDVTLLTAGD TAGFADRWNE APWSR
|
| |