Gene Information |
Locus tag | Ndas_0864 |
Symbol | |
ID | 9244709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1060166 |
End bp | 1061326 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF182 |
Protein accession | YP_003678814 |
Protein GI | 297559840 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
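The coordinate and length fields above are related by simple arithmetic, sketched below. The sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Ndas_0864.
length_bp = gene_length(1060166, 1061326)
length_aa = protein_length(length_bp)
print(length_bp)  # 1161, matching the Gene Length field
print(length_aa)  # 386, matching the Protein Length field
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) values listed in the record.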
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0137317 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0522719 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGACA TCCGAGCGGC CGTGTCCGGC CTGTACGCGT CGGGCGAGAC GTTCGCGCTG GCCACCGTGA TCGACACCTA CAAGAGCGCT CCCCGCGACG CCGGGGCGGC GATGCTGGTC AGCGCCTCGG GGGAGGCGGT CGGCAGCGTG TCCGGCGGCT GCGTGGAGGG CGCGGTCTAC GAGGAGGCCC TGGAGGTCAT CCGCACGGGG GTCCCCGTGC GCCGCACCTA CGGGGTCAGT GACGACGACG CCTTCAGCGT CGGCCTGACC TGCGGCGGAA CACTGCACAT GTTCATCGAG CCGATCAGCC GGTCCGCCTA CCCCCAGCTG GGCTCGGTGA TCGGCGCGGT CGACGAGCAC CAGCCGGTCG CCGTGGCCAC CATCGTCTCC GACCCCTCCG ACCAGGGCCG GGTGGGGCGG CGCCGGGTGG TCTGGCCCGG CCACGCCGAG GGCGGCCTGG GCAGCGACAG GCTGGCCGAC GCCCTGGACG ACGACGTGCG CGGCATGCTG GCCCAGGGCA GCACGGGTCT GCTGCGCTAC GGCACCGACG GGCAGCGGCG CGGGGACGAG CTGGAGGTGT TCGTGCAGTC CTTCGCGCCC GCCCCTCGCA TGCTGGTCTT CGGTGCGATC GACTTCGCGG CGGCCGTCGC CGACCTGGGC ACCTACCTGG GTTACCGGGT GACGGTGTGC GACGCCCGGC CGGTGTTCGC GACCGCCAAG CGCTTCCCCA CCGCCGAGGA GGTGGTGGTC AAGTGGCCGC ACGTGTTCTT GGACGAGATC GCCGACCAGA TCGACGAGCG CACGGCCATC TGCGTGCTCA CCCACGACCC CAAGTTCGAC GTGCCGGTGC TCAGCGTGGC CCTGCGCACC CGCGCCGGGT ACATCGGCGC GATGGGGTCC CGGCGCACGC ACGAGGACCG GTTGGAGCGG CTGCGCGAGG CCGGAGTGGG TGAGGCGGAG CTGGCGCGCC TGCACTCGCC GATCGGTCTG GACCTGGGGG CGCGCACCCC GGAGGAGACG GCGGTGTCGA TCGCCGCGGA GTTGGTGCAG GTGCGGTGGG GCGGCAGCGG GCGGGCCCTG CGTGAGACCA GCGGCCGGAT CCACCACGAC ACGTCGGGTA AACCCCTGGT TCCGGCCGGA TTCGGCCACC GTGTGAAGTG A
|
Protein sequence | MRDIRAAVSG LYASGETFAL ATVIDTYKSA PRDAGAAMLV SASGEAVGSV SGGCVEGAVY EEALEVIRTG VPVRRTYGVS DDDAFSVGLT CGGTLHMFIE PISRSAYPQL GSVIGAVDEH QPVAVATIVS DPSDQGRVGR RRVVWPGHAE GGLGSDRLAD ALDDDVRGML AQGSTGLLRY GTDGQRRGDE LEVFVQSFAP APRMLVFGAI DFAAAVADLG TYLGYRVTVC DARPVFATAK RFPTAEEVVV KWPHVFLDEI ADQIDERTAI CVLTHDPKFD VPVLSVALRT RAGYIGAMGS RRTHEDRLER LREAGVGEAE LARLHSPIGL DLGARTPEET AVSIAAELVQ VRWGGSGRAL RETSGRIHHD TSGKPLVPAG FGHRVK
|
| |
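The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of that clean-up, together with a GC-fraction calculation and a stop-codon check. It assumes the stop codons TAA, TAG and TGA for translation table 11. The helper names `clean_sequence`, `gc_fraction` and `ends_with_stop` are hypothetical, and the example uses only the first two blocks and the last few bases of the gene sequence rather than the full record.

```python
# Stop codons assumed for translation table 11 (bacterial, archaeal, plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and the last one is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, as printed in the record.
head = clean_sequence("GTGCGCGACA TCCGAGCGGC")
print(len(head))          # 20
print(gc_fraction(head))  # 0.75

# Last codons of the gene sequence, which is printed as "... GTGTGAAGTG A".
tail = clean_sequence("AAGTG A")
print(ends_with_stop(tail))  # True: the final codon is TGA
```

To check the GC content field for the whole gene, the same functions can be applied to the full Gene sequence value after pasting it into `clean_sequence`.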