Gene Information |
Locus tag | Ndas_4874 |
Symbol | |
ID | 9248761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 5727 |
End bp | 7097 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF418 |
Protein accession | YP_003682763 |
Protein GI | 297563790 |
COG category | |
COG ID | |
TIGRFAM ID | |
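The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5727
end_bp = 7097
protein_length_aa = 456

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(gene_length_bp)  # 1371
print(remainder)       # 0
print(coding_codons)   # 456
```

Under these assumptions the 1371 bp gene holds 457 codons, of which 456 encode amino acids, matching the listed protein length.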
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.93625 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGG CGGGGTACGA CACGACAGGG GCACAACGGC ACGGGGACGG GACGCACCAC CCGGTCGGTG CCGGGGGGCG GTACACGGCC GGGGCACGGC ACGAGGGCGG GGCGCTCCAC ACGGCCGGGA GCGGGGAACC GCGCACGGCG GGGACCGAGG CGCGGCACAC GGCCGGGGCA CGGCGCGCAG CCGCGTCCGA CGGTGCCGGT CCCGGCCTGC CGTCCCCCGC GCGGCGCGTG GTGTTCCTGG ACGTCGCCCG GGCGCTCGCG ATGATCGGCG TGGTCATGAT GAACGCGTCC TCGGTGGTCT ATCCCGTCGA GATGTCGGGC GACCGGGTCC CGGGCGTGCT GAGCGAACTC GTGGACGGCG GGCTCACCCT GCTCATGTCG GGCAGGGCCA GGACCATGCT CATGGTGCTG CTGGGCGCGG GCGTGGTCCT GGCGTGGCGG GCCGCCGCCC GGCGCGGCGG GAGTCCGGCG GCGGTGATGC TGCGCCGCTA CGCCGTCCTG GGCCTGCTGT TCGGACTTCC GCACCTGGCG GTCTTCGACG GGGACATCCT CACCCAGTAC TCGGTCGCGG CGCTGCTGCT GACCCCGCTG GTGCCGCTGC TGCTCGGCGG GTCCCGCCGC AGACCGCTGG TCGCGGCGGC CGTGCTGTTC GCCGCCGTGC CGGTGTCGGA CCTGCTGCTC TCGCCGTTCC TCGGGGACCA CACCTGGGGG GCCTCGGCGA TGCTGGTCCC GCAGACCCTC GGGTTCTTCT GCGTGGGCGT GTGGCTGGCC CGCCGCCCGG AGCTCACCGC CGAGCCCGGA ACCGGTGCGG AGGGGACCTC CCGCCTGCCG CTGCGCATGC TCGTGTTCGG CGCGGTGGTC CAGGTCCTGA GCGTGGCCCT CATGCTCGTC GGCAGCGTCG TGTTCCCGAC CGAGTTCGGC GCCGACGGCG CGCCGGTGCG GTCCGTGGGC GAGACCGTGG TCGTCCTCCT GGGGAACACC TTGCTGAACC TGGGCGGAGC CCTGCTCTAC CTGGGACTGG TGTGGTGGCT GGTACTCAGG GGACGGGGCG CGGCGCGCGT GCTGGGGACC CTGGCCCCGC TCGGGCGGAT GACCCTCACG GTGTACCTGG GCAGCACGGC GGTGTTCCTG GCGGTCATGG GCCCGTTCGA GGGGACGGTC CCCCAGCTGG CCCAGTACGC CCTGGCCGCC GCCTACTTCG TCGCGACCGC CGTCCTGGCC CACCTGTGGG CGCGCCGGTT CCGGCTCGGT CCGCTGGAGT GGGTGTGGCG GAGCCTGACC CACCTGCGCC CGGTGCCCCT GCGCGCCGAG CGCTCCGGTC GGCGGTCCGG CCTCCCGGCC GCCAGCGGGG ACCCGGCCTG A
|
Protein sequence | MSEAGYDTTG AQRHGDGTHH PVGAGGRYTA GARHEGGALH TAGSGEPRTA GTEARHTAGA RRAAASDGAG PGLPSPARRV VFLDVARALA MIGVVMMNAS SVVYPVEMSG DRVPGVLSEL VDGGLTLLMS GRARTMLMVL LGAGVVLAWR AAARRGGSPA AVMLRRYAVL GLLFGLPHLA VFDGDILTQY SVAALLLTPL VPLLLGGSRR RPLVAAAVLF AAVPVSDLLL SPFLGDHTWG ASAMLVPQTL GFFCVGVWLA RRPELTAEPG TGAEGTSRLP LRMLVFGAVV QVLSVALMLV GSVVFPTEFG ADGAPVRSVG ETVVVLLGNT LLNLGGALLY LGLVWWLVLR GRGAARVLGT LAPLGRMTLT VYLGSTAVFL AVMGPFEGTV PQLAQYALAA AYFVATAVLA HLWARRFRLG PLEWVWRSLT HLRPVPLRAE RSGRRSGLPA ASGDPA
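The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a simple GC-fraction helper. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three and last two blocks of the gene sequence are used here, so the printed GC value describes that fragment and not the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits the sequence into blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three and last two blocks of the gene sequence, copied from the record.
first_blocks = "ATGAGCGAGG CGGGGTACGA CACGACAGGG"
last_blocks = "ACCCGGCCTG A"

head = clean_sequence(first_blocks)
tail = clean_sequence(last_blocks)

print(head.startswith("ATG"))  # the gene begins with an ATG start codon
print(tail.endswith("TGA"))    # the gene ends with a TGA stop codon
print(len(head), round(gc_fraction(head), 3))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a way to compare against the GC content field listed in the record.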