Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3092 |
Symbol | |
ID | 9246948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3703137 |
End bp | 3704369 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF349 |
Protein accession | YP_003681007 |
Protein GI | 297562033 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.283256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACGG ACCCTTGGGG CCGCGTAGAC GACGAAGGCA CGGTCTACGT GCGCACGAGC GAAGGCGAGC GGGTCGTCGG ATCGTGGCAG GCGGGGGCGC CCGAGGAGGC TCTGGCCTTC TTCCGACGCA AGTACGACTC CCTGGTCACC GAGGTGGAGC TGCTGGAGAA CCGGCTCCGC AACACCGACC TGTCCGCCTC CGCCGCGATG TCGAACATCG ACAAGCTGCG AGACTCGGTC CGGGAGGCCA ACGCGGTCGG CGACCTGGAG TCGCTGATGG GCCGCCTGGA CGCACTGGCG GGCCGCGCCG AGGAGCGCAA GGTCGAGCAG AAGCAGGCCC AGGAGCAGGC GCGCGGCCAG GCCCGGGAGG TCAAGGAGCG CATCGTCGCC GAGGCCGAGC GGGTCGCGGT GGAGACGACG CACTGGAAGT CCGGCGGCGA GCGGATGCTC CAGCTCATCG AGGAGTGGAA GAAGGCGCCG CGGGCCGACC GGCCCACCGA GCAGGCGCTG TGGAAGCGCA TGTCGGCGGC GCGCAACTCC TTCTCCAAGC GGCGCAAGGC CTACTTCGCC AGCCTGGACC AGGAGCGCGA GTCGGTGCGT GCGGAGAAGG AGCGCATCGT GGTGGAGGCC GAGGCCCTGT CGGGCTCCAC CGACTGGGGG GCCACGGCCC GCGCCTACCG CGACCTGATG CAGCGCTGGA AGCGCAGCGG CCGGGCGGAC CGGGCGAGCG AGGACAAGCT GTGGGCGCGC TTCAAGGCCG CGCAGGACAC CTTCTTCGAC GCGCGCAACG CGACCTTCGC CGAGCGCGAC GCCGAACTGC GGGTCAACGC CGAGGCCAAG GAGCGGATCC TGGCCGAGGC CCAGGCGGAG ATCCCGAAGA TCTCCGACCC TCGCCGGGCG CGTGCGCGGC TGCGCGACTT CCAGGACGCC TGGGAGGAGG CCGGTGAGCT ACCGCGCGAC GTGCGCGACC AGCTGGAGGG CGCCTTCCGG CAGATCGAGG ACGGCGTGCG CCGGGCCGAG GACGCCGAGT GGGAGCGCAG CAACCCCGAG GCGCGGGCCC GCGCCAGGGA CACGGTCGCG CAGCTGGCGG CGGCGATCGC CGACCTGGAG GCCAAGCTGG GCAAGGCCAG GGACAGGGGC GACGAGCGCC GGGTCAGAGA GTACGAGGAC GCGCTGGAGG CCCGCCGCTC GTGGCTGGCC GAGGCGGAGA AGGCCCTCGA CGAGCTGAGC TGA
|
Protein sequence | MTTDPWGRVD DEGTVYVRTS EGERVVGSWQ AGAPEEALAF FRRKYDSLVT EVELLENRLR NTDLSASAAM SNIDKLRDSV REANAVGDLE SLMGRLDALA GRAEERKVEQ KQAQEQARGQ AREVKERIVA EAERVAVETT HWKSGGERML QLIEEWKKAP RADRPTEQAL WKRMSAARNS FSKRRKAYFA SLDQERESVR AEKERIVVEA EALSGSTDWG ATARAYRDLM QRWKRSGRAD RASEDKLWAR FKAAQDTFFD ARNATFAERD AELRVNAEAK ERILAEAQAE IPKISDPRRA RARLRDFQDA WEEAGELPRD VRDQLEGAFR QIEDGVRRAE DAEWERSNPE ARARARDTVA QLAAAIADLE AKLGKARDRG DERRVREYED ALEARRSWLA EAEKALDELS
|
| |