Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4161 |
Symbol | |
ID | 9248035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4968044 |
End bp | 4969276 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF1006 |
Protein accession | YP_003682062 |
Protein GI | 297563088 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAAC CGCACCCCCG CCGCGACGCG GAACTCTCCC TCTCCCAGGC GCGCAGGATC GCGCTCGCCG CCCAGGGGTT CAGCGACCCC CGGCCCGCCG GGAAGCCGAC CCTCGCCCAC CTGGCGCGGG TGGTGCGCAG GGTGGGCATC CTCCAGATCG ACAGCGTGAA CGTGCTGGCC CGCAGCCAGT ACCTGCCGGT CTTCGCGCGC ATGGGCGCCT ACGACACCGC CCTGCTGGAC CGGGCGACCA CCTCCGCGAC CGGGTCGGGG GCCGCGCGGC TGGTGGAGTG CTGGGCGCAC GAGGCCAGCC TGGTGCCGCC CTCCACCCGC CAGCTCATGC GCCACCGGAT GGAGCGCAAC CGCGCCCAGG AGCGCGTGGG GTGGATGGAC CGCATCCGGG AGGAGAAGCC CGAGCTGGTC AAGGCGGTGC TGGAGGAGGT GGTGCGGATC GGCCCGGCCA GCGCCCGGCA GGTGGAGGCG GCGCTGGCGC ACGACGTGCC GCGCGCCAAG GACCACTGGG GGTGGAACTG GTCCGAGGTC AAGAGGTGCC TGGAGTACCT GTTCTGGGTG GGCGACCTGA CCTCCAACGG GCGCAACACG CAGTTCGAGC GGCTCTACGA CCTGCCCGAG CGGGTGCTGC CGCCGGAGGT GTACGCGGCG CCGGAGACCA CCCCGGAGGA GGCGCACCGG GAGCTGGTGT CGATCGCGGC CCGCGCCCAC GGGGTGGCCA CCGAGGCGTG CCTGCGCGAC TACTTCCGCC TCAAGTCCAC CGAGTCCCGG GCGGCCGTCT CCGACCTGGT GGAGGCGGGT GAGCTGGTGC CGGTGCGGGT GCGGGGCTGG GACCGCACGG CCTATCTGCA CCGGGACGCC CGGGTGCCGC GCAGCGTGCG GGCGCGGGCG CTGCTGAGCC CGTTCGACTC GCTGGTGTGG ACGCGCGACC GGGCGGAGGA GCTGTTCGGG TTCCGGTACC GGCTGGAGAT CTACGTGCCC GCCGCCAAAC GGGTGCACGG CTACTACGTG CTGCCGTTCC TGCTGGGCGA GGACCTGGTG GCGCGGGTGG ACCTCAAGGC GGACCGCGCC TCGGGGACCC TGCTGGCGCA GCGGATCACG CTGGAGGACG GCGCGCCCGC CGAGACGCTG CCCGAACTGG TCGAGCAGCT GACCGAGATG GCGGCCTGGC TGGGCCTGTC CAACGGCGTC GGCGGCCCGG CGCTGGCCTC CGGCGGACGC TAG
|
Protein sequence | MAEPHPRRDA ELSLSQARRI ALAAQGFSDP RPAGKPTLAH LARVVRRVGI LQIDSVNVLA RSQYLPVFAR MGAYDTALLD RATTSATGSG AARLVECWAH EASLVPPSTR QLMRHRMERN RAQERVGWMD RIREEKPELV KAVLEEVVRI GPASARQVEA ALAHDVPRAK DHWGWNWSEV KRCLEYLFWV GDLTSNGRNT QFERLYDLPE RVLPPEVYAA PETTPEEAHR ELVSIAARAH GVATEACLRD YFRLKSTESR AAVSDLVEAG ELVPVRVRGW DRTAYLHRDA RVPRSVRARA LLSPFDSLVW TRDRAEELFG FRYRLEIYVP AAKRVHGYYV LPFLLGEDLV ARVDLKADRA SGTLLAQRIT LEDGAPAETL PELVEQLTEM AAWLGLSNGV GGPALASGGR
|
| |