Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3150 |
Symbol | |
ID | 9247006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3768817 |
End bp | 3769839 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | cytochrome c oxidase, subunit II |
Protein accession | YP_003681065 |
Protein GI | 297562091 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCCGA CCCGCCAAGA GAATCGGCGC CGCAACCTGC GCCGATGGGC ACCACGCGGC GCCGCGCTCG CCGTGCTGGG GCTGGCCGCG ACCGGCTGCG CGTCGAACGA TCTCACTCGT TTGGGCATGC CGGAGCCGAT CACCAACCAG GCCGAGCGTG TTCTCTCGCT CTGGCAGGGC TCCTGGGTGG CGGCTTTCGC GGTCGGCATT CTCGTGTGGG GGCTGATCGT CTGGTCGGTC ATCTTCCACC GCAAGCGCTC TGAGCAGTTG CCGCCGCAGG TGCGGTACAA CATGCCCATC GAAGCGCTCT ACACCGTGCT GCCGATCGTC ATCATCTCGG TGCTGTTCTT CTTCACCGCC CGGGACCAGG CGATCCTGCT CGACACCGAC GAGCCGGCGG ACGTCAACAT CGAGGTCGTG GCCTTCCAAT GGGCCTGGCA GTTCAACTAC CTCGACGACA AGAAGGAGAA CGGCGGGGAG GTGCTCTTCT CCGAGACGGG TATCCCCAAC CCGGACGGCA CCGCCGACCC CTCCACCCAG ACGACCCTGG TGCTGCCCGA GGGCGCGACC GTCCACTTCG ACCTGCACTC GCCGGACGTC ATCCACTCGT TCTGGATCCC CGAGTTCGGT TTCAAGATGG ACGTCATCCC CGGTCGGGAC AACGCCTTCC AGGCCGACAT CAACGAGGGC ACCGCGGGCG AGTACGTCGG CCGCTGCGCC GAGCTGTGCG GTGTGGACCA CGCCCGCATG CTCTTCAACG TCCAGGTCCT GCCCCAGGAC GAGTACGACG CCTGGGCCGC CGAGCAGCAG CAGGCCGCCG AGGAGGCCGA GCTGGAGGCC GCCGACACCG GCGACACGCC GGACTCCGGT GAGGGCTCCG GTGCCGAGGG TGCGGGCGCC GAGGGCGCCG GCACCGACGA GGAGGGCACC GGCAGCGGCG GTTCCGAGGC CGAGGGCTCC GGTACCGACG AGCAGGACAC CGGTTCCGGC GCGTCCGACG CCGAGGAGAA TGAGCAGTCA TGA
|
Protein sequence | MSPTRQENRR RNLRRWAPRG AALAVLGLAA TGCASNDLTR LGMPEPITNQ AERVLSLWQG SWVAAFAVGI LVWGLIVWSV IFHRKRSEQL PPQVRYNMPI EALYTVLPIV IISVLFFFTA RDQAILLDTD EPADVNIEVV AFQWAWQFNY LDDKKENGGE VLFSETGIPN PDGTADPSTQ TTLVLPEGAT VHFDLHSPDV IHSFWIPEFG FKMDVIPGRD NAFQADINEG TAGEYVGRCA ELCGVDHARM LFNVQVLPQD EYDAWAAEQQ QAAEEAELEA ADTGDTPDSG EGSGAEGAGA EGAGTDEEGT GSGGSEAEGS GTDEQDTGSG ASDAEENEQS
|
| |