Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1613 |
Symbol | |
ID | 9245463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1977334 |
End bp | 1978917 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | cobyric acid synthase CobQ |
Protein accession | YP_003679548 |
Protein GI | 297560574 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGAGCG TGTCCGTTGA GGGTGGTGAC GGGCCGCAGG CCCCGGTGGG CCGCGGCCGT GGAGGAGGAC TGCTGGTCGC CGGGACCACC TCCGACGCGG GCAAGAGCGT GGTCACCACC GCCCTGTGCC GGGCCTTCGC GCGCCGAGGC GCGCGGGTCG CCCCCTACAA GGCGCAGAAC ATGTCCAACA ACTCGATGGT CTGCCACGGC CCCGACGGCC GCCCGGCGGA GATCGGCCGC GCCCAGTGGG TGCAGGCCCT GGCGGCCCGC GCCGTGCCCG AGCCCGCCAT GAACCCCGTG CTCCTCAAAC CGGGCACCGA CCGGCGCAGC CACGCCGTCC TGCTCGGCCA CCCCGCCGGG GACGTCTCCT CGGCCGACTG GGAGGCGGGC CGCCGCCACC TGGCCGAGGC CGCGCACGCC GCCTACGACG ACCTCGCCTC CCGCCACGAC GTCGTCGTCG CCGAGGGCGC GGGCAGCCCC GCCGAGATCA ACCTGCGGGC GGGCGACTAC GTCAACATGG GCCTGGCCCG GCACGCGGGC CTGCCCGTGG TGGTGGTCGG CGACATCGAC CGGGGCGGCG TCTTCGCCGC GATGTACGGC ACGGTGGGCC TGCTGGAGCC CGCCGACCAG GAGCTGGTCG CGGGCTTCGT GGTCAACAAG TTCCGCGGGG ACCCCGGGCT GCTGCGGCCC GGCCTGGAGG ACCTTGAGCG GCTCACCGGC CGCCGGGTCT ACGGCACCCT GCCGTGGGAC CCGGGGCTGT GGCTGGACTC CGAGGACGCC CTGGACCTGG AGGGCCGCCG GACGCGCGGG GGGGCGGGGC TGAGGGTGGC CGTGGTGCGC CTGCCCCGGA TCAGCAACTT CACCGACGTG GACGCCCTCG GGCTGGAACC GGGCCTGGAC GTCGTCTTCG CCTCCGGCCC GCGCGACGTG GCCGACGCCG ACCTGGTCGT CCTGCCCGGC ACCCGCTCCA CCCTGGCCGA CCTGGCCTGG CTGCGCTCGC GCGGGCTCGA CCGGGCGATC GTCGAGCACG CCCGGCGCGG TCGGCCGCTG CTGGGGATCT GCGGCGGGTT CCAGATGCTC GGCCGCACCG TCACCGACGC CGACGGGGTC GAGGCCGAAC CCGGGGCGCG CGCGGACGGC CTCGGGCTGC TCGACGCCCG GACCGACTTC ACCGCGGACA AGACCCTGCG CCTGCCCTCG GGGGAGGCGC TCGGGGCGCC CGCGCACGGC TACGAGATCC ACCACGGCCG GGTCACGGTC GGCGGCGGCG CGGAGGGGTT CCCGGGCGGC GCCCGCGCGG GCAACGTCTT CGGCACCATG TGGCACGGCA GCCTGGAGGG CGACGCCTTC CGCGCCGCCT TCCTGGCCGA GGCGCTGGGC TGCGCGCCCT CCGGGGTGCG CTTCGCCGCC GCCCGGGAGC GGCGCCTGGA CCTGCTCGCC GACCTGGCGG AGGAGCACCT GGACGTGGAC GCCCTGCTGG CCCTGGCCCG CGAGGGCGCC CCGGCGGGTC TGCCCGTGCT GGCCCCGGGA GCCCCGCCCG CCGCACGCCC GCGGCGCGGG CCCGGGGCGG AGGTGTCCCG GTGA
|
Protein sequence | MRSVSVEGGD GPQAPVGRGR GGGLLVAGTT SDAGKSVVTT ALCRAFARRG ARVAPYKAQN MSNNSMVCHG PDGRPAEIGR AQWVQALAAR AVPEPAMNPV LLKPGTDRRS HAVLLGHPAG DVSSADWEAG RRHLAEAAHA AYDDLASRHD VVVAEGAGSP AEINLRAGDY VNMGLARHAG LPVVVVGDID RGGVFAAMYG TVGLLEPADQ ELVAGFVVNK FRGDPGLLRP GLEDLERLTG RRVYGTLPWD PGLWLDSEDA LDLEGRRTRG GAGLRVAVVR LPRISNFTDV DALGLEPGLD VVFASGPRDV ADADLVVLPG TRSTLADLAW LRSRGLDRAI VEHARRGRPL LGICGGFQML GRTVTDADGV EAEPGARADG LGLLDARTDF TADKTLRLPS GEALGAPAHG YEIHHGRVTV GGGAEGFPGG ARAGNVFGTM WHGSLEGDAF RAAFLAEALG CAPSGVRFAA ARERRLDLLA DLAEEHLDVD ALLALAREGA PAGLPVLAPG APPAARPRRG PGAEVSR
|
| |