Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2178 |
Symbol | |
ID | 9246028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2601899 |
End bp | 2603074 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Uroporphyrinogen III synthase HEM4 |
Protein accession | YP_003680106 |
Protein GI | 297561132 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000567169 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000562153 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACACCT CCCTGGACGA CGACACCGCG GCCCCGCCCG CCCACCGCCC GGCCGAACCC GCCACCACCG CGCCGCTGGC CGGGTTCACC GTCGCCGTCA CCGCCGCCCG CCGCGCCGAG GAGATCAGCG CCCTGCTGCG CCGCAAGGGC GCCCAGGTCC TGGCCGCACC GGCCCTGCGC ATCGTGCCGC TCAGCGACGA CCAGCGCCTG GCCTCGGTCT CCGAACAGCT CGCCCGCCGC CCCGCCGACG TCGTCGTGGC CACCACCGGC ATCGGCTTCC GCGGCTGGGT GGAGGCGTGC GAGACCTGGG GCACCGTCGA CCCGCTCCTG GCGAGCCTGC GCACCTCGCG CCTGCTGGCC CGCGGCCCCA AGGCCAAGGG CGCCATCCGC GCCGCCGGAC TCACCGAGGA GTGGTCGCCG CCCTCGGAGT CCTCCGCCGA GGTCCTGGAC TACCTGCTCG CCCGGGGCGT GCGGGGTCTG CGCGTGGCCA TCCAGCTGCA CGGCGAACCC CTGCCCGACT TCACCGCCGC CCTGCGCCTG GCCGGAGCCG ACGTCGTCGA GGTCCCCGTC TACCGCTGGA CCCTGCCCGA GGACACCGCG CCCCTGGACC GCCTCATCGA GGCCGTCACC AACGGGGGAG TGGACGCGGT CACCTTCACC AGCGCCCCGG CCGCGGCGGG GCTGCTGGCC CGCGCCCACA CCACCGGCCA CCAGGCCGCC CTGGTCCGGG CCCTGCGCGG CGACGTCCTG GCCATGTGCG TGGGAGCGGT CACCGCACGC CCCCTCATGG CCCACGACAT CCCCACCGTG TGGCCCCAGC GCGCCCGCGT CGGCGCCCAG GTCCGCGCGC TCGCCGAGGA GCTGCCCGCA CGCTTTCCGA CCCTGTCCGT GGCCGGACAC CGCCTGCGCC TGCGCGGCCA CGCCGTCCTG GTCGACGGCA CCGTGCGCAC CCTCTCGCCC ACCCTGATGC GGGTGCTGCG CGAGCTGGCG CGCAGGCCCG GCCAGGTCCT GGACCGCACC CGCCTGCTCA CCTGCCTGGG CGAGGACGCC GACGCCCACG CCGTGGAGAC GGCCGTGGCC CGGCTGCGCA CCGCGCTGGG CGACCCCCGC ATCATCCAGA CCGTGGTCAA ACGCGGCTAC CGGCTGGCCC TGGACCCGGC CGAACGCACC CTCTGA
|
Protein sequence | MNTSLDDDTA APPAHRPAEP ATTAPLAGFT VAVTAARRAE EISALLRRKG AQVLAAPALR IVPLSDDQRL ASVSEQLARR PADVVVATTG IGFRGWVEAC ETWGTVDPLL ASLRTSRLLA RGPKAKGAIR AAGLTEEWSP PSESSAEVLD YLLARGVRGL RVAIQLHGEP LPDFTAALRL AGADVVEVPV YRWTLPEDTA PLDRLIEAVT NGGVDAVTFT SAPAAAGLLA RAHTTGHQAA LVRALRGDVL AMCVGAVTAR PLMAHDIPTV WPQRARVGAQ VRALAEELPA RFPTLSVAGH RLRLRGHAVL VDGTVRTLSP TLMRVLRELA RRPGQVLDRT RLLTCLGEDA DAHAVETAVA RLRTALGDPR IIQTVVKRGY RLALDPAERT L
|
| |