Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4104 |
Symbol | |
ID | 9247978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4901089 |
End bp | 4902435 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | Coproporphyrinogen dehydrogenase |
Protein accession | YP_003682006 |
Protein GI | 297563032 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.104304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.524988 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCA CCCTGACCCC GCGCCCCGAA CCGGTGCGCG CCGACTCCCC CTACCAGTCC TACGTCTACG CCTACCCGCA CAAGAGCGCC TACCGGCCCT TCACCGAGCG CCCCGCGCTC GCCGACCTGT GGCGCGGCGA GGACGTCGGC GCGCTGTCGC TGTACGCGCA CATCCCGTTC TGCGAGATGC GCTGCGGGTT CTGCAACCTG TTCACCCGCT CCACTCCCCC GGCCGAGCAG GTCACCGCCT ACCTGGACGC GCTGGAGCGC CAGGCGGAGG CGGTGGCCGG GGCGCTGCCG GAGGGCGCCG CCTTCGCCCG GGCCGCCCTG GGCGGCGGCA CCCCCACCTA CCTGACCGCC GAGGAGCTCA CCCGGGTCTA CGACCTCACC GAGAGCGCCT TCGGGGTGGA CCTGTCGGCG ATCCCGGTGT CGGTGGAGAC CTCTCCGGCC ACGGCCACGC CCGACCGGCT GGCGGTGCTC GACGCGCGCG GCGCCACCCG GATCAGCATG GGCGTGCAGA GCTTCCTGGA CGCCGAGGCG CACGCGGCCG GGCGTCCGCA GAAGCGCGCC GAGGTGGACC GGGCGCTGGC CGCGATCCGC GAGCACGCCT CGGCCGACCT CAACCTCGAC CTCATCTACG GCATCGACCG CCAGGACGCC CGCACCTGGG CCTACTCCCT GGACACGGCG CTGGAGTGGG AGCCCGAGGA GGTCTACCTG TACCCGCTGT ACGTGCGCCC GCTCACCGGG CTGGGCCGCC GCGCCCGCGC GTGGGACGAC CACCGGCTGG GCCTGTACCG GCAGGGCCGC GACCACCTGC GCGAACGCGG CTACGAGCAG GTGTCCATGC GCATGTTCCG GCGGGCGGAC GCCCCGAAGA CGCAGGCCCC GGACCACTCC TGCCAGACCG ACGGCATGGT GGGGCTGGGC TGCGGGGCGC GGTCCTACAC CTCCGCCGCG CACTACTCCT TCGACTACGC CGTGGGCGTG GGGCAGGTGC GGTCGATCAT CGCCGACTAC ACGAGCCGCC GCCAAGCGGA CTTCGGACGG GCCGAGGTGG GGTTCCGCAT GGACGAAGGC GAGCGGCGCC GCCGCCACCT GCTCCAGTCG CTGCTGCTCG CGGAGGGGAT GGACACCGCC GCCTACGCCG ACCGGTTCGG CTCCCGCCCC GAGGAGGACT TCGCCGCGAC CCTGGCGGTG CTCGACGGGC GCGGCTGGCT GGAGCGGGAC GGCGCGCCGG ACCTGCTGCG GCTGACCCCG GAGGGGCTCG CGCACTCCGA CGCGGTGGGG CCGATGTTCT TCTCCGCCGG GGTGGCGGCC CTGATGGCCG ACTACGAGGC CCGGTGA
|
Protein sequence | MTTTLTPRPE PVRADSPYQS YVYAYPHKSA YRPFTERPAL ADLWRGEDVG ALSLYAHIPF CEMRCGFCNL FTRSTPPAEQ VTAYLDALER QAEAVAGALP EGAAFARAAL GGGTPTYLTA EELTRVYDLT ESAFGVDLSA IPVSVETSPA TATPDRLAVL DARGATRISM GVQSFLDAEA HAAGRPQKRA EVDRALAAIR EHASADLNLD LIYGIDRQDA RTWAYSLDTA LEWEPEEVYL YPLYVRPLTG LGRRARAWDD HRLGLYRQGR DHLRERGYEQ VSMRMFRRAD APKTQAPDHS CQTDGMVGLG CGARSYTSAA HYSFDYAVGV GQVRSIIADY TSRRQADFGR AEVGFRMDEG ERRRRHLLQS LLLAEGMDTA AYADRFGSRP EEDFAATLAV LDGRGWLERD GAPDLLRLTP EGLAHSDAVG PMFFSAGVAA LMADYEAR
|
| |