Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4648 |
Symbol | |
ID | 9248530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5521295 |
End bp | 5522461 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003682540 |
Protein GI | 297563566 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.128391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAATCG TTTCCGCCCA CGTCGGCACC ATCCCGATCA GCTCGTCGAT GCGCAACGCG TACATCGACT TCAGCAGGAT GGACTGCACG ATCCTGGCGC TGGTCAGTGA CGTGGTGGTC GACGGCAGGC CCCTGGTGGG TTACGGCTTC AACTCCAACG GCCGGTACAA CGCCACGGCC ATCCTGAACG AGCGGATGCT GCCGCGGCTG CGCGAAGCCG CCCCGGAGGA TCTGCTGGAC GAGAACGGCG AACTCTCGCC CGCCCGGGCG TGGGACGTCA TGATGCGCAA CGAGAAGCCG GGCGGACACG GTGAACGCTC CGTCGCCGTC GGCGTGGTGG ACATGGCGCT GCACGACCTC GCCGCCAAGG CCGCGGGAGT GCCGCTGTAC CGGTGGATCT CCGACCACTA CGGCGACGGC GACCCGGACG GGGACGTCTT CGTCTACGCC GCCGGCGGCT ACTACGCGCC CGGCAAGACC CTGGAGGACC TCCAGGACGA GATGCGGGGC TTCCTCGACG CCGGGTACGA GGTCGTCAAG ATGAAGATCG GCGGCGCCGA CCTGTCCGAG GACCTCCGGC GCATCGAGGC GGTCATCGAC GTCCTGGGCG GCGACGGGTC CCGGCTGATG GTGGACGTCA ACGGCAAGTT CGACCTGCGG ACCGCGCTGG AGTACGGCCG GGCCATCGAC CGGTACGGCC TCTTCTGGTA CGAGGAGGTC GGCGACCCGC TGGACTACGC CCTGAACGCG ACGCTGTCGG AGGACTACCG CAACCCCATC GCGACCGGCG AGAACCTGTT CTCCCTCCAG GACGCCCGGA ACCTGATCCG CTACGGCGGG ATGCGCCCGG ACCGCGACTT CGTCCAGGTC GACCCGGCGC TGAGCTACGG GCTGACGGAG TACCGCCGGG TCCTGGACAT GCTCGCCCGG CACGGCTGGT CCTCCCGCCG GTGCATCCCG CACGGCGGGC ACCAGTTCTC GCTGCACATC GCCGCGGCCC TCAAGCTCGG CGGCAACGAG TCCTACCCCG GGGAGTTCCA GCCCACGGGC GGCTTCGCCG ACGAGGCTGT GGTCACCCGC GGTCGTGTGG CGCCGGGTGA CCTCCCGGGC ATCGGGCTCG AAGGCAAGGC GAAGTTCTAC GAGGTCCTGC GGGGCCTGCA CGGCTGA
|
Protein sequence | MRIVSAHVGT IPISSSMRNA YIDFSRMDCT ILALVSDVVV DGRPLVGYGF NSNGRYNATA ILNERMLPRL REAAPEDLLD ENGELSPARA WDVMMRNEKP GGHGERSVAV GVVDMALHDL AAKAAGVPLY RWISDHYGDG DPDGDVFVYA AGGYYAPGKT LEDLQDEMRG FLDAGYEVVK MKIGGADLSE DLRRIEAVID VLGGDGSRLM VDVNGKFDLR TALEYGRAID RYGLFWYEEV GDPLDYALNA TLSEDYRNPI ATGENLFSLQ DARNLIRYGG MRPDRDFVQV DPALSYGLTE YRRVLDMLAR HGWSSRRCIP HGGHQFSLHI AAALKLGGNE SYPGEFQPTG GFADEAVVTR GRVAPGDLPG IGLEGKAKFY EVLRGLHG
|
| |