Gene Ndas_4648 details


Gene Information

Locus tag: Ndas_4648
Symbol:
ID: 9248530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5521295
End bp: 5522461
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_003682540
Protein GI: 297563566
COG category:
COG ID:
TIGRFAM ID:
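
The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA); neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 5521295
end_bp = 5522461

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the recorded gene length of 1167 bp and protein length of 388 aa.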


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.128391
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAATCG TTTCCGCCCA CGTCGGCACC ATCCCGATCA GCTCGTCGAT GCGCAACGCG 
TACATCGACT TCAGCAGGAT GGACTGCACG ATCCTGGCGC TGGTCAGTGA CGTGGTGGTC
GACGGCAGGC CCCTGGTGGG TTACGGCTTC AACTCCAACG GCCGGTACAA CGCCACGGCC
ATCCTGAACG AGCGGATGCT GCCGCGGCTG CGCGAAGCCG CCCCGGAGGA TCTGCTGGAC
GAGAACGGCG AACTCTCGCC CGCCCGGGCG TGGGACGTCA TGATGCGCAA CGAGAAGCCG
GGCGGACACG GTGAACGCTC CGTCGCCGTC GGCGTGGTGG ACATGGCGCT GCACGACCTC
GCCGCCAAGG CCGCGGGAGT GCCGCTGTAC CGGTGGATCT CCGACCACTA CGGCGACGGC
GACCCGGACG GGGACGTCTT CGTCTACGCC GCCGGCGGCT ACTACGCGCC CGGCAAGACC
CTGGAGGACC TCCAGGACGA GATGCGGGGC TTCCTCGACG CCGGGTACGA GGTCGTCAAG
ATGAAGATCG GCGGCGCCGA CCTGTCCGAG GACCTCCGGC GCATCGAGGC GGTCATCGAC
GTCCTGGGCG GCGACGGGTC CCGGCTGATG GTGGACGTCA ACGGCAAGTT CGACCTGCGG
ACCGCGCTGG AGTACGGCCG GGCCATCGAC CGGTACGGCC TCTTCTGGTA CGAGGAGGTC
GGCGACCCGC TGGACTACGC CCTGAACGCG ACGCTGTCGG AGGACTACCG CAACCCCATC
GCGACCGGCG AGAACCTGTT CTCCCTCCAG GACGCCCGGA ACCTGATCCG CTACGGCGGG
ATGCGCCCGG ACCGCGACTT CGTCCAGGTC GACCCGGCGC TGAGCTACGG GCTGACGGAG
TACCGCCGGG TCCTGGACAT GCTCGCCCGG CACGGCTGGT CCTCCCGCCG GTGCATCCCG
CACGGCGGGC ACCAGTTCTC GCTGCACATC GCCGCGGCCC TCAAGCTCGG CGGCAACGAG
TCCTACCCCG GGGAGTTCCA GCCCACGGGC GGCTTCGCCG ACGAGGCTGT GGTCACCCGC
GGTCGTGTGG CGCCGGGTGA CCTCCCGGGC ATCGGGCTCG AAGGCAAGGC GAAGTTCTAC
GAGGTCCTGC GGGGCCTGCA CGGCTGA
 
Protein sequence
MRIVSAHVGT IPISSSMRNA YIDFSRMDCT ILALVSDVVV DGRPLVGYGF NSNGRYNATA 
ILNERMLPRL REAAPEDLLD ENGELSPARA WDVMMRNEKP GGHGERSVAV GVVDMALHDL
AAKAAGVPLY RWISDHYGDG DPDGDVFVYA AGGYYAPGKT LEDLQDEMRG FLDAGYEVVK
MKIGGADLSE DLRRIEAVID VLGGDGSRLM VDVNGKFDLR TALEYGRAID RYGLFWYEEV
GDPLDYALNA TLSEDYRNPI ATGENLFSLQ DARNLIRYGG MRPDRDFVQV DPALSYGLTE
YRRVLDMLAR HGWSSRRCIP HGGHQFSLHI AAALKLGGNE SYPGEFQPTG GFADEAVVTR
GRVAPGDLPG IGLEGKAKFY EVLRGLHG
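
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch. The `translate` function and the table constants are hypothetical names introduced here, not part of the record. The codon assignments are those of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 allows are not handled, which does not matter for this gene because it begins with ATG.

```python
# Compact standard codon table: bases in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Hypothetical helper: alternative start codons are not treated specially.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_block = translate("ATGCGAATCG TTTCCGCCCA CGTCGGCACC")
print(first_block)
```

Applied to the first 30 bases, this reproduces the first ten residues of the listed protein sequence (MRIVSAHVGT).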