Gene Ndas_1162 details

Gene Information

Locus tag: Ndas_1162
Symbol:
ID: 9245012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1417574
End bp: 1418788
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_003679109
Protein GI: 297560135
COG category:
COG ID:
TIGRFAM ID:
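
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1417574
end_bp = 1418788
gene_length_bp = 1215
protein_length_aa = 404


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 1418788 - 1417574 + 1 = 1215 bp, and 1215 / 3 - 1 = 404 aa.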


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.233578
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.767056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGATCA CCTCCGCCGA CGTCATCGTG ACGTGCCCGG GACGCAACTA CGTCACCCTC 
AAGATCACCA CCGAGGACGG GACCACCGGC CTCGGCGACG CCACCCTGAA CGGCCGCGAA
CTCTCCGTCG CCTCCTACCT GCGCGACCAC GTCTGCCCGC TGCTCGTGGG CCGCGACTCC
GGACGTATCA ACGACATCTG GCAGTACCTG TACCGCGGCG CCTACTGGCG GCGCGGCCCC
GTCACCATGA CCGCGATCGC CGCCGTGGAC TGCGCCCTGT GGGACATCCT CGGCAAGCGG
GTCGGCAAGC CCGTCCACCA GTTGCTCGGC GGAGCCGCGC GCGACGGCGT CATGGTCTAC
GGCCACGCCA GCGGCCAGTC CGTGGACGAC CTCCTGGACT CGGTCGGCGG CTTCCTGGAC
CAGGGGTACC GGGCGGTGCG TGTGCAGGCC GCCGTCCCGG GCCTGGAGAG CACCTACGGG
CTGCACCACC CCGGCACCGG CCACACCTAC GAGCCCGCCG ACGCGGCGAT GCCCACCGAC
AACGTCTGGC ACACCCCCGC CTACCTGGAC TTCGCGCCCG AGATGATGAA GGCGGTCCGG
GAGAGGTTCG GCTACGGCTT CCACCTGCTG CACGACGTGC ACCACCGGCT CTCCCCGTTG
GAGGCCGCCC AGCTGGGCAG GTCGCTGGAG CCCTACCGCA TGTTCTGGAT CGAGGACCCC
ACCCCGGCCG AGGACCAGGA GGCGTTCCGC ACGATCCGCC AGCACACCAC CACCCCGCTG
GCGGTGGGGG AGGTCTTCAA CACGATCTGG GACTGCCAGC ACCTGATCAC CGAGCGGCTC
ATCGACTACA TCCGCATGTC GGTCTCGCAC AGCGGGGGCA TCACGCACCT GCGGCGGATC
TTCGACCTGG CCGACCTGTA CGGGGTGCGC ACCGGCTCGC ACGGCGCGGG CGACCTGTCG
CCGGTGTCGT TCGCCGCGGC CCTGCACCTG GACCTGACCG TGCCCAACTT CGGCATCCAG
GAGTACATGG GGCACCTGGA GCCCGCCAGC GAGGTGTTCC GGACCTCCTA CACCTTCGCG
GACGGCTACA TGCACCCGGG CGACGCGCCG GGTCTGGGGG TGGAGATCGA CGAGGAGGCC
GCGGCCCGCT ACCCCTACGA GGCCCGGTAC CTGCCGGTCA ACCGCAGGCT CGACGGCTCG
ATGCACGACT GGTGA
 
Protein sequence
MRITSADVIV TCPGRNYVTL KITTEDGTTG LGDATLNGRE LSVASYLRDH VCPLLVGRDS 
GRINDIWQYL YRGAYWRRGP VTMTAIAAVD CALWDILGKR VGKPVHQLLG GAARDGVMVY
GHASGQSVDD LLDSVGGFLD QGYRAVRVQA AVPGLESTYG LHHPGTGHTY EPADAAMPTD
NVWHTPAYLD FAPEMMKAVR ERFGYGFHLL HDVHHRLSPL EAAQLGRSLE PYRMFWIEDP
TPAEDQEAFR TIRQHTTTPL AVGEVFNTIW DCQHLITERL IDYIRMSVSH SGGITHLRRI
FDLADLYGVR TGSHGAGDLS PVSFAAALHL DLTVPNFGIQ EYMGHLEPAS EVFRTSYTFA
DGYMHPGDAP GLGVEIDEEA AARYPYEARY LPVNRRLDGS MHDW
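
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, using only the Python standard library, of how such a listing might be cleaned up, translated, and checked for GC content. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in their permitted start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGAGGATCA CCTCCGCCGA CGTCATCGTG"
print(translate(clean_sequence(first_blocks)))  # MRITSADVIV
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the listed protein sequence, and `gc_percent` on the same cleaned string can be compared with the GC content field in the record.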