Gene Ndas_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1050 
Symbol 
ID9244896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1296047 
End bp1297192 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content72% 
IMG OID 
ProductMandelate racemase/muconate lactonizing protein 
Protein accessionYP_003678999 
Protein GI297560025 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.140519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGC ACGCCGGAGC GGCCACGCAG GACCGCATCA CCAGCGTCAC GATCTCCTCG 
GTCACCCTTC CCCTGAACAC GCCCATCAGC GACGCCAAGG TCCTCACCGG GCGCCAGCGG
CCGATGACCG AGGTCGCGAT GCTCTTCGCC GAGATCACCA CCGAGGCCGG TCACGAGGGC
GTCGGGTTCG GCTACTCCAA GCGTGCGGGC GGCCCGGGGC AGTTCGCCCA CGCCCGGGAG
GTGGCCTCCG TCCTGCTGGG GGAGGACCCC AGCGACACGG GCAAGATCTG GGACAAGCTC
GTCTGGGCGG GCGCCTCGGT GGGCCGCAGC GGGCTGGCCA CCCAGGCGAT CGCGCCCTTC
GACATCGCCC TGTGGGACCT CAAGGCCAAG CGGGCGGGCC TGCCGCTGGC CAAGCTCCTC
GGCAGCTACC GCGACTCGGT GCGCTGCTAC AACACCTCGG GCGGCTTCCT CCACGCCCCC
GTCGAGGAGG TCATGGAGAG GTCGGCCGCG GCGGTGGCCG ACGGCATCGG CGGTATCAAG
CTCAAGGTCG GCCACCCCGA CAGCGCCACG GACCTGGCCC GGGTCGCGGC GGTGCGCGAA
CACCTGGGCG ACGGCGTGCC GCTGATGGTG GACGCCAACC AGCAGTGGTC GCGGGCCGAC
GCCCAGCGCA TGTGCCGGGC CTTCGAGGAG TTCGGGCTGG TCTGGATCGA GGAGCCGCTG
GACGCCTACG ACTTCGAGGG CCACGGGCGC CTGGCCGCGA CCTTCGACAC CTCCATCGCC
ACCGGGGAGA TGCTCACCAG CGTCGCCGAG CACGCCGAGC TGATCCGCCA CGGGGGCGCG
GACATCATCC AGCCCGACGC GCCCCGGATC GGCGGCATCA CGCAGTTCCT CCAGGTCATG
GCGATGGCCG ACCGGCGCCA CCTCCAGCTG GCCCCGCACT TCGCGATGGA GGTCCACATC
CACCTGGCCG CCGCCTACCG GCACGAGCCG TGGGTGGAGC ACTTCGAGTG GCTCGACCCC
CTCTTCAACG AGCACCTGGA GATCTCGGGC GGGCGCATGC ACCTCTCCGA CCGGCCCGGC
CTGGGGGTGA CCCTGAGCGA CCAGGCGCGC GCGTGGACGG TCGACACCCA CCGCGTCAAG
GCCTGA
 
Protein sequence
MTPHAGAATQ DRITSVTISS VTLPLNTPIS DAKVLTGRQR PMTEVAMLFA EITTEAGHEG 
VGFGYSKRAG GPGQFAHARE VASVLLGEDP SDTGKIWDKL VWAGASVGRS GLATQAIAPF
DIALWDLKAK RAGLPLAKLL GSYRDSVRCY NTSGGFLHAP VEEVMERSAA AVADGIGGIK
LKVGHPDSAT DLARVAAVRE HLGDGVPLMV DANQQWSRAD AQRMCRAFEE FGLVWIEEPL
DAYDFEGHGR LAATFDTSIA TGEMLTSVAE HAELIRHGGA DIIQPDAPRI GGITQFLQVM
AMADRRHLQL APHFAMEVHI HLAAAYRHEP WVEHFEWLDP LFNEHLEISG GRMHLSDRPG
LGVTLSDQAR AWTVDTHRVK A