Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1050 |
Symbol | |
ID | 9244896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1296047 |
End bp | 1297192 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003678999 |
Protein GI | 297560025 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.140519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCGC ACGCCGGAGC GGCCACGCAG GACCGCATCA CCAGCGTCAC GATCTCCTCG GTCACCCTTC CCCTGAACAC GCCCATCAGC GACGCCAAGG TCCTCACCGG GCGCCAGCGG CCGATGACCG AGGTCGCGAT GCTCTTCGCC GAGATCACCA CCGAGGCCGG TCACGAGGGC GTCGGGTTCG GCTACTCCAA GCGTGCGGGC GGCCCGGGGC AGTTCGCCCA CGCCCGGGAG GTGGCCTCCG TCCTGCTGGG GGAGGACCCC AGCGACACGG GCAAGATCTG GGACAAGCTC GTCTGGGCGG GCGCCTCGGT GGGCCGCAGC GGGCTGGCCA CCCAGGCGAT CGCGCCCTTC GACATCGCCC TGTGGGACCT CAAGGCCAAG CGGGCGGGCC TGCCGCTGGC CAAGCTCCTC GGCAGCTACC GCGACTCGGT GCGCTGCTAC AACACCTCGG GCGGCTTCCT CCACGCCCCC GTCGAGGAGG TCATGGAGAG GTCGGCCGCG GCGGTGGCCG ACGGCATCGG CGGTATCAAG CTCAAGGTCG GCCACCCCGA CAGCGCCACG GACCTGGCCC GGGTCGCGGC GGTGCGCGAA CACCTGGGCG ACGGCGTGCC GCTGATGGTG GACGCCAACC AGCAGTGGTC GCGGGCCGAC GCCCAGCGCA TGTGCCGGGC CTTCGAGGAG TTCGGGCTGG TCTGGATCGA GGAGCCGCTG GACGCCTACG ACTTCGAGGG CCACGGGCGC CTGGCCGCGA CCTTCGACAC CTCCATCGCC ACCGGGGAGA TGCTCACCAG CGTCGCCGAG CACGCCGAGC TGATCCGCCA CGGGGGCGCG GACATCATCC AGCCCGACGC GCCCCGGATC GGCGGCATCA CGCAGTTCCT CCAGGTCATG GCGATGGCCG ACCGGCGCCA CCTCCAGCTG GCCCCGCACT TCGCGATGGA GGTCCACATC CACCTGGCCG CCGCCTACCG GCACGAGCCG TGGGTGGAGC ACTTCGAGTG GCTCGACCCC CTCTTCAACG AGCACCTGGA GATCTCGGGC GGGCGCATGC ACCTCTCCGA CCGGCCCGGC CTGGGGGTGA CCCTGAGCGA CCAGGCGCGC GCGTGGACGG TCGACACCCA CCGCGTCAAG GCCTGA
|
Protein sequence | MTPHAGAATQ DRITSVTISS VTLPLNTPIS DAKVLTGRQR PMTEVAMLFA EITTEAGHEG VGFGYSKRAG GPGQFAHARE VASVLLGEDP SDTGKIWDKL VWAGASVGRS GLATQAIAPF DIALWDLKAK RAGLPLAKLL GSYRDSVRCY NTSGGFLHAP VEEVMERSAA AVADGIGGIK LKVGHPDSAT DLARVAAVRE HLGDGVPLMV DANQQWSRAD AQRMCRAFEE FGLVWIEEPL DAYDFEGHGR LAATFDTSIA TGEMLTSVAE HAELIRHGGA DIIQPDAPRI GGITQFLQVM AMADRRHLQL APHFAMEVHI HLAAAYRHEP WVEHFEWLDP LFNEHLEISG GRMHLSDRPG LGVTLSDQAR AWTVDTHRVK A
|
| |