Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0175 |
Symbol | |
ID | 9244006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 223333 |
End bp | 224397 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_003678131 |
Protein GI | 297559157 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.690575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCGC GCACCGTGAA ACTGGCGGTC ATTCCCGGCG ACGGGATCGG TCCCGAGGTC ATCGCCGAAG GGCTCAAGGT GCTGGAGGTC GCGGCCTCCC AGCACGACCT CGCCGTGGAG ACCACCGAGT ACGAGCTGGG CGCCAGGCTC TGGCACCGCA CTGGTGAGAC CCTGCCCGAC TCGGTCGAGG CCGAACTCGC CGGGCACGAG GCCATCTACC TCGGCGCAGT CGGCGACCCG TCGGTCCCCA GCGGCGTCCT GGAGCGCGGC CTGCTCCTGC GCCTGCGCTT CGACTTCGAC CACTACGTCA ACCTGCGCCC CGTCCGCCTC TACCCGGGGG TGGACAGCCC GCTCGCGGGC GTGGGCCCCG AGGACATCGA CATGCTGGTG GTCCGCGAGG GCACCGAGGG CCCCTACGCG GGCGCCGGGG GCGTCCTGCG CAAGGGCACC CCGCACGAGA TCGCCACCCA GGACAGCGTC AACACCCGCT TCGGGGTGGA GCGCGTCGTG CGCTACGCCT TCGACAAGGC CGCCTCCCGC CCCCGCCGCA AGCTCACCCT GGTCCACAAG GACAACGTCC TGACCTTCGC CGGCGAGCTG TGGCAGCGCG TGGTCCGGGA GGTCGGCGCC GAGTACCCGC AGGTCTCCGT GGACTACTGC CACGTCGACG CCGCGTCGAT GTTCTTCGTC AACCAGCCCG CCCGCTTCGA CGTGGTGGTC ACCGACAACC TCTTCGGCGA CATCATCACC GACATCGGCG CCGCCATCAC CGGCGGTATC GGGCTGGCCG CCAGCGGCAA CATCAACCCG GACAACACCT TCCCCAGCAT GTTCGAACCC GTTCACGGCT CCGCGCCCGA CATCGCGGGC CAGGGCAAGG CCGACCCCAC CGCCACGGTG CTCTCGGCCG CGACCATGCT GGAGCACCTC GGAGTGCCCG AGGCGGCCCG CCGGATCGAG GCCGCGGTCG CCACGGACCT CAAGACCCGT GCCCAGAACG GCGCGGTCCG CTCCACCTCC CAGATCGGCG ACGACCTCGC CGCGCGAGTA GCCGAGCAGG GCTGA
|
Protein sequence | MAARTVKLAV IPGDGIGPEV IAEGLKVLEV AASQHDLAVE TTEYELGARL WHRTGETLPD SVEAELAGHE AIYLGAVGDP SVPSGVLERG LLLRLRFDFD HYVNLRPVRL YPGVDSPLAG VGPEDIDMLV VREGTEGPYA GAGGVLRKGT PHEIATQDSV NTRFGVERVV RYAFDKAASR PRRKLTLVHK DNVLTFAGEL WQRVVREVGA EYPQVSVDYC HVDAASMFFV NQPARFDVVV TDNLFGDIIT DIGAAITGGI GLAASGNINP DNTFPSMFEP VHGSAPDIAG QGKADPTATV LSAATMLEHL GVPEAARRIE AAVATDLKTR AQNGAVRSTS QIGDDLAARV AEQG
|
| |