Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0177 |
Symbol | |
ID | 9244008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 226317 |
End bp | 227927 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | 2-isopropylmalate synthase/homocitrate synthase family protein |
Protein accession | YP_003678133 |
Protein GI | 297559159 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.342023 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGACG ACAGTTTCCA CGTCTTCGAC ACCACGCTGC GCGACGGGGC CCAGCGCGAG GGGATCAACC TGCGGGTCTC CGACAAGCTG GCCATCGCCA AGCTGCTGGA CGACTTCGGG GTGGCGTTCA TCGAGGGAGG GTGGCCGGGG GCCAACCCCA AGGACACCGA GTTCTTCCAG CGGGCTTCAC GAGAGCTGAC GCTGGAGCAC GCGCAACTCA CCGCGTTCGG CGCGACCCGC CGTGCCGGTG TGCGGGCCGC CGACGACCCA CAGGTGGCCG CACTGCGCGA CAGCGGCGCA CCGGTCGTCA CCCTTGTCGC CAAGAGTGAC GACCGGCACG TCGAGCGCGC GCTGCGCACG ACCCTCGACG AGAACCTCGC CATGATCGCC GACACGGTGT CCCACCTGAG CGAACACGGC CAGCGCGTAT TCGTGGACTG CGAGCACTTC TTCGACGGAT ACCTCCACAA CCCCGACCAC GCGCTCGACG TGGTCCGCGC CGCCGCCGGG GCCGGTGCCG ACGTCGTCGT CCTGTGCGAC ACCAACGGCG GCATGCTCCC CACCGACGTC ACCCGTATCG TCACCGAGGT CCGCGAGGCC ACCGGCGCAC GCCTGGGCAT CCACGCCCAG GACGACACCG GCTGCGCCGT CGCCAACACC CTCGCCGCCG TGGACGCGGG CGCCACCCAC GTACAGTGCA CCGCCAACGG CTACGGCGAG CGGGTCGGCA ACGCCAACCT CTTCTCCGTG GTCGGCGCGC TCACGCTCAA GCGCGGCCAG GAGGTCCTCC CTGAGGGCTG CCTGGCCGAG ATGACCCGCG TGGCCACCGC CATCGCCGAG ATCGTCAACC TCACCCCCGA CACGCACCAG CCCTACGTGG GGGTGTCGGC CTTCGCGCAC AAGGCGGGGC TGCACGCCTC CGCGATCAAG GTCGACCCCG ACCTGTACCA GCACACGGAC CCCGCGCTGG TCGGCAACGC CATGCGCATG CTCGTCTCCG ACATGGCCGG GCGGGCCTCC ATCGAACTCA AGGCCAAGGA GTTGGGCCTG GACCTGTCCG GAGACCGCGC CCTGTCGGGG CGGGCCGTGG AGCGGGTCAA GGGCCTGGAG CTGTCGGGCT ACAGCTTCGA GGCCGCCGAC GCCTCCCTGG ACCTGCTGCT GCGCGAGGAA CTGGGGCAGC CGGTCCGCTA CTTCGACACC GAGTCCTGGC GCGTCATCAC CGAACGCCGA CCCCGGGCCG GGTCCAGCCC CCTGGCCAGC GACTACGAGA GCCTCACCGA GGCCACCGTC AAACTGCGGG TCAAGGGCGA ACGCGTGATC GCCACCGCGG AGGGCAACGG CCCCGTCAAC GCCCTGGACC GGGCGCTGCG CAGCGCCATG GAGGGCGTGT ACACCGCGCT GGCCGGGCTG GAGCTGACCG ACTACAAGGT CCGCATCCTG GAGGGCAGCT CCGGCACCAA CGCCATCACC CGCATCCTCA TCACCTTCAG CGACGGGGTG GGGGAGTGGA CCACGGTGGG CGTGGGCCCC AACGTCGTCG ACGCGTCCTG GGTCGCCCTC GAACAGGCCG TCACCTACGG GCTCCTGCGC CAGGGCTACC CGCAGGGCTG A
|
Protein sequence | MRDDSFHVFD TTLRDGAQRE GINLRVSDKL AIAKLLDDFG VAFIEGGWPG ANPKDTEFFQ RASRELTLEH AQLTAFGATR RAGVRAADDP QVAALRDSGA PVVTLVAKSD DRHVERALRT TLDENLAMIA DTVSHLSEHG QRVFVDCEHF FDGYLHNPDH ALDVVRAAAG AGADVVVLCD TNGGMLPTDV TRIVTEVREA TGARLGIHAQ DDTGCAVANT LAAVDAGATH VQCTANGYGE RVGNANLFSV VGALTLKRGQ EVLPEGCLAE MTRVATAIAE IVNLTPDTHQ PYVGVSAFAH KAGLHASAIK VDPDLYQHTD PALVGNAMRM LVSDMAGRAS IELKAKELGL DLSGDRALSG RAVERVKGLE LSGYSFEAAD ASLDLLLREE LGQPVRYFDT ESWRVITERR PRAGSSPLAS DYESLTEATV KLRVKGERVI ATAEGNGPVN ALDRALRSAM EGVYTALAGL ELTDYKVRIL EGSSGTNAIT RILITFSDGV GEWTTVGVGP NVVDASWVAL EQAVTYGLLR QGYPQG
|
| |