Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3670 |
Symbol | |
ID | 9247539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4406285 |
End bp | 4407793 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_003681574 |
Protein GI | 297562600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.313455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGC ACGTCACCCA CTGGATCGGC GGGTCCGCCC ACGAGGGGCC GGCCCGGCGC ACGGGGGACA TCTACAATCC GGCCTCCGGA CGGGTCACCG GCACGGTCGA CCTCGCCGGA CGGCAGGAGG TCGACGCGGC GGGGACGGCC GCGCGGGAGG CCTTCCCGGG GTGGCGCGAC ACCCCGCTGT CCCGGCGGGT GCAGGTCCTG TTCCGCTTCC GCGAGCTGCT CAGCGCCAAC GCCGACCGGC TGGCGGAGCT GGTCAGCGCC GAGCACGGCA AGGTCCTCTC CGACGCCCGG GGCGAGGTGG CCCGCGGCCT GGAGGTCGTC GACTTCGCCT GCGGCATCCC GCACCTGCTC AAGGGCGGCT ACTCCGAGAA CGTGTCCACG GGCGTGGACG CCTACTCGAT CCTCCAGCCT CTCGGCGTGG TCGCCGGGAT CACGCCGTTC AACTTCCCGG CGATGGTGCC GATGTGGATG TTCCCGGTGG CCCTGGCCTG CGGCAACGCG TTCGTGCTCA AGCCCAGCGA GAAGGACCCC TCCGCGTCGG TGCTGCTGGC CGGGCTGTGG GCCGAGGCGG GGCTGCCCGA GGGCGTGTTC AACGTGGTGC ACGGCGACAA GGAGGCGGTG GACGCCCTCC TGGAGCACCC GGACGTGGCG GCGGTCAGCT TCGTGGGCTC CACCCCGATC GCGCGGTACG TCTACCGGAC CGCCGCCGAG CACGGCAAGC GCGTGCAGGC CCTGGGCGGG GCCAAGAACC ACATGGTGGT GCTGCCCGAC GCCGACCTGG ACCTGGCCGC GGACGCGGCG GTCTCGGCCG GGTTCGGCTC GGCCGGCGAG CGGTGCATGG CCATCTCCGC GGTGGCCGTG GTGGACTCGG TGGCCGACGG GCTGGTGGAG CGGATCCGCG AGCGGGTGGC GCGCCTGCGC GTGGGCCCCG GCGACGACGA GCGCAGCGAG ATGGGGCCGC TGGTCACCAG GGAGCACCGC GACAGGGTGG CCTCCTACCT GGAGTCGGGG GTGCGCGAGG GCGCGACCCT GGCGGTCGAC GGCCGCGCGC ACCCTGTGTC GGGCGGGAGC CCGGACGGGT TCTGGCTGGG ACCGTCGCTG CTGGACCACG TCGGCCCCGA GATGTCGTGC TACCGGGACG AGATCTTCGG CCCCGTGCTG AGCGTGGTGC GCGTGGGCGG CTACGACGAG GCGGTCAAGC TCGTCAACGC CAGCCCCTAC GGCAACGGCA CGGCGGTCTT CACCAACGAC GGGGGCGCCG CCCGGCGGTT CCAGAACGAG GTCGAGGTCG GCATGGTGGG GATCAACGTC CCCATCCCGG TGCCGATGGC CTACTACTCC TTCGGCGGCT GGAAGCAGTC CCTGTTCGGC GACTCACACG CGCACGGCAC GGAGGGTGTC CACTTCTACA CCCGTACCAA GGCGGTCACC GCCCGGTGGG CCGACCCGGG CCAGCGGCCC GAAGGGGGCG TCGACCTGGG GTTCCCCACC AACGGTTGA
|
Protein sequence | MSKHVTHWIG GSAHEGPARR TGDIYNPASG RVTGTVDLAG RQEVDAAGTA AREAFPGWRD TPLSRRVQVL FRFRELLSAN ADRLAELVSA EHGKVLSDAR GEVARGLEVV DFACGIPHLL KGGYSENVST GVDAYSILQP LGVVAGITPF NFPAMVPMWM FPVALACGNA FVLKPSEKDP SASVLLAGLW AEAGLPEGVF NVVHGDKEAV DALLEHPDVA AVSFVGSTPI ARYVYRTAAE HGKRVQALGG AKNHMVVLPD ADLDLAADAA VSAGFGSAGE RCMAISAVAV VDSVADGLVE RIRERVARLR VGPGDDERSE MGPLVTREHR DRVASYLESG VREGATLAVD GRAHPVSGGS PDGFWLGPSL LDHVGPEMSC YRDEIFGPVL SVVRVGGYDE AVKLVNASPY GNGTAVFTND GGAARRFQNE VEVGMVGINV PIPVPMAYYS FGGWKQSLFG DSHAHGTEGV HFYTRTKAVT ARWADPGQRP EGGVDLGFPT NG
|
| |