Gene Information |
Locus tag | Ndas_2021 |
Symbol | |
ID | 9245871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2441627 |
End bp | 2442715 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_003679953 |
Protein GI | 297560979 |
COG category | |
COG ID | |
TIGRFAM ID | |
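The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function and variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 2441627
END_BP = 2442715
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length are consistent with the listed coordinates.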
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGTCA CCGAGGTCGG CGCCCGGGCC TTCCGGCTGC CGCTGCACCG CTCCTGGGAC GGGGGCGTGG ACCGCAACGA CGTCGTGGTG GTGCGGGTGG CCACCGACTC CGGGCTGGTC GGCACGGGTT TCGCGTGGAC CCCGCTGATC GGGGCCCGGG CGGTGGCCGC GCTGGTCAAC GACGACCTGC GGCCCGCGCT GGTGGGCGCC GAGGCCCATC CGGCCCGGTG GGACGAGCTG CGCTGGCACC TGCGCGAGGC GGGCACCAGC GGGCTCACCC TGATGGCGCT GGCGGGGGTG GACATCGCCC TGTGGGACCT GCGCGCCCGC GCGGCGGAGT CGAGCCTGGT GGATCTGGTG GGTCGGCGCC GCGAGTCGGC GCCGGCCTAC GGCAGCGGGG TCAACCTGGA CTACTCCCTG CCCGACCTGG TGGAACAGGT GCGCGGCTGG GTGGAGGCCG GGCACGTCGC CGCCAAGGTC AAGGTCGGCT CGCCCGATCC GGCCCGGGAC GCCGAGCGGG TGGGCGCGGT GCGCGAGGTA CTGGGCCCGG ACCGGCTGCT GATGGTGGAC GCCAACCAGC GCTGGGACGT GCCCGGGGCG GTCCGGGCGC TGGACGCGCT GGAGGAGTTC GGCCCGCACT TCGTGGAGGA GCCCCTCCCG GCCGAGGACC TGGAGGCGCA CGCCCGTCTG CGCGAGCGCA CGCGGGTGCC CTTCGCCGTG GGTGAGAACC TGCGCACGGC CGCCGAGTTC GAGCGGGCCG TGGAGCTGGG GGTGTGCGAC GTCGCCCAGC CCAACGTGGT GCGGGTGGGC GGCATCACCC CGTTCCTGCG GATCGCGGAG TCGATGGCGC GCCGCGGCGT GCCGGTGGCT CCGCACCTGT TGCCCGAGCT GTCGGGGCAG CTGGCGCTGT GCCTGCCCCG GGTGGCCATG GTCGAGGACA TCGACCGGGC CTCCTTCGCC GCGCTGGGAG CGTTGGCCCG CCCCAGCGGG GTGGAGTTCG ACCGGGGCCG GGTGCGCGCC GACACCGGCC ACGGCCACGG CCTGGTGTTC GCCGACACGC TCACACCGGT CGCGGACGCT TCCCCCTAG
Protein sequence | MRVTEVGARA FRLPLHRSWD GGVDRNDVVV VRVATDSGLV GTGFAWTPLI GARAVAALVN DDLRPALVGA EAHPARWDEL RWHLREAGTS GLTLMALAGV DIALWDLRAR AAESSLVDLV GRRRESAPAY GSGVNLDYSL PDLVEQVRGW VEAGHVAAKV KVGSPDPARD AERVGAVREV LGPDRLLMVD ANQRWDVPGA VRALDALEEF GPHFVEEPLP AEDLEAHARL RERTRVPFAV GENLRTAAEF ERAVELGVCD VAQPNVVRVG GITPFLRIAE SMARRGVPVA PHLLPELSGQ LALCLPRVAM VEDIDRASFA ALGALARPSG VEFDRGRVRA DTGHGHGLVF ADTLTPVADA SP
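The relationship between the gene sequence, the translation table, and the protein sequence can be sketched in a few lines. The example below is a minimal illustration, not the method used to produce this record. It assumes the amino acid assignments of translation table 11 (which match the standard code), that the initiator codon is read as M even when it is GTG, and that translation stops at the first stop codon. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering; index = 16*b1 + 4*b2 + b3.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in s[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        # Assumption: the initiator codon (here GTG) is translated as methionine.
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    prefix = "GTGCGCGTCA CCGAGGTCGG CGCCCGGGCC"
    print(translate(prefix))
    print(gc_percent("GTGCGCGTCA"))
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and GC content fields listed in this record.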