Gene Information |
Locus tag | Ndas_1868 |
Symbol | |
ID | 9245718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2280829 |
End bp | 2282037 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_003679802 |
Protein GI | 297560828 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
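The length fields in this record follow from the coordinates. The minimal sketch below checks that arithmetic, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2280829
END_BP = 2282037


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```

Both values agree with the Gene Length (1209 bp) and Protein Length (402 aa) fields.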
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.501601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTACT ACCGCCAGGT CGGCGAGGTG CCCCGCACCC GCCACACCCA GCACCGCACC CCCGAGGGCG GGCTGTACTA CGAGGAGCTG ATGGGGGAGG AGGGCTTCTC CGCCGACTCC TCGCTGCTCT ACCACCGCGC CATCCCCTCG GCCATCGTCG ACGCCGCCGA GTGGGAGGTC CCCGACCAGT CCCGCACCCG CAACGCCCCG ATGGTGCCGC GCCACCTCAA ACTGCACGAG CTGTTCTCCG ACCAGGAGTG GAAGGCCGCC GACGTCGTCG AGTCGCGCCG CCTGGTCCTG GGCAACGACG ACGTGCGCAT CTCCTACGTG GTCGCCGGGG CGCCCTCGGA GCTGTACCGC AACGGCATCG GCGACGAGTG CGTGTACGTG GAGTCGGGCA CCGCCCGCGT GGAGACCGTC TTCGGTCTGC TGGAGGTCGG CCAGGGCGAC TACGTCGTCC TGCCGCGCGC CACCACCCAC CGCTGGGTGC CCACCGGCGA CCAGCCCCTG CGCGCCTACG TGGTGGAGGC CAACAGCCAC ATCGCCCCGC CCAAGCGCTA CCTGTCCCGC TACGGGCAGT TCCTGGAGCA CGCACCCTAC TGCGAACGCG ACCTGCGCGG CCCCTCCGAG CCGCTGCTGG CCGAGGGCGA GAACGTCGAC GTGCTCATCA AGCACCGCGG CGACGGTCCC GGGGGCATCT CCGGCACCCG ATACACCTAC CCCACCCACC CCTTCGACGT CGTGGGCTGG GACGGGTTCC TGTACCCCTA CGTCTTCAAC GTCGCCGACT TCCAGCCCAT CACCGGCCGC GTCCACCAGC CGCCGCCCGT GCACCAGGTG TTCGAGGGCC ACAACTTCGT CATCTGCAAC TTCGTGCCGC GCAAGGTGGA CTACCACCCC GGCGCCATCC CGGTGCCCTA CTACCACTCC AACGTGGACT CCGACGAGGT CATGTTCTAC TGCGGCGGCG ACTACGAGGC CCGCAAGGGC TCCGGGATCG GCCAGGGCTC CATCTCCCTG CACCCCGGCG GCCACTCCCA CGGCCCCCAG CCCGGCGCCT ACGAGCGCAG CATCGGCGCC GAGGCCTTCG ACGAACTCGC CGTCATGGTC GACACCTTCC GCCCGCTCGA CCTGGGCGAG GGCGGCCTGG CCAGCGACGA CGGAGTCTAC GCCTGGACCT GGGCCGGGGG ACGGAGGGAG CGGCGGTGA
|
Protein sequence | MAYYRQVGEV PRTRHTQHRT PEGGLYYEEL MGEEGFSADS SLLYHRAIPS AIVDAAEWEV PDQSRTRNAP MVPRHLKLHE LFSDQEWKAA DVVESRRLVL GNDDVRISYV VAGAPSELYR NGIGDECVYV ESGTARVETV FGLLEVGQGD YVVLPRATTH RWVPTGDQPL RAYVVEANSH IAPPKRYLSR YGQFLEHAPY CERDLRGPSE PLLAEGENVD VLIKHRGDGP GGISGTRYTY PTHPFDVVGW DGFLYPYVFN VADFQPITGR VHQPPPVHQV FEGHNFVICN FVPRKVDYHP GAIPVPYYHS NVDSDEVMFY CGGDYEARKG SGIGQGSISL HPGGHSHGPQ PGAYERSIGA EAFDELAVMV DTFRPLDLGE GGLASDDGVY AWTWAGGRRE RR
|
| |
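The gene and protein sequences can be compared with a small script. The sketch below is illustrative only: the helper names are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as starts). It does not handle alternative start codons such as GTG or TTG, which is adequate here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGGCCTACT ACCGCCAGGT CGGCGAGGTG"))  # MAYYRQVGEV
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values that can be checked against the Protein sequence and GC content fields.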