Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0657 |
Symbol | |
ID | 4599520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 696778 |
End bp | 697992 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639775256 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_921870 |
Protein GI | 119714905 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCACT ACCGACGGAT CGGTGCGGTC CCGCCGAAGC GGCACACGCA GGCCCGCGAC CCCGACGGCC GGCTCTACTA CGAGGAGCTG ATGGGCGAGG AGGGATTCTC CTCGGACTCG TCGCTGCTCT ACCACCGCGG CGTGCCCTCC GCCATCGTCG CCGCGGAGCC GTGGGAGCTC CCGGACCAGA GCCGCACGCC CAACCACCCG CTCAAGCCGC GGCACCTGCG GCTCCACGAC CTCGAGACCG GCGGCGACGC CGTGACCAGC CGTCGGCTCG TGCTCGCCAA CGCCGACGTG CGGATCTCCT ACGTGGTCGC CGGGTCCGAG CCCTCCGCCT ACTACCGCAA CGCGATCGGC GACGAGTGCG TGTACGTCGA GTCCGGGGCC GGCGTGGTCG AGACCGTGTT CGGCGTGGTG GGCTACCGCG CCGGCGACTA CGTCGTGATC CCGCGCGCGA CCACGCACCG GTGGGTGCCG GCCCCTGGGT CCGAGGACCC GAGCCGGCTC TACGCGATCG AGGCGAACAG CCACATCGCC CCGCCCAAGC GCTACCTGTC CCGCTACGGC CAGCTGCTCG AGCACGCGCC GTACTGCGAG CGGGACCTGT ACGGCCCGAC CCAGCCGTTC ACGGCCGACG GCGGCGACGT CGACGTCCTC GTCAAGCACC GGACCGGCGG CGGGATCGTC GGGACCCGGA TGACCTACGC GACGCACCCC TTCGACGTGG TCGGTTGGGA CGGCTGCCTG TACCCCTACA CGCTCAACAT CGAGGACTAC ATGCCGATCA CCGGCAAGGT GCACCAGCCG CCACCGGTGC ACCAGGTCTT CGAGGGGCAC AACTTCGTGG TCTGCAACTT CCTGCCGCGC AAGGTCGACT ACCACCCGCT CGCGATCCCG GTGCCGTACT ACCACTCCAA CGTCGACAGC GACGAGGTGA TGTTCTACGT CGGCGGCGAC TACGAGGCGC GCAAGGGCTC CGGCATCCGC ATCGGGTCGA TCTCGCTGCA CCCCGGCGGA CACGCCCACG GGCCGCAGCC CTCGGCGATC GAGGCCTCGC TCGGGGTGGA GTACTTCGAG GAGTCGGCGG TCATGGTCGA CACCTTCGCC CCCCTCGACC TCGGCGAGGC GGGCCTCGCG GTCGAGGACC CGGCGTACGC GTGGAGCTGG GCCGGGCGGG GGCCCGAGGA CCCGCCGGTC TTCTCCAACT CGTGA
|
Protein sequence | MAHYRRIGAV PPKRHTQARD PDGRLYYEEL MGEEGFSSDS SLLYHRGVPS AIVAAEPWEL PDQSRTPNHP LKPRHLRLHD LETGGDAVTS RRLVLANADV RISYVVAGSE PSAYYRNAIG DECVYVESGA GVVETVFGVV GYRAGDYVVI PRATTHRWVP APGSEDPSRL YAIEANSHIA PPKRYLSRYG QLLEHAPYCE RDLYGPTQPF TADGGDVDVL VKHRTGGGIV GTRMTYATHP FDVVGWDGCL YPYTLNIEDY MPITGKVHQP PPVHQVFEGH NFVVCNFLPR KVDYHPLAIP VPYYHSNVDS DEVMFYVGGD YEARKGSGIR IGSISLHPGG HAHGPQPSAI EASLGVEYFE ESAVMVDTFA PLDLGEAGLA VEDPAYAWSW AGRGPEDPPV FSNS
|
| |