Gene Information |
Locus tag | Namu_3841 |
Symbol | |
ID | 8449460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4212828 |
End bp | 4213859 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645042891 |
Product | intradiol ring-cleavage dioxygenase |
Protein accession | YP_003203127 |
Protein GI | 258653971 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | |
|
|
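The length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4212828
end_bp = 4213859

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1  # drop the stop codon

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the results agree with the reported Gene Length (1032 bp) and Protein Length (343 aa).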
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.353412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.439298 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGAG CACCGCAGAA CACCCCGACT TACGAGGGTC GAGCCCTCCC CCGGCCGGAC GAGGAGGTCA TGGACCAGGG CCTGTCCTTC GATATGCAGA CGATGCTCGG CCGCCGGCAG ATGCTCCGGG CGGTCGGCCT GGGGGCGGTC GCCGTCGGGC TGGCCGCCTG CGGCGCCGGC ACGACCACCG CATCAACGTC GGCGTCGACC AACGGGTCGA CGACCGGGTC GACCGCCGGG TCGACCGCCG CCAGCCCGTC GAGCAGTGCG GCCGGCAGCG CCAGCAGTTC GAGCGCGGCC GCCGCGACGC TGACCGAGAT CCCGGACGAG ACCGCCGGCC CGTACCCCGG GGACGGCTCC AACGGGCCGG ACGTGCTGTC CCAGAGCGGC ATCGTGCGCA GCGACATCCG CTCCAGCTTT GCCGGCTCGA CCGGCGTCGC CGAGGGCGTG CCGATGACGC TCACCCTGAG CATCAAGGAC ATGGTCAACG GCAACGCCCC GTTCGCCGGG GTCGCGGTCT ACGTCTGGCA TTGCGATCGG GAGGGCCGGT ACTCCCTGTA CAGCGACGGG GTGACCGAGG AGAACTACCT GCGCGGCGTC CAGGTCGCCG GAGCGGACGG CACGGTCAGC TTCACCAGCA TCTACCCGGC CACCTACTCC GGCCGCTGGC CGCACATCCA CTTCGAGGTC TACCCGGACG TCGCCAGCAT CACCGACTCG ACCAACGCCA TCGCCACCTC GCAGGTGGCC ATGCCCAGCG ACGTCAGCGA CCTGGTCTAC CAGCAACCCG GGTACGAGCA GTCGATCATC AACAAGGCCC AGATCAGCCT GACCAGCGAC AACGTCTTCG GCGACGACGG CGGCATCCAC GAGCTGGGCA CCGCGACCGG GGACGTCGCC AGCGGCTACC ACGTCACTCT GGACGTGCCG GTCGACACCT CGACCGCGCC GACGGCCGGC AGCGCCCCCG CGGGCGGCCC CGGCGGTGGC CAGGGCGGTG GCCAAGGTGG TGGCCAGCCG CCGAGCCGCT GA
|
Protein sequence | MRRAPQNTPT YEGRALPRPD EEVMDQGLSF DMQTMLGRRQ MLRAVGLGAV AVGLAACGAG TTTASTSAST NGSTTGSTAG STAASPSSSA AGSASSSSAA AATLTEIPDE TAGPYPGDGS NGPDVLSQSG IVRSDIRSSF AGSTGVAEGV PMTLTLSIKD MVNGNAPFAG VAVYVWHCDR EGRYSLYSDG VTEENYLRGV QVAGADGTVS FTSIYPATYS GRWPHIHFEV YPDVASITDS TNAIATSQVA MPSDVSDLVY QQPGYEQSII NKAQISLTSD NVFGDDGGIH ELGTATGDVA SGYHVTLDVP VDTSTAPTAG SAPAGGPGGG QGGGQGGGQP PSR
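The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content field and the terminal stop codon could be checked. The helper names `clean_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical. The stop codon set (TAA, TAG, TGA) is the one used by translation table 11. Only the first 20 and last 12 bases of the gene sequence are used here, to keep the example short.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()

def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

def ends_with_stop(seq: str) -> bool:
    """Return True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS

# First two blocks and the tail of the gene sequence, copied from the record.
prefix = clean_sequence("ATGCGACGAG CACCGCAGAA")
tail = clean_sequence("CCGAGCCGCT GA")

prefix_gc = gc_fraction(prefix)   # 12 of these 20 bases are G or C
tail_is_stop = ends_with_stop(tail)
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared against the reported GC content.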