Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4149 |
Symbol | |
ID | 4596663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4383460 |
End bp | 4384989 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639778755 |
Product | purine catabolism PurC domain-containing protein |
Protein accession | YP_925333 |
Protein GI | 119718368 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism [T] Signal transduction mechanisms |
COG ID | [COG2508] Regulator of polyketide synthase expression |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGCCGA CCGTTCAGGA CCTGCTCGAC CTCCCGGTGC TGCGGCGCGC CCGGCCCGAG GTGGCCGTCG GCAGCCGCCT GGACGAACGG GAGGTGCGCT GGGTGCACAC CTCCGAGATC TATGAGATCT CACCGTTGCT CAAGGGCGGC GAGGTGCTCT TGACCACCGG GCTCGGCCTG GTGGGCGTCG GGGCGGGAGC GGTGGCGGCG TACGTCGAGG CGCTGGCTCG CCAAGGCGTC GCGGCGTTGG TGCTCGAGGT CGGCCGGACG TTCACCCACC CGCCCGAGGC GCTGGTGGCA GCGGCCCGGG CGCATGACCT GCCTCTCGTG CTGCTCCACG GAGTGGTCCC GTTCATCGAT GTCACGGAGA CGGTCTACCC GATGCTGATC GGCGGTGAGG TCGAGCAGCT GCGCGAGCTC GAGCGGGCCT CGACCAGGTT GCACGAGGCG CTCACCACGG GGGCTGGGCC GGGTGAGCTG CTGGCGCTGG TCGGTGACAT CTGCGGCTCG CCGGCCGGGC TGTACACCTC GGCCGGTGAC CTGCTGGGCG GGGAAGACGT GCGCTCCCGG GAGGGTGGGA CGCTCGAGTT CGACGTGGGT CGGAGTCCTT GGGCGGTGCT CGCCCTGCCA GCGTCGCAAC GCTCGACCCA CACCGGCGTG AGGCGGTTGG CCGAGTTGTG CGCGACCATG ATCGACATCC GCCTGGGCGC CATGTTCCGA GCCGGCCACC GCCCCGGGGC CGACGCCGAC CTGGTCCGCA TGCTGGCGGC GGGACAGTAC CTCTCCAGCG CGGACATCGA GGTGCAATCA CGCGCCGCCG GACTCGTCGT GCGCCCGGGT TGGCGAGCCG TCGGCCTCGC GGTGGACCTG CGCCTGCCAA GCTCGCTGCG CCCCGGTCTC AACGCCACCA TCGAGGCCGC GCGGTCGGTG TTCGGAGTGT CCGCGGTGGC CGAGCTGGAC CGCGAGTTCG TCGTGGCGAC CACGGTACGG CCAGCGGAGC TGCGCTCGCG CCTGTCCTCG TTCGTCGACG CCTTGGAGCG CGAGCTGCAC GCCGCCGTGG GCACAGCGGC GATCCGGGTC TCGGCGGGCC ATCCGGTTGG AGACGCCGCG GGTTTGGCTC GATCGCTGCC GGCCGCCCTC GACGCGCTCC ACCTGGCCCG CCGGCTCGGC CTGGGGTCAC GGACGGTGCT CGCCAGCGAC CTGGGGGTCT TCCACCTGCT CTCGAGCGCC ACCGCGGACG TCGAGCTCGA GCGGTTCGTC CAGGAGCAAC TCGGCGCGTT GCTCGAGCAC GATGCGCGGC ACGGCTCGGA CCTCGTCCAG ACCCTGGACG CATACCTGGA GGCGGGCCTG GGCAAGACGG CGGCGGCCCA GGCACTCGGG ATCCGCAGGC AGACCCTGTA TGCACGCTTG GAGCGGATCA GCAGGCTGCT CGGCGGCCTG GACATCGAGG CGCGTCAGGC GCGCACAGCG CTCGACCTCG CACTGGTGAG CTGGCGCCTG AGGACCTCGG CCGTCACCGG CCGGCCCTGA
|
Protein sequence | MAPTVQDLLD LPVLRRARPE VAVGSRLDER EVRWVHTSEI YEISPLLKGG EVLLTTGLGL VGVGAGAVAA YVEALARQGV AALVLEVGRT FTHPPEALVA AARAHDLPLV LLHGVVPFID VTETVYPMLI GGEVEQLREL ERASTRLHEA LTTGAGPGEL LALVGDICGS PAGLYTSAGD LLGGEDVRSR EGGTLEFDVG RSPWAVLALP ASQRSTHTGV RRLAELCATM IDIRLGAMFR AGHRPGADAD LVRMLAAGQY LSSADIEVQS RAAGLVVRPG WRAVGLAVDL RLPSSLRPGL NATIEAARSV FGVSAVAELD REFVVATTVR PAELRSRLSS FVDALERELH AAVGTAAIRV SAGHPVGDAA GLARSLPAAL DALHLARRLG LGSRTVLASD LGVFHLLSSA TADVELERFV QEQLGALLEH DARHGSDLVQ TLDAYLEAGL GKTAAAQALG IRRQTLYARL ERISRLLGGL DIEARQARTA LDLALVSWRL RTSAVTGRP
|
| |