Gene Information |
Locus tag | Noca_0467 |
Symbol | |
ID | 4597366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 498338 |
End bp | 499204 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 639775081 |
Product | HhH-GPD family protein |
Protein accession | YP_921696 |
Protein GI | 119714731 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
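The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is a minimal consistency check, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute a residue to the protein.
    """
    gene_length = end_bp - start_bp + 1
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Coordinates copied from the Gene Information table for Noca_0467.
gene_length, protein_length = cds_lengths(498338, 499204)
print(gene_length, protein_length)  # 867 288
```

Both values agree with the Gene Length (867 bp) and Protein Length (288 aa) fields listed above.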
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.31103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAGC TCCATGCACC CGTGCTGCAG TGGTACGACG AGCACGCCCG CGACCTGCCC TGGCGACGCG CGGAGGCCGG TCCGTGGTCG GTGCTGGTCT CGGAGTTCAT GCTGCAGCAG ACCCCCGTCG CCCGGGTGCT GCCTGTCCAC GAGCAGTGGC TGGCCCGGTG GCCGCAGCCG GCCGACCTCG CGGCCGAGTC CGCCGGCGAG GCGGTGCGCG CCTGGGGTCG CCTGGGCTAC CCACGCCGAG CGCTGCGCCT GCACGCCGCG GCCACGGCGA TCCTCGAGCG GCACGACGGC GCCGTGCCGT CGTCGTACGA CGACCTGATC GCGCTGCCCG GGGTCGGCGA CTACACCGCC GCGGCGATCG CGGTGTTCGC GTACGGGCGG CGGCACGTCG TGCTCGACAC CAACGTGCGG CGGGTGCTCA CGCGGACGCT GCAGGGCGTG GAGTTCCCGG CGCCGTCGGT GACCCGCGCC GAGCGCGAGC TGGCCCTGGC GGTGCTGCCC GCGGACGAGC CGACCGCGGC GACCTGGTCG GTCGCGGTGA TGGAGCTCGG GGCGCTCGTG TGCACGGCGG CCAACCCGCG GTGCGCCGAC TGCCCGGTCG CCCGGCTGTG CGCGTGGCGC ACGGCCGGCT ACCCGGCGTA CGACGGGCCC AAGCGCCTGG TGCAGACCTG GGCCGGCACC GACCGCCAGT GCCGCGGCCG GCTGCTCGCC GTGCTCCGCG AGGACGACGG GCCCGTGCAC CGCAGCCGGC TCGACGCCGT CTGGTCGGAG GAGGCCCAGC GGGTCCGCTG CCTCGCCTCG CTGGCGGCCG ACGGGCTGGT CGTCCACGTC GGCCCGGACG CCTACGCGCT GCCCTGA
|
Protein sequence | MNELHAPVLQ WYDEHARDLP WRRAEAGPWS VLVSEFMLQQ TPVARVLPVH EQWLARWPQP ADLAAESAGE AVRAWGRLGY PRRALRLHAA ATAILERHDG AVPSSYDDLI ALPGVGDYTA AAIAVFAYGR RHVVLDTNVR RVLTRTLQGV EFPAPSVTRA ERELALAVLP ADEPTAATWS VAVMELGALV CTAANPRCAD CPVARLCAWR TAGYPAYDGP KRLVQTWAGT DRQCRGRLLA VLREDDGPVH RSRLDAVWSE EAQRVRCLAS LAADGLVVHV GPDAYALP
|
| |
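The sequence fields are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence using the amino acid assignments of translation table 11. It is illustrative only: the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and the sample is just the first three blocks of the gene sequence rather than the full record. The listed gene sequence begins with ATG, so it appears to be given in coding orientation despite the minus strand annotation; the sketch therefore assumes no reverse complement is needed.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
# These assignments match the standard code; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw):
    """Remove the block spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
sample = clean_sequence("ATGAACGAGC TCCATGCACC CGTGCTGCAG")
print(translate(sample))  # MNELHAPVLQ
print(round(gc_fraction(sample), 2))
```

The translated sample matches the first 10 residues of the listed protein sequence. Passing the full gene sequence through the same functions would allow the GC content and protein sequence fields to be checked in the same way.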