Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2002 |
Symbol | |
ID | 4598624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2144060 |
End bp | 2145205 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 639776606 |
Product | Dyp-type peroxidase family protein |
Protein accession | YP_923199 |
Protein GI | 119716234 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01412] Tat-translocated enzyme [TIGR01413] Dyp-type peroxidase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.117545 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGGAT CCGAGACATC CGCCCGGTCC GGATTCGGCC GGCGGCGGTT CCTCGGGTAC GCCGGGTCAG CGGTCGCGGG TGCGGCCGCC GGCACGGCCG GCGGTGCCGC CTGGGGCGCC GGTCGGGACC GCCGCGACGA CCCGACGCGG AGCGGACCGG GTGCGGAGCT CTCGCCGTAC GGCGTCCACC AGCCGGGCGT CGCCGCGCCG ACCCCGCCGG TGGTCGAGCT GGTCGCGCTC GACCTGCTGC CCGACGCCGA CCGCGACGCT CTCGGGCGGC TGCTCCGGGC GTGGACCGGC GACGTCGAGG CGCTGACCAC CGGGCGTCCG ACCCCCGGCG ACACCGAGCC GTGGCTGGCC GCGGCGCGCG CCGACCTGAC GATCACCGTC GGGCTCGGCC CGGGCGCCCT GGCCGCCGGG CGGATCGAGC CGCCGCGCGG GTTCGAGCCG GTGCCACCGA TGCGCCACGA CCGGCTCGAG GAGCGTTGGT CGGGCGGGGA CCTGGTGCTG GTCGTCGGCG GCCGCGAGGG GACGACGGTC GCGCACGCGG TGCGCCAGCT GGTCCGCGAC GCCCGGCCGT TCGCCCGGGA GCGTTGGCGC CAGGCCGGGT TCTGGAACGG CGTCGACGCC GAGGGCCGGC CGATGACGGG GCGCAACCTG TTCGGGCAGG TCGACGGGAC CGCCAACCCG GCGCCGGGCA CCGGGACCTT CGACGAGACC GTGTGGCTGC GCGAGCCGCC GTGGACCGGC GGCAGCACGC TGGTCGTACG CCGGATCGCG ATGGACCTGG ACACCTGGGC GGAGCTCACC CGCGACCGCC AGGAGCGGGC CCTGGGCCGC AGCCTCGACG ACGGCGCCCG GCTCGACGGC CCGGGTCCAC ACGCGCACGC CCGGCTCGCA CACCCGATGG AGAACGCCGG GGCCCGGATC TTCCGCAAGG GCGCCAGCTA CGCCACCGCG GAGGAGAGCG GCCTGCTGTT CTGCAGCTTC CAGGCCTCGG TCGCCGGGCA GTTCGTGCCG ATCCAGCGGT CCCTGGACCG CGCGGACGCG CTCAACACCT GGACCACGGC GACCGGCTCG GCGGTGTTCG TGGTGCTGCC CGGGTTCGAG CGCGGCGACT GGCTGGGGTC GACGGTGCTG CGATGA
|
Protein sequence | MAGSETSARS GFGRRRFLGY AGSAVAGAAA GTAGGAAWGA GRDRRDDPTR SGPGAELSPY GVHQPGVAAP TPPVVELVAL DLLPDADRDA LGRLLRAWTG DVEALTTGRP TPGDTEPWLA AARADLTITV GLGPGALAAG RIEPPRGFEP VPPMRHDRLE ERWSGGDLVL VVGGREGTTV AHAVRQLVRD ARPFARERWR QAGFWNGVDA EGRPMTGRNL FGQVDGTANP APGTGTFDET VWLREPPWTG GSTLVVRRIA MDLDTWAELT RDRQERALGR SLDDGARLDG PGPHAHARLA HPMENAGARI FRKGASYATA EESGLLFCSF QASVAGQFVP IQRSLDRADA LNTWTTATGS AVFVVLPGFE RGDWLGSTVL R
|
| |