Gene Information |
Locus tag | Noca_3657 |
Symbol | |
ID | 4595769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3882932 |
End bp | 3883978 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639778265 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_924844 |
Protein GI | 119717879 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
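The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names (`gene_length`, `protein_length`, `gc_fraction`) are hypothetical helpers written for this illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string; spaces are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record above (Start bp, End bp).
length = gene_length(3882932, 3883978)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Passing the full "Gene sequence" string from the record to `gc_fraction` gives a value that can be compared with the reported GC content; the spaces between the 10-base groups are stripped before counting.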
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCAGCG AACCCCTGGT CCTCGGCATC GAGACCTCGT GCGACGAGAC CGGTGTCGGC ATCGTCCGTG GGCACACCCT GCTCGCGGAC GCGGTGGCGA GCAGCGTCGA CGAGCACGCC CGCTTCGGCG GGGTGGTGCC CGAGGTCGCG AGTCGCGCCC ACCTCGAGGC GATGGTGCCG ACCATCGAAC GGGCCTGCGA GACGGCCGGC ATCCGCCTGT ACGACGTCGA CGCGATCGCG GTCACCAGCG GACCGGGGTT GGCCGGGGCG TTGATGGTGG GGGTAGCCGC CGCGAAGGCG CTCGCGGTCG GCCTCGGCAA GCCGATCTAC GGCGTGAACC ACCTCGCGGC GCACGTTGCC GTCGACCAGC TCGAGCACGG CCCGCTGCCC GAGCCCTGCC TCGCGCTGCT GGTCAGCGGC GGCCACTCCA GCCTGCTGCG GGTCGAGGAC GTCACCTCCG GGGTGGACCC GATGGGGGCG ACCATCGACG ACGCCGCCGG CGAGGCCTTC GACAAGGTGG CCCGGCTGCT CGGCCTGCCG TTCCCCGGTG GCCCCTACAT CGACCGTGCG GCCCGCGAGG GCAGCACCGT GTACGTCGAC TTCCCGCGCG GCCTGACCAG CCGCCGCGAC CTCGAGCGGC ACCGCTTCGA CTTCTCGTTC TCGGGCCTCA AGACCGCGGT CGCGCGGTGG GTCGAGGCAC GGGAGCGGTC CGGCGAGCCG GTGCCGGTGG CCGACGTGGC GGCGAGCTTC CAGGAGGCGG TCTGCGACGT GCTGACCCGC AAGGCGATCG ACGCGGCGTC CAGTGCGGGC ATCGAGGACC TCCTCATCGG CGGTGGGGTC GCCGCGAACT CCCGGCTGCG CGTGCTGGCG GAGGAGCGCG CCGCGGCGCG GGGGATCCGG GTCCGGGTGC CCCGTCCCGG CCTGTGCACC GACAACGGCG CCATGGTCGC CGCTCTGGGC GCCGAGATGG TCGCCCGCGG CCGCACCCCG TCCCCCCTGG ACCTCCCCGC CGACTCCTCG CTCCCCGTGA CCGAGGTTCT CGTCTGA
|
Protein sequence | MSSEPLVLGI ETSCDETGVG IVRGHTLLAD AVASSVDEHA RFGGVVPEVA SRAHLEAMVP TIERACETAG IRLYDVDAIA VTSGPGLAGA LMVGVAAAKA LAVGLGKPIY GVNHLAAHVA VDQLEHGPLP EPCLALLVSG GHSSLLRVED VTSGVDPMGA TIDDAAGEAF DKVARLLGLP FPGGPYIDRA AREGSTVYVD FPRGLTSRRD LERHRFDFSF SGLKTAVARW VEARERSGEP VPVADVAASF QEAVCDVLTR KAIDAASSAG IEDLLIGGGV AANSRLRVLA EERAAARGIR VRVPRPGLCT DNGAMVAALG AEMVARGRTP SPLDLPADSS LPVTEVLV
|
| |