Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3166 |
Symbol | |
ID | 4600151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3365367 |
End bp | 3366950 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639777772 |
Product | griselysin |
Protein accession | YP_924355 |
Protein GI | 119717390 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAGAA TCTCGGGCGT CGCCACGGCG GCGGTTCTCG CTCTCACCGC CGTCGGCATC CAGTCCGGCG CCAGCCAGGC CGTGACCGAG CGGCCGAACG CGGTCCAGGC CCTGCTCGCC CATCCGGGCG CGGCGCTGGC CAGCAACGGA ACGGCGTTCA CGGTCACCCA CACCGTGACC GACGCCGACG GCAGCACCCA CGTCCGCATG GACCGCACCT ACCGTGGCCT GCCCGTGCTC GGTGGGGACC TGGTCGTCCA CCGCGGTACC CAGGGCGGGT GGCGCGGGGT GAGCCAGACC CTCGAGAACG AGGTGCACGT CTCGACGACG CCGGCGGTCG GGAAGGCCGC CGCGGCCGTC CGGGCGCTCG CGCCGGCGAA GGCGACGCGC GGCACCACCG GCGCGAAGAC GCAGTCGACC CGCCTGGTCG TCGACGCGAC CACCGGCACT GCTCGGCTCG CGTGGGAGGT CATCACCGGC GGCACCCAGC AGGACGGCAC GCCGAGCCGG CTGGCGACGT ACGTCGACGC CCGGACCGGC GCGGTGATCC GCCGCGAGCA GCAGATCCAG ACCGCGGACG GCTCGGGCCA GTCGCTCTAC TCCGGGACCG TGCCGCTGCA GCTGACGCTG TCGGGCTCGA CGTACCAGCT CAAGGACCCG ACCCGGGGCA ACACCTACAC GACCGACATG GGCAACGCGA GCGACTCCCT CGGCTGCCAG TACTTCGGCT TCAACTGCAA GACCGGCACC CTGTTCACCA GCCCGGACAA CCTGTTCGGC AACGGCGCGA CGAGCAGCCG GGAGTCGGCC GCCGTGGACG CGCAGTACGG CACGAACATG ACGTGGGACT TCTACAAGTC GACCTACGGG CGCAACGGGA TCTTCGGGAC CGGCGCCGGC TCCTACAACC GGGTCCACTA CGGCAAGAAC TACGTCAACG CGTTCTGGGA CGGCACCAAG ATGACGTACG GCGACGGCGA CGGGACCAAC TACGGACCGT TGGTCTCGCT GGACGTGGCC GGTCACGAGA TGTCGCACGG CGTCACCGAG AACACCGCCG GACTGGCCTA CTCGGGCGAG TCCGGTGGTC TCAACGAGGC GACCTCGGAC ATCTTCGGCA CGATGGTGGA GTTCTTCGCC GCCAACGCCA ACGACCCGGG CGACTACCTG ATCGGCGAGG AGTTCGACCT CAAGAACCAC CTCGGCTTCC GGCGGATGGA CAACCCGGCC TCGGACGGCA GCTCCTTCAA CTGCTGGTCG TCGACCGTCG GGAGCGCCGA CGTCCACTAC TCCTCGGGCG TCGGGAACCA CTTCTTCTAC CTGCTCGCCG AGGGCTCGGG CGCCAAGACC ATCGGCGGCG TCGCCCACAA CAGCCCGACC TGCAACGGCT CGACGGTGAC CGGCATCGGC CGGGACGCCG CGAGCGCGAT CTGGTACCGC GCGCTCACGG TCTACATGAC CTCCAGCACC AGCTACGCCG GCGCCCGCAC CGCCACGTTG AACGCGGCGC GGGACCTGTA CGGCGCGGGC AGCGCGCAGC AGAACGCCGT GGCCGCCGCG TGGAGCGCGG TCAGCGTCAA CTGA
|
Protein sequence | MRRISGVATA AVLALTAVGI QSGASQAVTE RPNAVQALLA HPGAALASNG TAFTVTHTVT DADGSTHVRM DRTYRGLPVL GGDLVVHRGT QGGWRGVSQT LENEVHVSTT PAVGKAAAAV RALAPAKATR GTTGAKTQST RLVVDATTGT ARLAWEVITG GTQQDGTPSR LATYVDARTG AVIRREQQIQ TADGSGQSLY SGTVPLQLTL SGSTYQLKDP TRGNTYTTDM GNASDSLGCQ YFGFNCKTGT LFTSPDNLFG NGATSSRESA AVDAQYGTNM TWDFYKSTYG RNGIFGTGAG SYNRVHYGKN YVNAFWDGTK MTYGDGDGTN YGPLVSLDVA GHEMSHGVTE NTAGLAYSGE SGGLNEATSD IFGTMVEFFA ANANDPGDYL IGEEFDLKNH LGFRRMDNPA SDGSSFNCWS STVGSADVHY SSGVGNHFFY LLAEGSGAKT IGGVAHNSPT CNGSTVTGIG RDAASAIWYR ALTVYMTSST SYAGARTATL NAARDLYGAG SAQQNAVAAA WSAVSVN
|
| |