Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1810 |
Symbol | |
ID | 4597647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 1926139 |
End bp | 1929579 |
Gene Length | 3441 bp |
Protein Length | 1146 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639776409 |
Product | peptidase M36, fungalysin |
Protein accession | YP_923009 |
Protein GI | 119716044 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGCT CCCCCCTGCA TCGTGGCGGC TATCGGACCG CCGCGCTGGC CTTGCTGACC GCCACCCTGG TCCCGGGCCT CGCCCAGCTC CCCGCCTTCG CCACGAGCCC CACCGCGGCT CTCGCCAGCC CGAAGACCGA CGGTCGCGAC GGGGTCGTGC ACCCCCTCGG CGGCCTCGGC GACCCGGTCG CCGGCCTCAC CGACCTCGAC GCCCGTGGCA CCGCGCTCCC CTCCGCAGCC CAGCGCAGCG CCGCTGCCCA GCTCGGCGCC GTCGACCTGC GCTGGAACCA GTTCGGTACG CCGTCCTCGA TCCTGCCGGC CGACGGCGTG CTCGCCCGGG CGTCGTCGAA CGACCCGGTC GCCGCCGCGC GCTCGTGGCT GGCCGACAAC GCGGCGGTCT TCGGACTGAC CCGCGACGCG GTCGCCGGAA TGGAGCTGGT CAACGACCAG CGGCTCGTCC AGAGCGACGC GCATGCCGTG CTCTTCCGCC AGCGGTTCGG CGAGCTGACG CCGGCCCTGG CGAGCATGGT CACCGTCGGC GTCGCCAACG GCGAGATCGC CTACGCCTCC TCGTCCCTCA CGAAGACCAC CGGCACCCCG CCGGCCGCGA CCCTGAGCCC GGTCCAGGCG TGGGTGAAGG CGGCCGCGAA CGTCGGCCGC ACGCTCGACG CCGGCGCACT CGACAAGATC ACCTCGCAGG TCGCCGACGG CTGGACCCGC CTGTCCGTCC CGGGCTACCC GCAGCAGCAG CTGGTCCGGC TGCGTGCGCT CGCGCTCGCC GACGGCTCGG TCCGGCCCGT GTTCGAGACC AACGTGCTCG ACGCCCAGTT GGGTTCGGCG TTCGCGTACA CGCTGATGGT CGACGGCGTG AGCGGCGACA TCCTGCACCG GCAGAACCAG GCCGAGAACA GCAGCGACGC CTTCCAGTTC CAGGGCGAGG TCACCGCCAC GGACTGCGGC CCGAAGCACG AGTTCGAGCT CACCGACGGC AACACCAAGC AGATCGCGGC GATGGCGGCG GCGGCCAACA CCCTCAACGA CATCGAGGTC AAGATCTTCG ACCCGAACGA CCAGCTCCTG GTCACCGGCG ACCTGGGCAC CAGCCCCGAG ACCGCGACGT ACTCCGCCGA CTCGATCCCG GCAGGCATCT ACAAGCTCCA GGTCTGCCCC TTCGACAACC CGACCGTCCC GTTCCTCCCG CCGGGCAACT ACGTCGCGAG CGTGACCACG AGCGACACCG CCGGCCCGGC CACGGGTGAC ATCGGGTTCC CGTCGAAGTG GCGCTACTTC ACGGCCAACC CCAGCCTGGA CTACTCCGCG AAGACCGTCC CGTCGAACTC CGTGATCGGC TGCTGGATCG CCGGCGAGGG CTGCACCTCG CCGACCGGCC CGTTCCGCAA CGTCGCGGCG CCGGGACCGT GGGACGCCAC CGCCAACGGC GTCGGCACGA TGACCACGGT CGGCAACAAC GCCAACACCC ACGAGGCCTG GGTCAGCCCG CTGACACCGG GCGGCACGGC GCAGGCGCCG ATCTCGCCGA CCCGCGAGTA CACGGCTGCG TTCACCGACG CGTGGAACAA CTCCGGCTGC GACCCGGCCC AGCTGACGCC CGGCGGCAAC GACATCGACG CCACCGTGAC GAACCTGTTC GTGGCGCACA ACCGGATGCA CGACTACTCG TACTACCTGG GCTTCACCGA GGACAACTAC AACATGCAGG CCGACAACCT CGGCCGCGGC GGCGTCGCCG GCGACCAGGA GATCGGCAAC GCCCAGGCCG GCGCGCTCAC CGGCGGGCAG CCGTCGTACC TCGGCCGCGA CAACGCCAAC CAGATCACCC TGCAGGACGG CGTGCCCGGC ATCACCAACC AGTACCTCTT CCAGCCGATC GCGGGCGCGT TCTACTCCCC GTGCGTCGAC GGCGGCCTGG ACATGGGGAT CGTCGGCCAC GAGTACACCC ACGCGATCAG CAACCGGATG GTCGGCGGCC CCGACGAGGG CCTCACCTCC GAGCAGGGCG GCGCGATGGG CGAGTCGTGG AGCGACCTGA TCGCCGGTGA GTACATGTTC AGCCACGGCT ACGCCAACGG CGGCAACCCC TGGGCCGTCG GCGTCTACGC CACCGGCAAC AAGTCGGTCG CGATCCGTGA CTACGCGATC AACAGGAACC CGCTCAACTA CTCCGACTAC GGCTTCGACA GCACCGGCAA CGAGGTGCAC GCCGACGGCG AGATCTGGAA CGGCACCCAG TGGGAGGTCC GCCAGGCGCT GGTCAAGAAG TGGAACAAGC AGTTCCCCTA CTCCGACAAG GCCCTCCAGC TGCGGTGCGC CCAGGCCACG CCGACCGCCA GCCCGCTGCC GGCCGGTCAG TGCCCGGGCA ACCGGCGGTG GGTCCAGCTG GTCTTCGACT CGTTCCTGCT CCAGCAGGGC GCCACGTCGA TGCTCGACGC CCGCGACGCG ATGCTCGCCG CCGACCGGAT GCGCTTCGGT GGCGAGGACC TGGACGTGAT GTGGAAGGCC TTCGCCCGCC GCGGTATGGG CCAGGGTGCG TCGATCCCCA ATGCCGACTC CGGCGACACC ACGCCCAGCT TCGCCTCGCC ACGCGAGCAG AACGGGACGG TCAGCTTCCG CAGCAACCGC TCCGGCCGGT TCTTCGTCGG CCACTACGAG GCCCGGGCGA CGCCGATCGC CGACACCTCG GACCAGGGCA AGGTCAAGAC CACCGCCCGG CTGGTCCCCG GTCGCTACGA GATGCTCTAC GTCTCCCAGG CCGGTGGCTT CAAGCGGTTC CACCTCACCG TGGAGCCCGG AGCCCAGACG GTCCACGTGA ACGCGCCGAC GAACCTCGCG GCGAGGAAGA ACGGCGCCAA GGTGCTCGAC GCCACGGCCG GCTCGCTGAA CGCGGTGTCC CTGCTCGACG GCACCGAGAA GACCGCGTGG GGCGGCGTCA CCGCCACCAA CGTCGACGAG TCGCACCCGT TCGTCGTGGT CGACCTCGCG GGCGGCGTGC ACACCGTGCG GCGGGTGCAG GTCAGCGCGA TGCTCAACCC GGCCCCGCCG GACCCGAACG AGATCCCGCT CGCCCAGGAC CCCGACCCCG ACTCGGGCTC CCGGTTCACC GCGCTGCGCC GGTTCGCGCT CGAGGCCTGC GTGAGCGACT GCGGCGAGGC GAAGGCGACC TGGAAGCGGT TCTACGTCTC CCCCGGCAAC GCCTTCCCGG CGGTGCGGCC GCGCCCGGTC GCCCCGAACC TGACGCTGCG CGCCTTCGAG GTCCCCGCGA CCCGGGCCGC CGCGATCCGG TTCGTGGTGC TCGAGAACCA GTGCACCGGC TACGCCGGGT ACGCCGGCGA GCAGGACAAC GACCCGATCA ACGACACCGA CTGCAAGACC GCCGCCGATC GCGGCACGAT CGTGCACGCG GCCGAGCTCC AGGTGTTCTG A
|
Protein sequence | MTRSPLHRGG YRTAALALLT ATLVPGLAQL PAFATSPTAA LASPKTDGRD GVVHPLGGLG DPVAGLTDLD ARGTALPSAA QRSAAAQLGA VDLRWNQFGT PSSILPADGV LARASSNDPV AAARSWLADN AAVFGLTRDA VAGMELVNDQ RLVQSDAHAV LFRQRFGELT PALASMVTVG VANGEIAYAS SSLTKTTGTP PAATLSPVQA WVKAAANVGR TLDAGALDKI TSQVADGWTR LSVPGYPQQQ LVRLRALALA DGSVRPVFET NVLDAQLGSA FAYTLMVDGV SGDILHRQNQ AENSSDAFQF QGEVTATDCG PKHEFELTDG NTKQIAAMAA AANTLNDIEV KIFDPNDQLL VTGDLGTSPE TATYSADSIP AGIYKLQVCP FDNPTVPFLP PGNYVASVTT SDTAGPATGD IGFPSKWRYF TANPSLDYSA KTVPSNSVIG CWIAGEGCTS PTGPFRNVAA PGPWDATANG VGTMTTVGNN ANTHEAWVSP LTPGGTAQAP ISPTREYTAA FTDAWNNSGC DPAQLTPGGN DIDATVTNLF VAHNRMHDYS YYLGFTEDNY NMQADNLGRG GVAGDQEIGN AQAGALTGGQ PSYLGRDNAN QITLQDGVPG ITNQYLFQPI AGAFYSPCVD GGLDMGIVGH EYTHAISNRM VGGPDEGLTS EQGGAMGESW SDLIAGEYMF SHGYANGGNP WAVGVYATGN KSVAIRDYAI NRNPLNYSDY GFDSTGNEVH ADGEIWNGTQ WEVRQALVKK WNKQFPYSDK ALQLRCAQAT PTASPLPAGQ CPGNRRWVQL VFDSFLLQQG ATSMLDARDA MLAADRMRFG GEDLDVMWKA FARRGMGQGA SIPNADSGDT TPSFASPREQ NGTVSFRSNR SGRFFVGHYE ARATPIADTS DQGKVKTTAR LVPGRYEMLY VSQAGGFKRF HLTVEPGAQT VHVNAPTNLA ARKNGAKVLD ATAGSLNAVS LLDGTEKTAW GGVTATNVDE SHPFVVVDLA GGVHTVRRVQ VSAMLNPAPP DPNEIPLAQD PDPDSGSRFT ALRRFALEAC VSDCGEAKAT WKRFYVSPGN AFPAVRPRPV APNLTLRAFE VPATRAAAIR FVVLENQCTG YAGYAGEQDN DPINDTDCKT AADRGTIVHA AELQVF
|
| |