Gene Information |
Locus tag | Noca_2244 |
Symbol | |
ID | 4597790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2392442 |
End bp | 2395588 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639776843 |
Product | peptidase M23B |
Protein accession | YP_923436 |
Protein GI | 119716471 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
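The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the Gene Information table for Noca_2244.
length_bp = gene_length(2392442, 2395588)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (3147 bp) and Protein Length (1048 aa).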
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGC TGCTGCTCGC GGGACTGCCC GTCCTGATCA TCTTCATGGG GCTGCCGTTC CTCGTGACGC TGATGGTGGT GATGACGACC ACCGCGGCCG CCGAGTGCCG CACCCAGAGC AGCCAAGGCA CCGCGCCTAC CGAGCTCGGT GACCTCGGAG CCATCGACGG CCCGGTGGGC GGCCCGGTCA ACGGCAACAT CACCATGGCG CAGGCCAACA TCCCGCGCCG CTCCGGCCTC GACGGGTTCC GGGCCTCCAT GCCCAAGGTC CTGTCCAAGA ACCCCGAGTT CGTCACCCTC AACGAGGCCA GCGGCTGGAG CCTGGAGCAG ATCGAAGCCG CCGCCCCGGG CTACTCAGCC TTCCGGGTGG CAGCCCCGGC CGGCACCGGC ACCGGCCCCG AGCAGGCCAT GGGCAACGTC GTGCTCTGGA AGAGCTCGAC CTGGACGAAG GTCAACGGCG GCCGGGTCCA GCTCGTCGAC GACGACAAGA CCTTCTACGA CGGGCGTCCG GTCACGTGGG ACCGCTTCGC GACATGGGTC ATGCTGCGCC GCGCCGACGG CTCCGTGGTC TCGGTGGTCT CCACCCACCA CATGACCAAC CCGCACCGTT GGCCGAAGCA GCACGGCAAC CCGCCGCTGA CCCGGCCCCA GCAGTACGGC GCCGGAATGG ACATCCTGCT GCAGCTGCGC AACTCGCTGG CCGCCCACGG TCCGGTGCTG ATCGGCGGCG ACATGAACAC CCAGGCCTCC TACACCGACA TCCCCTGGAC GGCGGCCGCG AAGATGAAGG CCGCCGGCTA CGGATGGCAC AACCACGGCG TCGACTTCAT CTTCTTCCCG CACCACCAGG GCGCCCGGCT CGAACAGGGC TGGGACGGCA CGATGGTTTC GGACCACCAC TGGCTCTCGG CTCGCATCGC GATGAACGGC GCCGGCCCGG AGAGCGCGCC CGAGACCACC ACCACGACTG ACGGGGTCGT GCCCGCGGCG ACCACCGCCC CGACGTCCGC CGAGCCGCCG GCCGGCGACG TGCTCGCCCA GCTGATGCGG CTACGGTTCG CGTCCAACTA CCCGACCATG ACCGACGAGC AGGCCCGCAA CGCGATCACC ATCGCCCAGG TCGCCCGCAA TCTCGAGATT CCCCGCTACG GGCTGCAGAT CGCGATCGCC GCCGCGATCC AGGAGTCCAA GCTGGTCAAC CTCACCGGAG GTGACCGCGA CTCCGGCGGC CTGTTCCAGC AGCGCCCCTC GGCCGGCTGG GGAAGCCGGG CCGAGATCAC CAACGCCGTC CTCGCGGCCC GCGCGTTCTT CGGCCAGGCC CAGCACACCG GCAACCCCGG GCTCCTCGAC ATCCCCGGCT GGCAGAACAT GCCGCTCACC CAGGCCGCGC AGGCCGTCCA GCGCTCGGGC TACCCCGACG CCTACGCCCA GTGGGAGGAC GTCGCCGGCG ACATCACCGA TCTGCTCGGC GGCGACCTGC CGGACCTGCC CGACGACGGC TCCACCACGA ACGTCGCCAA CTGCCAGGGC GAGACCGTCA ACCCCATCAC CGTCGGCACC CTCAACCTGC TCGGCGCCGG CCACACCGAC AAGCCGGGGG AGCGGGCCGG GTACGACACC TGGGACAAGC GACTGCCCGG CGCCATGCGC ACGATCGAGA ACGCCGGCGT CACGATCACC GGCCTCCAGG AGGTGCATGG CCCGCAGGCC CAGGCGCTGG AGAACCAGTA CGCGGCCAAG TGGGGGATGT ACCCGGCCAG CGGGAAGGCA CAGAACCGGG TGATCTGGGA CCGCAACGAG 
TGGGAGCAGA CCGACGGGCG CCTCGTCGGC ATCCCGTACT TCGGCGGGAA GGACGTCGGC ATGCCGCTGG TGCAGCTGAC CTCGACGACC ACCGGGCAGG TGATCTGGGT CTGGAGCATC CACAACCCGG CCAACACTCA AGGAAGCGCC GCCGGGCACC GCCAGGAGGC GCTGCGTCGC CAGCTGGCCA CGATGACCGA GCTCGCCGGC ACCGGCACCC CTGCGGTGAT ACTGGGCGAC TTCAACGACG GCAAGGACGG CAGCAACGCC TCGCACTGCG CACTGACCCC TGAGCTGAGC AACGCCTTCG GCGGCTCTGC CGAGCCCTGC AAGAAGCCCA AGCAGGACGC GCCGATCGAC CACGTCTACG GCGCGAACCT CACCTGGGCC GGCGCCGAGG TCGACACCAG CACCCAGGCC AGCAAGATCG CCGACCACCC GCTGGTGACC GCCACCACCG CCGGCAGCAG CGCCGGGTGC GCCGTCGACT CCGGTACAGC GGAGGCCAAG TACAACCTCG GCCCGGTCAA GCCGCAGCTG ACCCAGCTGG TCAACATCCT CGGCCCCATG TTCGACATCA AGACCGTCGG CGGCTACCGC GAGAGCGCCA CCGACCCCAA CGGCCACCCG GCCGGGCTCG CGGCCGATTT CATGGTGCCG CTCAACGCGG CGGGCCGCGC GCAGGGCGAT CGTCTCGCCG CCTACGCCAA GGCCAATGCC CAGAAGCTCG GCATCGACTA CATCATCTGG TACCAGCGGA TCTGGTCGGT CGCCCGCGTC GGCGAGGGCT GGCGGCCGAT GGAGGACCGG GGGAGCGCTA CCGAGAACCA CCTCGACCAT GTCCACATCA ACGTCAAGCC CGGCGCCTCC GTCCAGCCGG TCGGCCTCGA GGGCGCGTCC TGCGACGAGG TCGTCTATCC GGTGCCCGCG CAGTACGTCG GAACCGACAA CCACAACTGG CACGAGACCG GCGCGTACTG GTCCAAGTGG CACACCGGCA CCGACTTCTC CGCACCCTGC GGAACCACCG TCTACGCTGC CCACGCCGGC ACCATCGAGA TCGACACCAC CCAGCGTTCC TGGGCTGGGC CGCAGCTGGT CAAGGTCACC ACCGGCGCCG GGTCCCTGAC CACGTGGTAC GCCCACATGG CGACCGTCAG CGTCAGCCGT GGCCAGACCG TCGCCGCCGG CGAGCCGATC GGCCAGGTCG GCAAGGAGGG CAACGTCTCC GGCTGCCACC TCCACTTCGA GGTCCACCTC AAGAACGGCT CCATCTACGG CCCCGACAAC GTCGATCCCT CGACCTGGCT CGCAGAGAAC GCATCACGCC CGAGCCGCGC CGTGTGA
|
Protein sequence | MKKLLLAGLP VLIIFMGLPF LVTLMVVMTT TAAAECRTQS SQGTAPTELG DLGAIDGPVG GPVNGNITMA QANIPRRSGL DGFRASMPKV LSKNPEFVTL NEASGWSLEQ IEAAAPGYSA FRVAAPAGTG TGPEQAMGNV VLWKSSTWTK VNGGRVQLVD DDKTFYDGRP VTWDRFATWV MLRRADGSVV SVVSTHHMTN PHRWPKQHGN PPLTRPQQYG AGMDILLQLR NSLAAHGPVL IGGDMNTQAS YTDIPWTAAA KMKAAGYGWH NHGVDFIFFP HHQGARLEQG WDGTMVSDHH WLSARIAMNG AGPESAPETT TTTDGVVPAA TTAPTSAEPP AGDVLAQLMR LRFASNYPTM TDEQARNAIT IAQVARNLEI PRYGLQIAIA AAIQESKLVN LTGGDRDSGG LFQQRPSAGW GSRAEITNAV LAARAFFGQA QHTGNPGLLD IPGWQNMPLT QAAQAVQRSG YPDAYAQWED VAGDITDLLG GDLPDLPDDG STTNVANCQG ETVNPITVGT LNLLGAGHTD KPGERAGYDT WDKRLPGAMR TIENAGVTIT GLQEVHGPQA QALENQYAAK WGMYPASGKA QNRVIWDRNE WEQTDGRLVG IPYFGGKDVG MPLVQLTSTT TGQVIWVWSI HNPANTQGSA AGHRQEALRR QLATMTELAG TGTPAVILGD FNDGKDGSNA SHCALTPELS NAFGGSAEPC KKPKQDAPID HVYGANLTWA GAEVDTSTQA SKIADHPLVT ATTAGSSAGC AVDSGTAEAK YNLGPVKPQL TQLVNILGPM FDIKTVGGYR ESATDPNGHP AGLAADFMVP LNAAGRAQGD RLAAYAKANA QKLGIDYIIW YQRIWSVARV GEGWRPMEDR GSATENHLDH VHINVKPGAS VQPVGLEGAS CDEVVYPVPA QYVGTDNHNW HETGAYWSKW HTGTDFSAPC GTTVYAAHAG TIEIDTTQRS WAGPQLVKVT TGAGSLTTWY AHMATVSVSR GQTVAAGEPI GQVGKEGNVS GCHLHFEVHL KNGSIYGPDN VDPSTWLAEN ASRPSRAV
|
| |