Gene Noca_2244 details

Gene Information

Locus tag: Noca_2244
Symbol:
ID: 4597790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2392442
End bp: 2395588
Gene Length: 3147 bp
Protein Length: 1048 aa
Translation table: 11
GC content: 70%
IMG OID: 639776843
Product: peptidase M23B
Protein accession: YP_923436
Protein GI: 119716471
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
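
The Start bp, End bp, Gene Length and Protein Length values above agree with one another, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon; both assumptions fit the numbers in this record. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2392442, 2395588)
print(length)                  # 3147
print(protein_length(length))  # 1048
```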


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGC TGCTGCTCGC GGGACTGCCC GTCCTGATCA TCTTCATGGG GCTGCCGTTC 
CTCGTGACGC TGATGGTGGT GATGACGACC ACCGCGGCCG CCGAGTGCCG CACCCAGAGC
AGCCAAGGCA CCGCGCCTAC CGAGCTCGGT GACCTCGGAG CCATCGACGG CCCGGTGGGC
GGCCCGGTCA ACGGCAACAT CACCATGGCG CAGGCCAACA TCCCGCGCCG CTCCGGCCTC
GACGGGTTCC GGGCCTCCAT GCCCAAGGTC CTGTCCAAGA ACCCCGAGTT CGTCACCCTC
AACGAGGCCA GCGGCTGGAG CCTGGAGCAG ATCGAAGCCG CCGCCCCGGG CTACTCAGCC
TTCCGGGTGG CAGCCCCGGC CGGCACCGGC ACCGGCCCCG AGCAGGCCAT GGGCAACGTC
GTGCTCTGGA AGAGCTCGAC CTGGACGAAG GTCAACGGCG GCCGGGTCCA GCTCGTCGAC
GACGACAAGA CCTTCTACGA CGGGCGTCCG GTCACGTGGG ACCGCTTCGC GACATGGGTC
ATGCTGCGCC GCGCCGACGG CTCCGTGGTC TCGGTGGTCT CCACCCACCA CATGACCAAC
CCGCACCGTT GGCCGAAGCA GCACGGCAAC CCGCCGCTGA CCCGGCCCCA GCAGTACGGC
GCCGGAATGG ACATCCTGCT GCAGCTGCGC AACTCGCTGG CCGCCCACGG TCCGGTGCTG
ATCGGCGGCG ACATGAACAC CCAGGCCTCC TACACCGACA TCCCCTGGAC GGCGGCCGCG
AAGATGAAGG CCGCCGGCTA CGGATGGCAC AACCACGGCG TCGACTTCAT CTTCTTCCCG
CACCACCAGG GCGCCCGGCT CGAACAGGGC TGGGACGGCA CGATGGTTTC GGACCACCAC
TGGCTCTCGG CTCGCATCGC GATGAACGGC GCCGGCCCGG AGAGCGCGCC CGAGACCACC
ACCACGACTG ACGGGGTCGT GCCCGCGGCG ACCACCGCCC CGACGTCCGC CGAGCCGCCG
GCCGGCGACG TGCTCGCCCA GCTGATGCGG CTACGGTTCG CGTCCAACTA CCCGACCATG
ACCGACGAGC AGGCCCGCAA CGCGATCACC ATCGCCCAGG TCGCCCGCAA TCTCGAGATT
CCCCGCTACG GGCTGCAGAT CGCGATCGCC GCCGCGATCC AGGAGTCCAA GCTGGTCAAC
CTCACCGGAG GTGACCGCGA CTCCGGCGGC CTGTTCCAGC AGCGCCCCTC GGCCGGCTGG
GGAAGCCGGG CCGAGATCAC CAACGCCGTC CTCGCGGCCC GCGCGTTCTT CGGCCAGGCC
CAGCACACCG GCAACCCCGG GCTCCTCGAC ATCCCCGGCT GGCAGAACAT GCCGCTCACC
CAGGCCGCGC AGGCCGTCCA GCGCTCGGGC TACCCCGACG CCTACGCCCA GTGGGAGGAC
GTCGCCGGCG ACATCACCGA TCTGCTCGGC GGCGACCTGC CGGACCTGCC CGACGACGGC
TCCACCACGA ACGTCGCCAA CTGCCAGGGC GAGACCGTCA ACCCCATCAC CGTCGGCACC
CTCAACCTGC TCGGCGCCGG CCACACCGAC AAGCCGGGGG AGCGGGCCGG GTACGACACC
TGGGACAAGC GACTGCCCGG CGCCATGCGC ACGATCGAGA ACGCCGGCGT CACGATCACC
GGCCTCCAGG AGGTGCATGG CCCGCAGGCC CAGGCGCTGG AGAACCAGTA CGCGGCCAAG
TGGGGGATGT ACCCGGCCAG CGGGAAGGCA CAGAACCGGG TGATCTGGGA CCGCAACGAG
TGGGAGCAGA CCGACGGGCG CCTCGTCGGC ATCCCGTACT TCGGCGGGAA GGACGTCGGC
ATGCCGCTGG TGCAGCTGAC CTCGACGACC ACCGGGCAGG TGATCTGGGT CTGGAGCATC
CACAACCCGG CCAACACTCA AGGAAGCGCC GCCGGGCACC GCCAGGAGGC GCTGCGTCGC
CAGCTGGCCA CGATGACCGA GCTCGCCGGC ACCGGCACCC CTGCGGTGAT ACTGGGCGAC
TTCAACGACG GCAAGGACGG CAGCAACGCC TCGCACTGCG CACTGACCCC TGAGCTGAGC
AACGCCTTCG GCGGCTCTGC CGAGCCCTGC AAGAAGCCCA AGCAGGACGC GCCGATCGAC
CACGTCTACG GCGCGAACCT CACCTGGGCC GGCGCCGAGG TCGACACCAG CACCCAGGCC
AGCAAGATCG CCGACCACCC GCTGGTGACC GCCACCACCG CCGGCAGCAG CGCCGGGTGC
GCCGTCGACT CCGGTACAGC GGAGGCCAAG TACAACCTCG GCCCGGTCAA GCCGCAGCTG
ACCCAGCTGG TCAACATCCT CGGCCCCATG TTCGACATCA AGACCGTCGG CGGCTACCGC
GAGAGCGCCA CCGACCCCAA CGGCCACCCG GCCGGGCTCG CGGCCGATTT CATGGTGCCG
CTCAACGCGG CGGGCCGCGC GCAGGGCGAT CGTCTCGCCG CCTACGCCAA GGCCAATGCC
CAGAAGCTCG GCATCGACTA CATCATCTGG TACCAGCGGA TCTGGTCGGT CGCCCGCGTC
GGCGAGGGCT GGCGGCCGAT GGAGGACCGG GGGAGCGCTA CCGAGAACCA CCTCGACCAT
GTCCACATCA ACGTCAAGCC CGGCGCCTCC GTCCAGCCGG TCGGCCTCGA GGGCGCGTCC
TGCGACGAGG TCGTCTATCC GGTGCCCGCG CAGTACGTCG GAACCGACAA CCACAACTGG
CACGAGACCG GCGCGTACTG GTCCAAGTGG CACACCGGCA CCGACTTCTC CGCACCCTGC
GGAACCACCG TCTACGCTGC CCACGCCGGC ACCATCGAGA TCGACACCAC CCAGCGTTCC
TGGGCTGGGC CGCAGCTGGT CAAGGTCACC ACCGGCGCCG GGTCCCTGAC CACGTGGTAC
GCCCACATGG CGACCGTCAG CGTCAGCCGT GGCCAGACCG TCGCCGCCGG CGAGCCGATC
GGCCAGGTCG GCAAGGAGGG CAACGTCTCC GGCTGCCACC TCCACTTCGA GGTCCACCTC
AAGAACGGCT CCATCTACGG CCCCGACAAC GTCGATCCCT CGACCTGGCT CGCAGAGAAC
GCATCACGCC CGAGCCGCGC CGTGTGA
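
The GC content reported for this gene could be recomputed from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted as text in the same space-separated 10-base layout; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full block for the gene-wide value.
first_line = "ATGAAGAAGC TGCTGCTCGC GGGACTGCCC GTCCTGATCA TCTTCATGGG GCTGCCGTTC"
print(round(gc_percent(first_line), 1))
```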
 
Protein sequence
MKKLLLAGLP VLIIFMGLPF LVTLMVVMTT TAAAECRTQS SQGTAPTELG DLGAIDGPVG 
GPVNGNITMA QANIPRRSGL DGFRASMPKV LSKNPEFVTL NEASGWSLEQ IEAAAPGYSA
FRVAAPAGTG TGPEQAMGNV VLWKSSTWTK VNGGRVQLVD DDKTFYDGRP VTWDRFATWV
MLRRADGSVV SVVSTHHMTN PHRWPKQHGN PPLTRPQQYG AGMDILLQLR NSLAAHGPVL
IGGDMNTQAS YTDIPWTAAA KMKAAGYGWH NHGVDFIFFP HHQGARLEQG WDGTMVSDHH
WLSARIAMNG AGPESAPETT TTTDGVVPAA TTAPTSAEPP AGDVLAQLMR LRFASNYPTM
TDEQARNAIT IAQVARNLEI PRYGLQIAIA AAIQESKLVN LTGGDRDSGG LFQQRPSAGW
GSRAEITNAV LAARAFFGQA QHTGNPGLLD IPGWQNMPLT QAAQAVQRSG YPDAYAQWED
VAGDITDLLG GDLPDLPDDG STTNVANCQG ETVNPITVGT LNLLGAGHTD KPGERAGYDT
WDKRLPGAMR TIENAGVTIT GLQEVHGPQA QALENQYAAK WGMYPASGKA QNRVIWDRNE
WEQTDGRLVG IPYFGGKDVG MPLVQLTSTT TGQVIWVWSI HNPANTQGSA AGHRQEALRR
QLATMTELAG TGTPAVILGD FNDGKDGSNA SHCALTPELS NAFGGSAEPC KKPKQDAPID
HVYGANLTWA GAEVDTSTQA SKIADHPLVT ATTAGSSAGC AVDSGTAEAK YNLGPVKPQL
TQLVNILGPM FDIKTVGGYR ESATDPNGHP AGLAADFMVP LNAAGRAQGD RLAAYAKANA
QKLGIDYIIW YQRIWSVARV GEGWRPMEDR GSATENHLDH VHINVKPGAS VQPVGLEGAS
CDEVVYPVPA QYVGTDNHNW HETGAYWSKW HTGTDFSAPC GTTVYAAHAG TIEIDTTQRS
WAGPQLVKVT TGAGSLTTWY AHMATVSVSR GQTVAAGEPI GQVGKEGNVS GCHLHFEVHL
KNGSIYGPDN VDPSTWLAEN ASRPSRAV
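
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in its permitted start codons; this gene starts with ATG, so that does not matter here). `translate` is a hypothetical helper, not a database function.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with the standard amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
print(translate("ATGAAGAAGC TGCTGCTCGC GGGACTGCCC GTCCTGATCA TCTTCATGGG GCTGCCGTTC"))
# MKKLLLAGLPVLIIFMGLPF
print(translate("GCATCACGCC CGAGCCGCGC CGTGTGA"))
# ASRPSRAV
```

Both outputs match the start and end of the protein sequence listed above.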