Gene Noca_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1810 
Symbol 
ID4597647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1926139 
End bp1929579 
Gene Length3441 bp 
Protein Length1146 aa 
Translation table11 
GC content71% 
IMG OID639776409 
Productpeptidase M36, fungalysin 
Protein accessionYP_923009 
Protein GI119716044 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGCT CCCCCCTGCA TCGTGGCGGC TATCGGACCG CCGCGCTGGC CTTGCTGACC 
GCCACCCTGG TCCCGGGCCT CGCCCAGCTC CCCGCCTTCG CCACGAGCCC CACCGCGGCT
CTCGCCAGCC CGAAGACCGA CGGTCGCGAC GGGGTCGTGC ACCCCCTCGG CGGCCTCGGC
GACCCGGTCG CCGGCCTCAC CGACCTCGAC GCCCGTGGCA CCGCGCTCCC CTCCGCAGCC
CAGCGCAGCG CCGCTGCCCA GCTCGGCGCC GTCGACCTGC GCTGGAACCA GTTCGGTACG
CCGTCCTCGA TCCTGCCGGC CGACGGCGTG CTCGCCCGGG CGTCGTCGAA CGACCCGGTC
GCCGCCGCGC GCTCGTGGCT GGCCGACAAC GCGGCGGTCT TCGGACTGAC CCGCGACGCG
GTCGCCGGAA TGGAGCTGGT CAACGACCAG CGGCTCGTCC AGAGCGACGC GCATGCCGTG
CTCTTCCGCC AGCGGTTCGG CGAGCTGACG CCGGCCCTGG CGAGCATGGT CACCGTCGGC
GTCGCCAACG GCGAGATCGC CTACGCCTCC TCGTCCCTCA CGAAGACCAC CGGCACCCCG
CCGGCCGCGA CCCTGAGCCC GGTCCAGGCG TGGGTGAAGG CGGCCGCGAA CGTCGGCCGC
ACGCTCGACG CCGGCGCACT CGACAAGATC ACCTCGCAGG TCGCCGACGG CTGGACCCGC
CTGTCCGTCC CGGGCTACCC GCAGCAGCAG CTGGTCCGGC TGCGTGCGCT CGCGCTCGCC
GACGGCTCGG TCCGGCCCGT GTTCGAGACC AACGTGCTCG ACGCCCAGTT GGGTTCGGCG
TTCGCGTACA CGCTGATGGT CGACGGCGTG AGCGGCGACA TCCTGCACCG GCAGAACCAG
GCCGAGAACA GCAGCGACGC CTTCCAGTTC CAGGGCGAGG TCACCGCCAC GGACTGCGGC
CCGAAGCACG AGTTCGAGCT CACCGACGGC AACACCAAGC AGATCGCGGC GATGGCGGCG
GCGGCCAACA CCCTCAACGA CATCGAGGTC AAGATCTTCG ACCCGAACGA CCAGCTCCTG
GTCACCGGCG ACCTGGGCAC CAGCCCCGAG ACCGCGACGT ACTCCGCCGA CTCGATCCCG
GCAGGCATCT ACAAGCTCCA GGTCTGCCCC TTCGACAACC CGACCGTCCC GTTCCTCCCG
CCGGGCAACT ACGTCGCGAG CGTGACCACG AGCGACACCG CCGGCCCGGC CACGGGTGAC
ATCGGGTTCC CGTCGAAGTG GCGCTACTTC ACGGCCAACC CCAGCCTGGA CTACTCCGCG
AAGACCGTCC CGTCGAACTC CGTGATCGGC TGCTGGATCG CCGGCGAGGG CTGCACCTCG
CCGACCGGCC CGTTCCGCAA CGTCGCGGCG CCGGGACCGT GGGACGCCAC CGCCAACGGC
GTCGGCACGA TGACCACGGT CGGCAACAAC GCCAACACCC ACGAGGCCTG GGTCAGCCCG
CTGACACCGG GCGGCACGGC GCAGGCGCCG ATCTCGCCGA CCCGCGAGTA CACGGCTGCG
TTCACCGACG CGTGGAACAA CTCCGGCTGC GACCCGGCCC AGCTGACGCC CGGCGGCAAC
GACATCGACG CCACCGTGAC GAACCTGTTC GTGGCGCACA ACCGGATGCA CGACTACTCG
TACTACCTGG GCTTCACCGA GGACAACTAC AACATGCAGG CCGACAACCT CGGCCGCGGC
GGCGTCGCCG GCGACCAGGA GATCGGCAAC GCCCAGGCCG GCGCGCTCAC CGGCGGGCAG
CCGTCGTACC TCGGCCGCGA CAACGCCAAC CAGATCACCC TGCAGGACGG CGTGCCCGGC
ATCACCAACC AGTACCTCTT CCAGCCGATC GCGGGCGCGT TCTACTCCCC GTGCGTCGAC
GGCGGCCTGG ACATGGGGAT CGTCGGCCAC GAGTACACCC ACGCGATCAG CAACCGGATG
GTCGGCGGCC CCGACGAGGG CCTCACCTCC GAGCAGGGCG GCGCGATGGG CGAGTCGTGG
AGCGACCTGA TCGCCGGTGA GTACATGTTC AGCCACGGCT ACGCCAACGG CGGCAACCCC
TGGGCCGTCG GCGTCTACGC CACCGGCAAC AAGTCGGTCG CGATCCGTGA CTACGCGATC
AACAGGAACC CGCTCAACTA CTCCGACTAC GGCTTCGACA GCACCGGCAA CGAGGTGCAC
GCCGACGGCG AGATCTGGAA CGGCACCCAG TGGGAGGTCC GCCAGGCGCT GGTCAAGAAG
TGGAACAAGC AGTTCCCCTA CTCCGACAAG GCCCTCCAGC TGCGGTGCGC CCAGGCCACG
CCGACCGCCA GCCCGCTGCC GGCCGGTCAG TGCCCGGGCA ACCGGCGGTG GGTCCAGCTG
GTCTTCGACT CGTTCCTGCT CCAGCAGGGC GCCACGTCGA TGCTCGACGC CCGCGACGCG
ATGCTCGCCG CCGACCGGAT GCGCTTCGGT GGCGAGGACC TGGACGTGAT GTGGAAGGCC
TTCGCCCGCC GCGGTATGGG CCAGGGTGCG TCGATCCCCA ATGCCGACTC CGGCGACACC
ACGCCCAGCT TCGCCTCGCC ACGCGAGCAG AACGGGACGG TCAGCTTCCG CAGCAACCGC
TCCGGCCGGT TCTTCGTCGG CCACTACGAG GCCCGGGCGA CGCCGATCGC CGACACCTCG
GACCAGGGCA AGGTCAAGAC CACCGCCCGG CTGGTCCCCG GTCGCTACGA GATGCTCTAC
GTCTCCCAGG CCGGTGGCTT CAAGCGGTTC CACCTCACCG TGGAGCCCGG AGCCCAGACG
GTCCACGTGA ACGCGCCGAC GAACCTCGCG GCGAGGAAGA ACGGCGCCAA GGTGCTCGAC
GCCACGGCCG GCTCGCTGAA CGCGGTGTCC CTGCTCGACG GCACCGAGAA GACCGCGTGG
GGCGGCGTCA CCGCCACCAA CGTCGACGAG TCGCACCCGT TCGTCGTGGT CGACCTCGCG
GGCGGCGTGC ACACCGTGCG GCGGGTGCAG GTCAGCGCGA TGCTCAACCC GGCCCCGCCG
GACCCGAACG AGATCCCGCT CGCCCAGGAC CCCGACCCCG ACTCGGGCTC CCGGTTCACC
GCGCTGCGCC GGTTCGCGCT CGAGGCCTGC GTGAGCGACT GCGGCGAGGC GAAGGCGACC
TGGAAGCGGT TCTACGTCTC CCCCGGCAAC GCCTTCCCGG CGGTGCGGCC GCGCCCGGTC
GCCCCGAACC TGACGCTGCG CGCCTTCGAG GTCCCCGCGA CCCGGGCCGC CGCGATCCGG
TTCGTGGTGC TCGAGAACCA GTGCACCGGC TACGCCGGGT ACGCCGGCGA GCAGGACAAC
GACCCGATCA ACGACACCGA CTGCAAGACC GCCGCCGATC GCGGCACGAT CGTGCACGCG
GCCGAGCTCC AGGTGTTCTG A
 
Protein sequence
MTRSPLHRGG YRTAALALLT ATLVPGLAQL PAFATSPTAA LASPKTDGRD GVVHPLGGLG 
DPVAGLTDLD ARGTALPSAA QRSAAAQLGA VDLRWNQFGT PSSILPADGV LARASSNDPV
AAARSWLADN AAVFGLTRDA VAGMELVNDQ RLVQSDAHAV LFRQRFGELT PALASMVTVG
VANGEIAYAS SSLTKTTGTP PAATLSPVQA WVKAAANVGR TLDAGALDKI TSQVADGWTR
LSVPGYPQQQ LVRLRALALA DGSVRPVFET NVLDAQLGSA FAYTLMVDGV SGDILHRQNQ
AENSSDAFQF QGEVTATDCG PKHEFELTDG NTKQIAAMAA AANTLNDIEV KIFDPNDQLL
VTGDLGTSPE TATYSADSIP AGIYKLQVCP FDNPTVPFLP PGNYVASVTT SDTAGPATGD
IGFPSKWRYF TANPSLDYSA KTVPSNSVIG CWIAGEGCTS PTGPFRNVAA PGPWDATANG
VGTMTTVGNN ANTHEAWVSP LTPGGTAQAP ISPTREYTAA FTDAWNNSGC DPAQLTPGGN
DIDATVTNLF VAHNRMHDYS YYLGFTEDNY NMQADNLGRG GVAGDQEIGN AQAGALTGGQ
PSYLGRDNAN QITLQDGVPG ITNQYLFQPI AGAFYSPCVD GGLDMGIVGH EYTHAISNRM
VGGPDEGLTS EQGGAMGESW SDLIAGEYMF SHGYANGGNP WAVGVYATGN KSVAIRDYAI
NRNPLNYSDY GFDSTGNEVH ADGEIWNGTQ WEVRQALVKK WNKQFPYSDK ALQLRCAQAT
PTASPLPAGQ CPGNRRWVQL VFDSFLLQQG ATSMLDARDA MLAADRMRFG GEDLDVMWKA
FARRGMGQGA SIPNADSGDT TPSFASPREQ NGTVSFRSNR SGRFFVGHYE ARATPIADTS
DQGKVKTTAR LVPGRYEMLY VSQAGGFKRF HLTVEPGAQT VHVNAPTNLA ARKNGAKVLD
ATAGSLNAVS LLDGTEKTAW GGVTATNVDE SHPFVVVDLA GGVHTVRRVQ VSAMLNPAPP
DPNEIPLAQD PDPDSGSRFT ALRRFALEAC VSDCGEAKAT WKRFYVSPGN AFPAVRPRPV
APNLTLRAFE VPATRAAAIR FVVLENQCTG YAGYAGEQDN DPINDTDCKT AADRGTIVHA
AELQVF