Gene Noca_1788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1788 
Symbol 
ID4597700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1899776 
End bp1901683 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content75% 
IMG OID639776387 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_922987 
Protein GI119716022 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCCT CGCGCACCCG GCTGCTCGCG CTCACCAGCG CCCTCGCGGC CGTCCTCTCC 
CTCACCGGCG TCGCCTCCGC GGCGGCGCCG GCGGCACCTG CGGCCGCCAC GGTCGGCTCC
GACCTCCAGG GCGCGCACCG CGGCCTGGAC CGCCTGCTCG CCGACGGTAC GCCGAACCAC
CGGGCGATCG TCACGTTCGC CGCCGTGCCG ACCAGCACCC AGATCGGCGC GCTGCAGGCC
CTCGGCCTCG TCGTGCAGCC GATGTCCCAC CTGCCGCTCG CCCTCGTCGA GGGCCCGGTG
CCCGCGATGG TGCAGGCGGT CACGGCCGGC ATCGGCCTCG ACGTCTACCC CGACGAGCGG
CTCCAGCTCC TCGACACCCC GTCGACCAAC GCGATGTCGT CGAGCCCGGC GGCGGCCCAG
GCCCTGCGCA CCCGCGGCTT CACCGGCAAG GGCGTGACCG TCGGCGTCGT CGACTCCGGC
TGCGACGCGA CGCACTCCGA CCTCGCCGAC CACGTCGTGC ACAACGTGAG CCTGCTCAGC
CCCGAGTACG CCAACGCCGG CACCGACCCG GCGATCGTCG TACCCGTCGA CCAGGGCCCG
GTGAGCAACA CCGACCTCGG CAGCGGCCAC GGGACGCACG TCGCGGGCAT CATCGCCGCC
GACTCCTCCT CGGCCGAGGA CGGCAGTCGG TACGGCGTCG CCCCGGACGC CGACCTCGCC
TGCTTCGCGA TCGGCGCAGT GCTGTTCACG ACCGCGGTCG TCACCGCCTA CGACTACATG
CTCGACCAGC CGGACCTGCT CGGCATCGAC GTCGTCAACA ACTCCTGGGG CAACAGCTAC
CGCCAGTTCG ACCCCGCCGA CCCGGTCGCC GTCGCCACCA AGGCCGTGGC CGACCGTGGC
GTGACCGTGG TGTTCGCCGC CGGCAACTCC GGCAGCGGCG ACGTCCCGAT GAGCCTGAAC
CCGTTCTCCC AGTCGCCCTG GGTGATCTCC GTGGCGGCCG GCACCCTGGA CCGGCACCGC
GGCGACTTCT CCTCCAACGG CCTGGTCCAC GACAACTCGC AACCCACGGC CATCGGCACC
GAGGGACACA CCACCTACAC CGGCGACCGG ATCGGGCTGG TGCACCCGGA CCTCACCGCC
CCCGGCGTCG ACATCGGCTC GACCTGCGAC AGCGCCGGCA CCCTGATCGG GCCGTGCGGA
CCGGACGAGA ACGCCTCGGC CTCGGGCACC TCGATGGCCT CGCCGCACAT CGCCGGCGCG
GCCGCGGTGC TGCTGCAGGC CCAGCCCCGG CTCAGCCCGG AGCAGGTGCG GCTCGCCCTG
CAGGCGACGG CGACGCCGGT CCAGGCGACG GGCGGCCCGG CCGCGCTGCC GTTCTGGGAG
GTCGGGTACG GCTACGCCAA CCTCGACCGC GCGGTCCAGC TGGTGCGCTC CGACGGCTGG
CAGGGGCGGC TGCGTGCCGC CGCCCACCGG GCGGACCGCC GGGTGCTCGC CGCGGACGGC
ACCGCGGTCG TCCGATCCGA CTTCTTCGTC CACGAGGCGC CCCCGGCGAC CGCGGGCGGC
AGTGACAGCG CGTCGTACGA CGTGCCCGTG TCCGCGCGCA CCCGCGGGCT CGCCGTGAGC
CTGGCGTTCC CGTCCGGCGG CAGCGTCGGC GCCAGCCTGT TCAGCTACAC CGTGCAGGTC
CTCGACCCCA GCGGGAAGGT GATCGCGACG ACCACTTCGG ACCCGGTCGC GGGCTCGGGC
ACCGCGCTGG CGACGGTCCG GCTGCCCCAG GGCGCGGAGG CCGGGACGTA CACGTTCGAG
GTCACCGGCG ACTACGCCGC CTCCGACCCG GACACCGTCG ACAGCGACTC GCTGCTGGGC
CGGTTCGTCA CCCTGCACGT GGCCCAGCTG CGGAGCAGCC GGCGCTAG
 
Protein sequence
MPSSRTRLLA LTSALAAVLS LTGVASAAAP AAPAAATVGS DLQGAHRGLD RLLADGTPNH 
RAIVTFAAVP TSTQIGALQA LGLVVQPMSH LPLALVEGPV PAMVQAVTAG IGLDVYPDER
LQLLDTPSTN AMSSSPAAAQ ALRTRGFTGK GVTVGVVDSG CDATHSDLAD HVVHNVSLLS
PEYANAGTDP AIVVPVDQGP VSNTDLGSGH GTHVAGIIAA DSSSAEDGSR YGVAPDADLA
CFAIGAVLFT TAVVTAYDYM LDQPDLLGID VVNNSWGNSY RQFDPADPVA VATKAVADRG
VTVVFAAGNS GSGDVPMSLN PFSQSPWVIS VAAGTLDRHR GDFSSNGLVH DNSQPTAIGT
EGHTTYTGDR IGLVHPDLTA PGVDIGSTCD SAGTLIGPCG PDENASASGT SMASPHIAGA
AAVLLQAQPR LSPEQVRLAL QATATPVQAT GGPAALPFWE VGYGYANLDR AVQLVRSDGW
QGRLRAAAHR ADRRVLAADG TAVVRSDFFV HEAPPATAGG SDSASYDVPV SARTRGLAVS
LAFPSGGSVG ASLFSYTVQV LDPSGKVIAT TTSDPVAGSG TALATVRLPQ GAEAGTYTFE
VTGDYAASDP DTVDSDSLLG RFVTLHVAQL RSSRR