Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0343 |
Symbol | |
ID | 4597980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 364136 |
End bp | 365311 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639774958 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_921574 |
Protein GI | 119714609 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACCTGC TCGACTGGCT CCTGGTCGTG CTCGTGCTCG CCTACGCGCT CTCCGGCTAC TGGCAGGGCT TCATCACCGG TGCCTTCGCG ACCGGGGGGC TGCTGCTCGG CGGACTGTTC GGCGTGTGGC TGGCCCCGGT CGCACTGGGT GACGCCAACC CCTCCCTGAT GGTCTCCCTG GGCGCGCTGT TCATCGTGAT CCTGTCCGCG TCGCTGGGGC AGGCCGTGCT CCAGTTCGCC GGCGCCCGGA TCCGCGAGCG GATCACCTGG CAGCCGGCCC GCGCCCTCGA CGCGGTCGGC GGCGCCATGC TCAGCGCCGT GGCGGTCCTC GTGGTCGCCT GGGCGCTCGG CGTCGCGATC TCGGGGTCTC GGATCGGCGG CGTCACCCCG CTGGTGCGGG GCTCGACCGT GCTCTCCCAC GTCGACGAGG TGATGCCCGC CAGCGCCGAC GGCGCGCTGC AGGCGTTCAA CGACGTCGTC GGCACCAGCT TCTTTCCCCG CTACCTCGAG CCGTTCGCGC CCGAGCGGAT CGTCGAGGTC GGACCCGGCC CCAAGCGGCT GCTCAACGAC CCCGACGTCG AGCGCGCCGG GTCGAGCGTC CTCAAGATCC GCGGCACCAA CGAGTGTGGC CGGGGTGTCG AGGGGTCCGG GTTCCTGTAC GCCGGCAACC GGCTGATGAC CAACGCCCAC GTCGTCGCCG GGATCGACGA CCCCGAGGTC ATCGTCGGCG ACGAGTCGGT CCCGGCGGAC GTCGTCTACT ACAACCCCGA CATCGACGTG GCGGTGCTCT CCTTCGACAG CGGGGACCTG CCGGCTCTGC GCTTCGACCG CGACGCCGGC GCGCCCGACG GTGTCGCGAT CCTGGGCTAC CCGCAGGACG GGCCCTACCA CGTGGAGCCC GCCCGGATCC GCTCCGAGCA GCGGCTCCGC TCACCCAACA TCTACGGCGA CGGCGCGGTG ATCCGTGAGG TCTACTCCCT GCGCGGGCGG ATCCTGCCGG GCAACTCCGG CGGGCCGATC GTGTCCTCGG CCGGCGACGT CGTCGGCGTG GTGTTCGCCG CCTCGGTCAC CGACCACGAA ACCGGCTACG CGCTGACCGC CGGACAGGTC TCCGCCGCCG CGGCCGCCGG CCTGACCAGC TCGAGCCAGG TGTCGACCGG CGGTTGTGCT GGGTGA
|
Protein sequence | MNLLDWLLVV LVLAYALSGY WQGFITGAFA TGGLLLGGLF GVWLAPVALG DANPSLMVSL GALFIVILSA SLGQAVLQFA GARIRERITW QPARALDAVG GAMLSAVAVL VVAWALGVAI SGSRIGGVTP LVRGSTVLSH VDEVMPASAD GALQAFNDVV GTSFFPRYLE PFAPERIVEV GPGPKRLLND PDVERAGSSV LKIRGTNECG RGVEGSGFLY AGNRLMTNAH VVAGIDDPEV IVGDESVPAD VVYYNPDIDV AVLSFDSGDL PALRFDRDAG APDGVAILGY PQDGPYHVEP ARIRSEQRLR SPNIYGDGAV IREVYSLRGR ILPGNSGGPI VSSAGDVVGV VFAASVTDHE TGYALTAGQV SAAAAAGLTS SSQVSTGGCA G
|
| |