Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2395 |
Symbol | |
ID | 4599495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2551932 |
End bp | 2553602 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639776998 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_923587 |
Protein GI | 119716622 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGTC TACGTGGTGC ACGCAAGGCG GCAGCGGCCC TCGCTGCGAG TGCGGTAGCG GTCCTCCTCC CGCTGGGAGG TACGACGGCG CTGGCCGCGG ACGAGTCCCC CGGCCCGGCG GCCGTGGACG TCACCCCGCT GGAGAAGGTC AACGCCTTGG TCCAGCCGAG CGTGGTCTTC CTGCTCCAGA CCTGGGACGG CTACGTCTAC GACACCTTCA ACAAGCAGTA CCTGAACGAC GGCAATCCGT TCGAGCTGCA GTTCCAGTGC ACCGGCTTCG TGGTGAATCC CAACGGCTAC ATCGCGACCG CCGGCCACTG CGTGGACTTC AAGGAGGTCG AGGGCAGTTT CGTCGAGACC GCCGCCCAGT GGGCGCTCGC CAACGGCTAC TACAGCAGCA CCACCCTCAC CCTGGACGAC ATCGTCGGGT TCGACGACTA CCGGATCGAG TCGAGCGAGC GCAAGAACAC CGCCGACCTG GACATCCAGG TGGGCTGGGG TGCCTCCGTC TCCGGCATCG AGACCTCCGA GGTCAAGCGG GCTCGCGTCA TCGACTTCGA CCGGCAGTCC AAGGGCGACG TCGCGCTGAT CAAGGTCGAG GCCACCGATC TGAACGCCTT GCCGATGGCG ACCGACGAGG TGGACGTGGG GACCGACGTC GTGTCGATCG GCTACCCCGC CTCGGTCGAC TCCGTCACCG ACCCCAACCT GACCCCGTCG TTCAAGGACG GCTCGATCAG CTCGGTCAAG ACGGTCCAGG GCGGCGTACT CCCGGTCTAC GAGATCTCCG CCGCTGTCTC GGGCGGGATG AGCGGGGGGC CGTCGGTGAA TCTCGACGGC GAGGTGATCG GCGTCAACAG CTTCGGCATC CTCGGTGAGC CGCAGGCCTT CAACTTCCTC CGCCCGTCCT CCCAGCTCGC GGAGCTGATG GCCGGTGCGG GCGTGACCAA CGAGCTCAGC GAGACCACGC AGGCGTACCG AGATGGTCTC CTCGCCTACT GGGCGGGCGA CCGAACCACG GCCGTGGACA AGCTCGGGAG CGTCGTCGAC GAGCAGCCCA CCAACAAGCT CGCCGCGGAG TTCCTCGAGA AGGCCCAGGA CCTGCCCGAG CCGCCTCCGG CCGAAGAGTC GGACTCCGGC CTGCCGGTGG TGCCGATCGT GATCGGCGTT GCCGTGCTGG TCCTGGTCGG CGGGGGTCTC CTGGCCTTCC TCCTGCTGCG GCGCAAGGGC GGATCGTCGC CCGCCGCGAC CCCGCCGGCG GCTCCGGTGG CACCCGCGAC CCCGGCGGCA CCGGCCGCTC CGCTCGGCGG ACCGTACGCC GCGCCGTACG CGGACCCGGT GTCGAGCGCG CCCGCCGCAC CGCTCGGCTT CTCCGGCGGG GTGACCACGG CCCCACCGCC GACCATCCCG CCGACCATCC CGCCCACCAG CCCGGCCGCG CCTCCGCCGA CGCCGGTGCC CACGGCGTCC GTCACGCCGC CGCCCGCCGC GCCTGCCCCG GCGCCCGCCC CGGCGTCCAC GCCGGTGTCG GCGTCGGGCC CGCTGCCGAC ACCCCCGGTG GCCTCGGAGG AGCCGGCGGA GAAGCACGAG CCGCACTTCT GCGGGAACTG TGGGGAGCCT GCGGAGCACG GCAAGAAGTT CTGCAGCAAC TGCGGGAGCC CGCTGGCCTG A
|
Protein sequence | MNGLRGARKA AAALAASAVA VLLPLGGTTA LAADESPGPA AVDVTPLEKV NALVQPSVVF LLQTWDGYVY DTFNKQYLND GNPFELQFQC TGFVVNPNGY IATAGHCVDF KEVEGSFVET AAQWALANGY YSSTTLTLDD IVGFDDYRIE SSERKNTADL DIQVGWGASV SGIETSEVKR ARVIDFDRQS KGDVALIKVE ATDLNALPMA TDEVDVGTDV VSIGYPASVD SVTDPNLTPS FKDGSISSVK TVQGGVLPVY EISAAVSGGM SGGPSVNLDG EVIGVNSFGI LGEPQAFNFL RPSSQLAELM AGAGVTNELS ETTQAYRDGL LAYWAGDRTT AVDKLGSVVD EQPTNKLAAE FLEKAQDLPE PPPAEESDSG LPVVPIVIGV AVLVLVGGGL LAFLLLRRKG GSSPAATPPA APVAPATPAA PAAPLGGPYA APYADPVSSA PAAPLGFSGG VTTAPPPTIP PTIPPTSPAA PPPTPVPTAS VTPPPAAPAP APAPASTPVS ASGPLPTPPV ASEEPAEKHE PHFCGNCGEP AEHGKKFCSN CGSPLA
|
| |