Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0884 |
Symbol | |
ID | 4599891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 922326 |
End bp | 923579 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639775485 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_922094 |
Protein GI | 119715129 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.63894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACC AGCCCACCAG TCCGCCTCCG GGACCGCCGA ACCCGTACTT CTCCACCCCG GCGCCGCTGG GTCCGCCTCC CCAGGGCCCG CCTCCCGGGG TGGCGCCGGA CCCGCAGCCG GGGCAGCGTC GTCCTCGCCG GACCGGCCTC GCCGCCGCGG TCGTCGCGAC CGCGCTGGTG GCCGGCGGCG CGGCAGGGGT CGGCGGCGCC GCCGCCTGGA GCGCGCTCGA CGATGGAGGC TCGTCGAGCG CGGGCGGCCC CAGCAGCCGC ACGACGGCGC AGGTCGTCGA CACCCCCGAC TCCGAGGCAC CGGCCGGCTC CGTCGAGCAG GTCGCTGCCA AGGTGCTGCC TTCGGTGGTC AAGATCGACG TGGCCGGCGC CCAGGGCGCC GGTTCGGGCT CGGGGATCAT CCTCAGCTCC GACGGCGAGA TCCTCACCAA CAACCACGTG GTGGAGCTCG CCGGCGACAA CGGGTCGATC CGGGTCTCCT TCAACGACGG CTCCACCGCG AAGGCCGAGA TCCTCGGCAC CGACCCCCTG ACGGACACCG CGGTGATCAA GGCCCAGGAC GTCTCCGGGC TGACGCCCGC GACGATCGGG AAGTCGGGCG ACCTCAAGGT CGGCGAGAGC GTCGTGGCGA TCGGGTCCCC GTTCGGGCTC GACTCGACGG TGACCAGCGG CATCGTGAGT GCGCTGGACC GGCCGGTGGA CGTCGGCTCC GACGGCCAGG GCAACAGCAC GACGTACCCC GCGATCCAGA CCGACGCTGC GATCAACCCG GGCAACAGCG GCGGCGCGCT CGTCGACCTC GACGGCAACG TCGTCGGCAT CAACTCCTCG ATCCGCACCG CCAGCTCCAT GGAGGGGCAG GCCGGCTCGA TCGGGCTCGG CTTCGCCATC CCGATGGACG AGGTGATGCC GATCGTCGAC CAGATGGTCA ACGGCGAGAC CCCGACCCAC GCCCGCCTCG GCATCTCCGT CTCCGACGTC GCGAGCCGGC CCGGAGCCGA GGTGACCGAG GGCGCCGAGG TCCAAGACGT CAACGCCGGC TCGACCGCGG ACGACGCCGG CCTGGCGAAG GGCGACATCA TCACCAAGGT CGACGACCAG CTGATCAGCG GCGCCGACTC CCTGGTCGCC ACCATCAGGT CCTACCGGCC CGGCGACGAG GTCACCGTCA CCTACGAGCA CGGCGGCGAC ACCAAGACCG TCACTCTCCA GCTGGACTCG GACGCGGACA CGTCCAACTC CTGA
|
Protein sequence | MNDQPTSPPP GPPNPYFSTP APLGPPPQGP PPGVAPDPQP GQRRPRRTGL AAAVVATALV AGGAAGVGGA AAWSALDDGG SSSAGGPSSR TTAQVVDTPD SEAPAGSVEQ VAAKVLPSVV KIDVAGAQGA GSGSGIILSS DGEILTNNHV VELAGDNGSI RVSFNDGSTA KAEILGTDPL TDTAVIKAQD VSGLTPATIG KSGDLKVGES VVAIGSPFGL DSTVTSGIVS ALDRPVDVGS DGQGNSTTYP AIQTDAAINP GNSGGALVDL DGNVVGINSS IRTASSMEGQ AGSIGLGFAI PMDEVMPIVD QMVNGETPTH ARLGISVSDV ASRPGAEVTE GAEVQDVNAG STADDAGLAK GDIITKVDDQ LISGADSLVA TIRSYRPGDE VTVTYEHGGD TKTVTLQLDS DADTSNS
|
| |