Gene Information |
Locus tag | Noca_1212 |
Symbol | |
ID | 4599312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1285152 |
End bp | 1286567 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639775806 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_922413 |
Protein GI | 119715448 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
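The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Noca_1212.
length_bp = gene_length(1285152, 1286567)
print(length_bp)                  # 1416
print(protein_length(length_bp))  # 471
```

Under these assumptions the computed values agree with the Gene Length (1416 bp) and Protein Length (471 aa) fields of the record.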
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.152583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGCAGG ACGAGCCCCG CGACGACGAG TCGACGCAGC CGCTGGCGGC CGCGCCCCTC GGGCCGCCGC CAGCCACGCC CCCGGCCACG CCCGCAGAAC CCCCCGCCGC GCCGCCGGCG GCACCACCCG TCGCACCGCC GGCCGGACCG CCCGCGGGTC CGCGGCCGCC GAGCTGGCTG CCGCCCACCG TGCAGTCCCC TCCGCACCGG TGGGCCGGGC TGCCGCCGGG AGCGCCGTAC GCCGGCCAGC AGCAGCCCTT CGGGCCGTCG TTCGGTCCTC CGTCGGGCCC GCCACCGGGC CACCTGGCCC CGCGCGGCCG GGTCCCCGGC TGGCTGTGGC CCGTGGTCTG CGTGCTCGCC CTCACCCTCG GGCTCGTGGG TGGCGCGCTC GGCGGCCTCG CCTACGATCA GCTCAGCGAG GACGAGCCGG GCACCGTGAG CAGCGGCCTG GCCGGCGTCG ACACCGTCTC CGAGGCGCCG CTGTCGGCGG ACAACGGGTC GATCGAGGCG GTCGCCGCGC GGCTGCTGCC GAGCACCGTG CAGATCTCCG CCGAGTACGA GGGCCAGCGC GGCGGGGCCA CCGGCTCCGG CTTCGTGTTG GACCGGCAGG GGCACGTGAT CACCAACAAC CACGTGGTGG CCGACGCCGA CGAGGCCGAC GGCCCGATCC AGATCGTCGA CCAGGACGGC AACCGCTACC CCGCGACCGT GGTCGGGCGC AGCCCGGTCT ACGACCTGGC GGTCCTCTAC TCGCCGAAGG CGAAGGGCCT GACGCCGGCC GCCCTGGGTG CCTCCCAGGA GCTGCGGGTC GGTGAGGGCG TGGTCGCCAT CGGCTCGCCC CTCGGGCTCA GCTCGACCGT GACCGCCGGG ATCGTGAGCG CGCTGCACCG GCCGGTCACC ACCGGCGACG CCGGCAACGA CTCCTCCTAC ATCAACGCGG TCCAGACGGA CGCGGCGATC AACCCCGGCA ACTCCGGGGG GCCGCTGGTG AACCTGCGCG GCCAGGTCGT CGGCGTGAAC TCCGCGATCG CGACGACCGG CGGCGGCATC GGCGGTGAGT CCGGCAACAT CGGCGTCGGC TTCGCGATCC CGATCGAGCA GGTGCGGGTC ACCGCGGACC AGATCCTGCG CACCGGCGAG GCGAAGTACC CGGTCATCGG CGCGCAGGTG CAGACCGGGC AGAACGACGG CAACGGCGCC GAGATCGACG AGGTCATGCC CGACACCCCC GCCGAGAGGG GCGGCCTGCG CAAGGGCGAC GTGATCATCG AGGTCGAGGG CGAGCGGGTC ACCGACGGCA TCGCCCTGAT CGTCGCGATC CGCACCCACC AGCCGGGGGA GACTGTGGAG TTCACGATCC TGCGGGACGG TCAGGAGCGC ACCATCTCGC TCACCCTGGG CGCCGAGACC GGCTGA
|
Protein sequence | MTQDEPRDDE STQPLAAAPL GPPPATPPAT PAEPPAAPPA APPVAPPAGP PAGPRPPSWL PPTVQSPPHR WAGLPPGAPY AGQQQPFGPS FGPPSGPPPG HLAPRGRVPG WLWPVVCVLA LTLGLVGGAL GGLAYDQLSE DEPGTVSSGL AGVDTVSEAP LSADNGSIEA VAARLLPSTV QISAEYEGQR GGATGSGFVL DRQGHVITNN HVVADADEAD GPIQIVDQDG NRYPATVVGR SPVYDLAVLY SPKAKGLTPA ALGASQELRV GEGVVAIGSP LGLSSTVTAG IVSALHRPVT TGDAGNDSSY INAVQTDAAI NPGNSGGPLV NLRGQVVGVN SAIATTGGGI GGESGNIGVG FAIPIEQVRV TADQILRTGE AKYPVIGAQV QTGQNDGNGA EIDEVMPDTP AERGGLRKGD VIIEVEGERV TDGIALIVAI RTHQPGETVE FTILRDGQER TISLTLGAET G