Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1343 |
Symbol | |
ID | 4598508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1416026 |
End bp | 1419028 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639775939 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_922544 |
Protein GI | 119715579 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAGTTC ACTTCTCGAG GGCGGGCGCG CTTCGGCGCG GGGCCGGCCT CTTCTCGGCG GGGGCGGTCG TGGCCGGCCT CCTGGTCGGT ACGCCGGCCT CGGTCGGTGC CGCGGAGGCC AAGCCCGCGG CGTCCGGATC GACCAGTCTC ACCGCAGGAC GCTACGTCGT GCTCCTGCGC GAGCCCAGCG CGGCCCAGTA CGACGGGACC AACCCGCGGT TCGCGGCGAC GCGCGCCAGG GGCGACCGCC AGTTCGACGC GCGCTCGCAG CGGGTGCGCA CGTACACCGC CCACCTGCGC TCGGCGCAGC GCTCGATCGC GAGCTCCGTC GGCGCCGACG TCGACCAGAG CTACACGATC GCGGCCAACG GGTTCTCGAC CGCCCTCACC CAGGAGCAGG CGCTCGACCT GTCCTCCGAC CGGCGGGTGC TGCTGCTGCA GAAGGACCAG CTGGTCCACG CCGACACCTG GAACACCCCG CGCTTCCTCG GCCTGACCGG CAAGCGGGGA GCGTGGGCCA CCCACGGCGG TCAGAAGAAG GCCGGCGCCG GCATCGTGGT CGCCGACCTC GACTCCGGCA TCTGGCCCGA GGCGAAGTCG TTCGCGGGCC CGGCGCTCAC CAGGAACCCG CAGACCAAGT GGCACATCAG CCGCATCGGC ACCTCGACCC GGATGGACAA GGCCGACGGC GGCGTGTTCA CCGGTGAGTG CGAGCTCGGC GAGGACTGGA CCGCCGACGA CTGCAACACC AAGCTGATCG GCGCCCGCTC CTACAGCGCC GGCTACCTCG CGAGCGGGAA CGCGATCATC GACGCGGACT ACGCCTCGAC GCGCGACGGC AACGGCCACG GCACCCACAC CGCCAGCACC GCCGCGGGCA ACATCGTCGA CCGGGTCAGG ACCGAAGGCG TGGAGTTCGG GACGATCTCC GGCATGGCGC CCGCCGCCCG GATCGCGGCG TACAAGGTCC TCTGGGCCCA GGACGACGGC ACCGCCTCCG GCGTCACCAG CGACATCGTC GCCGCGATCG ACGACGCCGT CTACGACGGC GCCGACGTCC TCAACTTCTC GATCTCGGGC GCGCTGGACA CGGTGGTCGA GGCCACCGAG GTCGCCTTCG AAGGTGCGGC CGAGGCCGGC GTCTTCGTGG CCGCCTCGGC CGGCAACTCC GGTCCCGATG CCTCCACCGT GGCACACAAC AGCCCCTGGC TGACCACGGT GGCCGCATCG ACCCACCACA ACTTCGAGAA CACCCTGGTG CTCGGCAACG GCACCAAGAT CGTGGGCGCC TCGATCAACG ACAAGCGCGT CTCGTCGAAG AAGCTCGTCG ACTCCGAGGC CTCGGGCGTC GCGGGCGGCG ACGACGCCGA CGCCAAGCTC TGCGGCCCCG ACACGCTCGA CCCGGCCAAG GTCACCGGCA AGATCGTGGT CTGCACCCGC GGCGTCTACG ACCGGGTGGC CAAGAGCGCC GAGGTCGCCC GCGCGGGCGG CGTCGGGATG GTGCTCGCCA ACCCGACCGA GAACAGCCTG GACGCCGACT TCCACTCGGT GCCGACCGTG CACATCACCA ACACCGACGC GGCCAAGGTG TTCGCCTACC TGGCCGCGCA GGGCAGCGCC GCCACGGCGA CGATCGAGCC CGGCAACCTC ACCAAGAAGA CGACGCCGCT GCCTCAGATC GCCGGCTTCT CCTCGCGCGG TGCGGCGATC GCGAACGACG CCGACCTGCT CAAGCCGGAC ATCGCCGCAC CGGGTGTGAG CGTGCTCGCC GCGGTGGCGC CGCCGTCGAA CGAGGGACGC GACTACGACC TCTACTCCGG TACGTCGATG GCCGCGCCGC ACATCACCGG CCTGGCCGCG TTCATGCTGA GCGTGCACCC GACGTGGAGC CCGATGAAGG TCAAGTCCGC GATGATGACG ACCGCTCACC GGGTCAAGGA CGCCGAGGGG AAGACCTCGA ACGACGTCCT CGCCGAGGGT TCCGGTCAGG TGAGCCCGCG CCGGTTCTTC GACCCCGGCC TGTTCGTGAC CTCGACCCCG CGCGAGTGGC TCGGGTTCCT CACCGGTCAG GGGCTGGACA CCGGCTACGC GGCCGTCGCG GCGAAGGACC TCAACGGTCC CTCGATGGCC CAGGGCCAGG TGCCGTCGGC GACGTCGTTC ACCCGCACCT TCACCTCGTC GATGGCGGGC ACCTGGAAGG TCTCGGTCTC GGTTCCCGGC TTCGCGGCGG CCCCCAGTGC CACCAAGCTG GTGGCCAGCG GTGCTGGTGA CGTGGAGACG CTGACGGTCG ACTTCACCCG GACGACCGCC CCGCTCCTCG AGTTCGCGAT GGGGTGGGTG ACCCTGACCG GGCCGACCAC CGTGCGGATC CCGGTCGCGC TGCGCCCGGT GTCGGTCAAG GCACCGGCCT CGGTCCAGGG CACCGGCACC GACGGCTCGG TGGAGGTTCC GGTCACCGCC GGCGCCACCG GCGAGCTCCT TGTCGAGCCG ACCGGGCTGG CGAAGGCGCA GACCGCCGAC AACACCGTGG CCGTGGCCGA CTTCCAGCTC GAGTGCGTCG AGATCGGCGC CGACAGCAAG CTGGCCCGGT TCGACCTCGA CGCGGCCGAC GACACCGCCG ACCTGGACAT GTTCGTCTAC TACTCGCCGA CGTCGTGTGA CCCCGACACG CTCGTGGCCC AGGTCGGTCA GTCGGCCAAC CCGACGGCTG ACGAGTCGGT GACGGTCATG GCTCCGGACG CGGGCTTCTA CGTGATCGAG GTCGACGGCT TCGCCGCCGG CGACGCCGGC GCACCGATGG CCTACCAGCT CCGGTCCTAC GACCTGGGCC CGGCCGCGAC CCTGGGCAAC CTCACGGTGA CGCCCAACCC GGTCCCGGTC GTCGCGCAGC AGGAGACGAC GTTCGACGTC AGCTGGTCCG GCCTCGACCC GGACGCGTCG TACCTCGGCA TGCTCGAGTA CGACGGCGCC CTGGCTCCGA CGATGGTGGA GATCACCAGC TGA
|
Protein sequence | MSVHFSRAGA LRRGAGLFSA GAVVAGLLVG TPASVGAAEA KPAASGSTSL TAGRYVVLLR EPSAAQYDGT NPRFAATRAR GDRQFDARSQ RVRTYTAHLR SAQRSIASSV GADVDQSYTI AANGFSTALT QEQALDLSSD RRVLLLQKDQ LVHADTWNTP RFLGLTGKRG AWATHGGQKK AGAGIVVADL DSGIWPEAKS FAGPALTRNP QTKWHISRIG TSTRMDKADG GVFTGECELG EDWTADDCNT KLIGARSYSA GYLASGNAII DADYASTRDG NGHGTHTAST AAGNIVDRVR TEGVEFGTIS GMAPAARIAA YKVLWAQDDG TASGVTSDIV AAIDDAVYDG ADVLNFSISG ALDTVVEATE VAFEGAAEAG VFVAASAGNS GPDASTVAHN SPWLTTVAAS THHNFENTLV LGNGTKIVGA SINDKRVSSK KLVDSEASGV AGGDDADAKL CGPDTLDPAK VTGKIVVCTR GVYDRVAKSA EVARAGGVGM VLANPTENSL DADFHSVPTV HITNTDAAKV FAYLAAQGSA ATATIEPGNL TKKTTPLPQI AGFSSRGAAI ANDADLLKPD IAAPGVSVLA AVAPPSNEGR DYDLYSGTSM AAPHITGLAA FMLSVHPTWS PMKVKSAMMT TAHRVKDAEG KTSNDVLAEG SGQVSPRRFF DPGLFVTSTP REWLGFLTGQ GLDTGYAAVA AKDLNGPSMA QGQVPSATSF TRTFTSSMAG TWKVSVSVPG FAAAPSATKL VASGAGDVET LTVDFTRTTA PLLEFAMGWV TLTGPTTVRI PVALRPVSVK APASVQGTGT DGSVEVPVTA GATGELLVEP TGLAKAQTAD NTVAVADFQL ECVEIGADSK LARFDLDAAD DTADLDMFVY YSPTSCDPDT LVAQVGQSAN PTADESVTVM APDAGFYVIE VDGFAAGDAG APMAYQLRSY DLGPAATLGN LTVTPNPVPV VAQQETTFDV SWSGLDPDAS YLGMLEYDGA LAPTMVEITS
|
| |