Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1588 |
Symbol | |
ID | 4596496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1686416 |
End bp | 1689700 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639776187 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_922789 |
Protein GI | 119715824 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTAGCC CGAAGATCAG CACCTACCGC GAGATCGTCC CTCTCATCTA CTCGTGGCGG ACTCCTGACG TCCCCAAGTA CGCGGGCTGG GAGAAGATCG GCTACACCGA GCAGGACAGC GCGGAAAAGC GCATCGACCA GCAGGCCAGC CAGATGTCCA TCACCAAGGA GAAGGTCTGG TCTCGACGCG CGCTGTACAC CACGGAGGCC GGCGGGCGGT TCACGGACAA GGACTTCCAC GAGTACCTGC GCCAGCAGGG GGTCCAGCGA GAAACCACCC CCAAGCGGAC CGAGTGGCAC AACCTCGCGG CAGCGCCGAA GAGGTCGCTG GACTACTTCA ACGAGTTCGC AGGGCAGGAC TTCGCTGACT TCCAGGCCGA TGGAGTGGAC GACTACGTCC TGCGCCCTGA GCAGCAGGCC GCCGTCGATC AGGCGCTGGC CGCCTTCGCC GCGCACGACG AGGTTCTGTG GAACGCGAAG CCCCGGTTCG GCAAGACGCT GACCACCTAC AACCTGATGC GGAAGCTCGA CGTCCACCGG GTCCTCATCG TCACCAACCG TCCTGCGATC GCGAACTCCT GGTACGACGA CTTCATGCGC TTCATCGGCC ACCGGACGAC CTTCAAATTC GTCTCGGAGT CGCCGTCTCT GGAGAACCGC TCTCCGATGA GTCGGGAGCA GTGGCGCGCC CACTCACGCG AGCACGGCGA TGCGGACCCG CGCATCGTCG AGTTCGTCTC GCTGCAGGAC CTCAAGGGGT CGCAGTACTT CGGCGGCAAC TACCCCAAGC TCAAGCACGT CGCCGACTTC GAGTGGGACC TGCTGGTCAT TGACGAGGCC CATGAGGGCA TCGACACCGA CAAGACCGAC GTCGCCTTCG ACCAGATCAC GCGCGCCCGC ACGCTGCACC TGTCGGGCAC ACCGTTCAAG GCGCTGGCCA AGGGCAAGTT CGGCAAGGAC CAGATCTTCA ACTGGACATA CGAGGACGAG CAGACCGCCC GCCAAGAGTG GGCCGACGCC TCCCAAGAGA ACCCGTACCA GGCACTGCCG AAGCTCAACC TGCTGACCTA CCAGATCTCG CGGATGATCA CCGACCGCCT GGCCGAGGGC GTCGCGCTCG AGGAGGACGA GGCGAACATC GACTTCACCT TCGATCTCAA CGAGTTCTTC GCGACCAAGG ACAACGGCTA CTTCGAGCAC GAAGCCGAAG TGATCCGGTT CCTCGACTGC CTAGCGTCCA ACGAGAAGTA CCCGTTCTCC ACCCCAGAGC TGCGCGATGA GATCCGACAC TCGTTCTGGC TGCTCAACCG GGTCGCCTCC GCCAAGGCGC TGCAGAGGCT GCTCAAGCAG CACGAGGTGT TCAAGGACTA CACGGTGGTT CTGGCTGCCG GCGACGGACG CTCCAACGAC GACTCCGACC TGGTCGCGGT CGGCAAGTCG CTGGACAAGG TGCGCACCGC CATCGCGGAG GCCGAGGAGT GCGATGGGAA GACGATCACG CTGTCGGTCG GTCAGCTGAC CACAGGCGTC ACGGTGCCCG AGTGGACCGC AGTGATCATG CTGTCCGACA TGTCCTCGCC GGCGCAGTAC ATGCAGGCGG CGTTTCGTGC GCAGAACCCG TGCACGTTCG AGCGCGCCGA GCACGTGTTC CAGAAGCAGA ACGCCTACAT CTTCGACTTC GCGCCTGAGC GGACGCTGAA GATCTTCGAC GCCTTTGCCA ACAACCTGCA CCCGAACCCG TCTGGCGACC CGGGGGTGCG GCTGGAGAAC ATCCGTACGT TGCTGAACTT CTTCCCGGTC CTGGGCGAGG ACTCCGAGGG CCGGATGATC GAGCTCGACG CGTCCCAGGT CCTGACGTTC CCACAGGTGT TCAAGGCCCG CGAGGTGGTG CGTCGCGGAT TCCTGTCGAA CCTGCTGTTC GCCAACGTGG CTGGCATCTT TCGCTACTCC GCGCACGTCA AGGAGATCCT CGACAAGCTG CCCACCGCGA AGCAGGGCAA GACCAAGAAG GGGGAGCCGA TCGAGATCCC GCACCCGCCG CCGGTCACCG ACCCCGACGG CAACGTCCTC CCGACCGGCC TCGCGTCCGT CATCAACCCC AAGGTCGAGG AGCTCGGCAA GCCGGTCTGG CGGACCGAGG ACATCCCGAT GCCGGAGCCA GACGTACCCG TGCACACCAT GGCCACGAAG ATCGCGAAGG CGGTCACCGA GCAGAGCCAC GACAAGCGCG AGGAGCTTAA GGAAGCCTAC GGACTCACCG CGGCCCAGGT CGACCGCGAC GAGAAGCGCA CCGAGCAGGC CGTGAAGGCG CAGGTCGAGC GTGCTTACAC CGAGCACAAC ATTGCGAGCA AGCACCTTGA GGACGAACTG GAGAACGCTG CCACCGAGGC GGAGGCCGAG GCGATCCAGG CCAAGAAGGT CGAGCAGGAC GAGGCGTTCA AAGCGAACAT CCTGTCCATC GTCACCGAGA CGATGGACTC GATCGTGCCC GAAGTGGTCA CCCGCGAGGA GATCAAGTCC GAGCAGAAGC GGGCGAACCG GACGATGGAC GACGCCCGCT CACACCTGCG CGGCTTCGCC CGGACGATCC CGATGTTCCT GATGGCATAC GGCGACCGCG ACATCCGGCT GGCGAACTTC GACGACTACA CGCCCGACGA CGTCTTCCAC GAGATCACCG GGATCACCGA GGCGGAGTTC CGCGTCCTGC GCGACGGCCA AGAGATCACC GAGGAAGACG GCACCGTCAC GAAGATCCCC GGGATGTTCG ACGAGGCCGT CTTCGACCAG GCCGTCCAGG AGTTCCTGAA CAAGAAGGAA GCACTGGCCG ACTACTTCGA CGACGCGCAG ACCGAGAACA TCTTCGCCTA CATCCCGCAG CAGAAGACGT CCCTGGTGTT CACGCCGCAG CACGTGGTCA AGACGATGGT CGACATCCTC GAGGCCGAGG ACCCAGGTGT CTTCGCCGAC CCGGACAAGA CCTTCGCCGA CCTGTTCTCG ACTGCCGGGC TGTACCTGAT GGAGCTGGTG CGGCGCCTCG ACACCGGCCT TGCCAATGCC CACCCTGACC AGGACGAGCG GATCAAGCAC ATCCTGACCT CACAGATCTT CGAGATGAGC CACAACGAGA TCCTCCACCG CATCACCATC GAGGCAGTGT CGGGGGGTGT TCCCGAACGC AAGGAGTGGA TCGAGAGGTC CGGACACTTC CGCGTAGGAA ACCTCGCCAG GATGACGGCG GATGAGCGCA AGAAGGCCGT GAACGACATG TTGGGAGAGA ACTGA
|
Protein sequence | MPSPKISTYR EIVPLIYSWR TPDVPKYAGW EKIGYTEQDS AEKRIDQQAS QMSITKEKVW SRRALYTTEA GGRFTDKDFH EYLRQQGVQR ETTPKRTEWH NLAAAPKRSL DYFNEFAGQD FADFQADGVD DYVLRPEQQA AVDQALAAFA AHDEVLWNAK PRFGKTLTTY NLMRKLDVHR VLIVTNRPAI ANSWYDDFMR FIGHRTTFKF VSESPSLENR SPMSREQWRA HSREHGDADP RIVEFVSLQD LKGSQYFGGN YPKLKHVADF EWDLLVIDEA HEGIDTDKTD VAFDQITRAR TLHLSGTPFK ALAKGKFGKD QIFNWTYEDE QTARQEWADA SQENPYQALP KLNLLTYQIS RMITDRLAEG VALEEDEANI DFTFDLNEFF ATKDNGYFEH EAEVIRFLDC LASNEKYPFS TPELRDEIRH SFWLLNRVAS AKALQRLLKQ HEVFKDYTVV LAAGDGRSND DSDLVAVGKS LDKVRTAIAE AEECDGKTIT LSVGQLTTGV TVPEWTAVIM LSDMSSPAQY MQAAFRAQNP CTFERAEHVF QKQNAYIFDF APERTLKIFD AFANNLHPNP SGDPGVRLEN IRTLLNFFPV LGEDSEGRMI ELDASQVLTF PQVFKAREVV RRGFLSNLLF ANVAGIFRYS AHVKEILDKL PTAKQGKTKK GEPIEIPHPP PVTDPDGNVL PTGLASVINP KVEELGKPVW RTEDIPMPEP DVPVHTMATK IAKAVTEQSH DKREELKEAY GLTAAQVDRD EKRTEQAVKA QVERAYTEHN IASKHLEDEL ENAATEAEAE AIQAKKVEQD EAFKANILSI VTETMDSIVP EVVTREEIKS EQKRANRTMD DARSHLRGFA RTIPMFLMAY GDRDIRLANF DDYTPDDVFH EITGITEAEF RVLRDGQEIT EEDGTVTKIP GMFDEAVFDQ AVQEFLNKKE ALADYFDDAQ TENIFAYIPQ QKTSLVFTPQ HVVKTMVDIL EAEDPGVFAD PDKTFADLFS TAGLYLMELV RRLDTGLANA HPDQDERIKH ILTSQIFEMS HNEILHRITI EAVSGGVPER KEWIERSGHF RVGNLARMTA DERKKAVNDM LGEN
|
| |