Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1107 |
Symbol | |
ID | 4599360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1169096 |
End bp | 1172137 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639775703 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_922310 |
Protein GI | 119715345 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACCTCG ACGAGCTCGG CGTTCCCGGC AGCGCACCGC ACTTTGAGCC GATCTCGGTC AGTGACGAGA GCACGGTCGT TGCTGAGTTC GTCCCTGACC CGAGCAAGGA ATCGACGTAT CAGTCGGAGG CTGAGCTCGA AGCCGCGTTC ATCCGGCTGC TTCAGGAACA GGCTTACGAG TACCTGCACC TCACCTCGGC CGCCGCGTTG GAGGCCAACC TGCGCGTGCA ACTGGAAGCC GTCAACGGCA TCACGTTCTC CGAGGCTGAG TGGCAGCGCT TCTTCACCGA AACGATCGCC GGCCCGAACA AGGGGATCGT GGAGAAGACT GCGCTGGTTC AGGAGGACCA CGTCCAAATC CTGCGGCGCG ACGACGGGAC GACGAAGAAC GTATATCTGA TCGACAAGCG GAACATCCAC AACAACCGGC TCCAGGTGAT CAACCAGTAC GAAGCAGCCG GAGCCCACGC CACGCGTTAC GACGTGACGA TCCTGGTCAA CGGCCTACCG ATGGTGCACG TAGAACTGAA GCGCCGGGGC GTCGACATCC GTGAAGCCTT CAACCAGATC AACCGCTACG CCCGCGACAG CTTCTGGGCC GGCTCGGGGC TGTTCGAGTA CGTGCAGCTG TTCGTGATCA GCAACGGCAC GCTGACGAAG TACTACTCCA ACACGGTCCG CGACTCCCAT CTCGCCGACC AGAGGCAAGG CAAGCGGTCA CGTCGGAAGA CGTCGAACAG CTTCGAGTTC ACCAGCTGGT GGGCGGATGC CAACAACCGG CCGATCGCCG ACGTAGTTGG CTTCACCAAG ACGTTCTTTT CCAAGCACAC GGTGCTCAAT ATCCTCACGA AGTACTGCGT CTTCGACGTC GACCGCAAAC TCCTGGTGAT GCGCCCGTAC CAGATCGTCG CCGCTGAGCG GATCCTGCAG CGTATCGAGA CGTCGACGAC GTACAAACAG TTCGGCTCGT TGGCTGCCGG GGGTTACATC TGGCACACGA CCGGGTCAGG CAAGACGCTG ACCAGCTTCA AGGCCGCCCA GCTCGCGAGT CGTCTCCCGT TCGTGGAGAA GGTGCTCTTC GTCGTCGACC GCAAGGACCT CGACTACCAG ACCATGCGGG AGTACGAACG CTTCGAGAAG GGCGCGGCCA ACTCCAACAC CTCCACCAGC GTCCTGGCCA AGCAGTTGGA GGGCCCGAAC GCGCGGATCA TCATCACCAC GATCCAGAAG CTCGCCAGGT TCGTCAGCTA CAACCGGCAG CACTCGATCT ACTCCTCGCA CGTCGTGGTG ATCTTCGACG AGTGTCACCG CAGCCAGTTC GGCGACATGC ACTTGGCGAT CACCAACGTC TTCCGCCGCT ACCACCTGTT CGGCTTCACC GGCACACCGA TCTTCGCCGA GAACGCCGGC ACCAGCGGCA GCCCGCGACT GCGCACAACC GAGCAGGCCT TCGGGGAGAA GCTGCACACC TACACGATCG TCGACGCCAT CAACGACCGG AACGTGCTGC CCTTCCGGGT CGACTACGTC AACACCCTCA AGCCCGCCGA GAAACTGACG GACGCTCAGG TCGCGGCCAT CGACACCGAG CGAGCACTGC TGGCTCCGGA GCGGATCAAC CAGATCGTGG GCTACATCCG CGAGCACTTC GACCAGAAGA CCAAGCGCAG CTCCGCGTAT CGGCTCGGCG ATCGCCGGCT CGCAGGGTTC AACTCATTGT TCGCCGCCGC CTCGATCGAC GCCGCGAAGC GCTACTACGC CGAATTCGCC CGGCAGCAGT CAGACCTGGC GTCAGAGCAG CGGCTCAAGG TCGGCCTCAT CTTCAGCTTC ACCGCCAATG GGGAGGAGGC CGACGGCCTG CTGGCCGAGG AGGAGTTCGA GACCGGCGAC CTCGACGCGA CCTCACGTGA CTTTCTGGAA GGCGCGATCC GCGACTACAA CGCGCTGTTC GGCACCAGCT TCGACACCTC GGCCGACAAG TTCCAGAACT ACTACAAGGA CCTCTCCGAG CGGTTGAAGA GCCGCGAGCT GGACCTCGTC ATCGTGGTCA ACATGTTCCT CACCGGCTTC GACGCCACCA CCCTGAACAC TTTGTGGCTG GACAAGAACC TGCGCTCCCA TGGGCTGATC CAGGCGTACT CGCGCACGAA TCGCATCCTG AACTCGGTCA AGACCTACGG CAACATCGTG TCGTTCCGCG ACCTGGAGGA CGCCACCAAT GACGCACTGG CGCTGTTCGG CAACAGGGAC GCCCGCGGCA TCGTCCTGCT GCGGCCGTAC GCCGACTACT ACGCGGAGTA CGAAAATGCC GTCGCCGAGC TCACCGAAGT CTTCCCGCTC GGGGAGCAGA TCATCGGCGA GGCGGCGCAG AAGCAGTACA TCGCCCTGTT CGGAGTGATT CTACGACTGC GAAACATCCT CACCTCGTTC GACGAGTTCG CCGGACACGA GCTGCTGACC GAGCGCGACT ACCAGGACTA TCAGTCGATC TACCTCAACC TGTACGCCGA GTTCCGCGGC GCCAAGGAAG CGGAGAAGGA GTCGATCAAC GACGACGTCG TCTTCGAGAT CGAGTTGATC AAGCAGGTCG AGGTCAACGT CGACTACGTG CTGATGCTCG TCGAGAAGTG GCGCGACGCC AGGGGCAACG GTGCCGACCG AGAGATGGAC GCGCTGATGA AGATCCAGCG TGCGATCGAC TCCAGCGTCA CTCTGCGCAA CAAGCGCGAC CTGATCATGG ACTTTGTAGA GACAATGACT GTGACCGGCG ACGTCAACGA CGACTGGCGG CGCTTCGTCG CCGCGAAGCG GGCTGAGGAG CTGACGAGCA TCATCACTGA AGAGAACCTC AAGCCCGACG AGACTCACGC CTTCGTTGAG GCGGCCTTCC GGGACGGCGC CATCCCGACC ATGGGCACCG CGATCACCCG TATCCTCCCG CCCATCTCGC GGTTCTCTCC GAGCGGCGGC CATACCGTCA AGAAGCAGGC CGTGATCGAC CGGCTGCTCG CCTTCTTCGA CCGCTACTTC GGGTTGGCCT GA
|
Protein sequence | MNLDELGVPG SAPHFEPISV SDESTVVAEF VPDPSKESTY QSEAELEAAF IRLLQEQAYE YLHLTSAAAL EANLRVQLEA VNGITFSEAE WQRFFTETIA GPNKGIVEKT ALVQEDHVQI LRRDDGTTKN VYLIDKRNIH NNRLQVINQY EAAGAHATRY DVTILVNGLP MVHVELKRRG VDIREAFNQI NRYARDSFWA GSGLFEYVQL FVISNGTLTK YYSNTVRDSH LADQRQGKRS RRKTSNSFEF TSWWADANNR PIADVVGFTK TFFSKHTVLN ILTKYCVFDV DRKLLVMRPY QIVAAERILQ RIETSTTYKQ FGSLAAGGYI WHTTGSGKTL TSFKAAQLAS RLPFVEKVLF VVDRKDLDYQ TMREYERFEK GAANSNTSTS VLAKQLEGPN ARIIITTIQK LARFVSYNRQ HSIYSSHVVV IFDECHRSQF GDMHLAITNV FRRYHLFGFT GTPIFAENAG TSGSPRLRTT EQAFGEKLHT YTIVDAINDR NVLPFRVDYV NTLKPAEKLT DAQVAAIDTE RALLAPERIN QIVGYIREHF DQKTKRSSAY RLGDRRLAGF NSLFAAASID AAKRYYAEFA RQQSDLASEQ RLKVGLIFSF TANGEEADGL LAEEEFETGD LDATSRDFLE GAIRDYNALF GTSFDTSADK FQNYYKDLSE RLKSRELDLV IVVNMFLTGF DATTLNTLWL DKNLRSHGLI QAYSRTNRIL NSVKTYGNIV SFRDLEDATN DALALFGNRD ARGIVLLRPY ADYYAEYENA VAELTEVFPL GEQIIGEAAQ KQYIALFGVI LRLRNILTSF DEFAGHELLT ERDYQDYQSI YLNLYAEFRG AKEAEKESIN DDVVFEIELI KQVEVNVDYV LMLVEKWRDA RGNGADREMD ALMKIQRAID SSVTLRNKRD LIMDFVETMT VTGDVNDDWR RFVAAKRAEE LTSIITEENL KPDETHAFVE AAFRDGAIPT MGTAITRILP PISRFSPSGG HTVKKQAVID RLLAFFDRYF GLA
|
| |