Gene Noca_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1107 
Symbol 
ID4599360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1169096 
End bp1172137 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content62% 
IMG OID639775703 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_922310 
Protein GI119715345 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACCTCG ACGAGCTCGG CGTTCCCGGC AGCGCACCGC ACTTTGAGCC GATCTCGGTC 
AGTGACGAGA GCACGGTCGT TGCTGAGTTC GTCCCTGACC CGAGCAAGGA ATCGACGTAT
CAGTCGGAGG CTGAGCTCGA AGCCGCGTTC ATCCGGCTGC TTCAGGAACA GGCTTACGAG
TACCTGCACC TCACCTCGGC CGCCGCGTTG GAGGCCAACC TGCGCGTGCA ACTGGAAGCC
GTCAACGGCA TCACGTTCTC CGAGGCTGAG TGGCAGCGCT TCTTCACCGA AACGATCGCC
GGCCCGAACA AGGGGATCGT GGAGAAGACT GCGCTGGTTC AGGAGGACCA CGTCCAAATC
CTGCGGCGCG ACGACGGGAC GACGAAGAAC GTATATCTGA TCGACAAGCG GAACATCCAC
AACAACCGGC TCCAGGTGAT CAACCAGTAC GAAGCAGCCG GAGCCCACGC CACGCGTTAC
GACGTGACGA TCCTGGTCAA CGGCCTACCG ATGGTGCACG TAGAACTGAA GCGCCGGGGC
GTCGACATCC GTGAAGCCTT CAACCAGATC AACCGCTACG CCCGCGACAG CTTCTGGGCC
GGCTCGGGGC TGTTCGAGTA CGTGCAGCTG TTCGTGATCA GCAACGGCAC GCTGACGAAG
TACTACTCCA ACACGGTCCG CGACTCCCAT CTCGCCGACC AGAGGCAAGG CAAGCGGTCA
CGTCGGAAGA CGTCGAACAG CTTCGAGTTC ACCAGCTGGT GGGCGGATGC CAACAACCGG
CCGATCGCCG ACGTAGTTGG CTTCACCAAG ACGTTCTTTT CCAAGCACAC GGTGCTCAAT
ATCCTCACGA AGTACTGCGT CTTCGACGTC GACCGCAAAC TCCTGGTGAT GCGCCCGTAC
CAGATCGTCG CCGCTGAGCG GATCCTGCAG CGTATCGAGA CGTCGACGAC GTACAAACAG
TTCGGCTCGT TGGCTGCCGG GGGTTACATC TGGCACACGA CCGGGTCAGG CAAGACGCTG
ACCAGCTTCA AGGCCGCCCA GCTCGCGAGT CGTCTCCCGT TCGTGGAGAA GGTGCTCTTC
GTCGTCGACC GCAAGGACCT CGACTACCAG ACCATGCGGG AGTACGAACG CTTCGAGAAG
GGCGCGGCCA ACTCCAACAC CTCCACCAGC GTCCTGGCCA AGCAGTTGGA GGGCCCGAAC
GCGCGGATCA TCATCACCAC GATCCAGAAG CTCGCCAGGT TCGTCAGCTA CAACCGGCAG
CACTCGATCT ACTCCTCGCA CGTCGTGGTG ATCTTCGACG AGTGTCACCG CAGCCAGTTC
GGCGACATGC ACTTGGCGAT CACCAACGTC TTCCGCCGCT ACCACCTGTT CGGCTTCACC
GGCACACCGA TCTTCGCCGA GAACGCCGGC ACCAGCGGCA GCCCGCGACT GCGCACAACC
GAGCAGGCCT TCGGGGAGAA GCTGCACACC TACACGATCG TCGACGCCAT CAACGACCGG
AACGTGCTGC CCTTCCGGGT CGACTACGTC AACACCCTCA AGCCCGCCGA GAAACTGACG
GACGCTCAGG TCGCGGCCAT CGACACCGAG CGAGCACTGC TGGCTCCGGA GCGGATCAAC
CAGATCGTGG GCTACATCCG CGAGCACTTC GACCAGAAGA CCAAGCGCAG CTCCGCGTAT
CGGCTCGGCG ATCGCCGGCT CGCAGGGTTC AACTCATTGT TCGCCGCCGC CTCGATCGAC
GCCGCGAAGC GCTACTACGC CGAATTCGCC CGGCAGCAGT CAGACCTGGC GTCAGAGCAG
CGGCTCAAGG TCGGCCTCAT CTTCAGCTTC ACCGCCAATG GGGAGGAGGC CGACGGCCTG
CTGGCCGAGG AGGAGTTCGA GACCGGCGAC CTCGACGCGA CCTCACGTGA CTTTCTGGAA
GGCGCGATCC GCGACTACAA CGCGCTGTTC GGCACCAGCT TCGACACCTC GGCCGACAAG
TTCCAGAACT ACTACAAGGA CCTCTCCGAG CGGTTGAAGA GCCGCGAGCT GGACCTCGTC
ATCGTGGTCA ACATGTTCCT CACCGGCTTC GACGCCACCA CCCTGAACAC TTTGTGGCTG
GACAAGAACC TGCGCTCCCA TGGGCTGATC CAGGCGTACT CGCGCACGAA TCGCATCCTG
AACTCGGTCA AGACCTACGG CAACATCGTG TCGTTCCGCG ACCTGGAGGA CGCCACCAAT
GACGCACTGG CGCTGTTCGG CAACAGGGAC GCCCGCGGCA TCGTCCTGCT GCGGCCGTAC
GCCGACTACT ACGCGGAGTA CGAAAATGCC GTCGCCGAGC TCACCGAAGT CTTCCCGCTC
GGGGAGCAGA TCATCGGCGA GGCGGCGCAG AAGCAGTACA TCGCCCTGTT CGGAGTGATT
CTACGACTGC GAAACATCCT CACCTCGTTC GACGAGTTCG CCGGACACGA GCTGCTGACC
GAGCGCGACT ACCAGGACTA TCAGTCGATC TACCTCAACC TGTACGCCGA GTTCCGCGGC
GCCAAGGAAG CGGAGAAGGA GTCGATCAAC GACGACGTCG TCTTCGAGAT CGAGTTGATC
AAGCAGGTCG AGGTCAACGT CGACTACGTG CTGATGCTCG TCGAGAAGTG GCGCGACGCC
AGGGGCAACG GTGCCGACCG AGAGATGGAC GCGCTGATGA AGATCCAGCG TGCGATCGAC
TCCAGCGTCA CTCTGCGCAA CAAGCGCGAC CTGATCATGG ACTTTGTAGA GACAATGACT
GTGACCGGCG ACGTCAACGA CGACTGGCGG CGCTTCGTCG CCGCGAAGCG GGCTGAGGAG
CTGACGAGCA TCATCACTGA AGAGAACCTC AAGCCCGACG AGACTCACGC CTTCGTTGAG
GCGGCCTTCC GGGACGGCGC CATCCCGACC ATGGGCACCG CGATCACCCG TATCCTCCCG
CCCATCTCGC GGTTCTCTCC GAGCGGCGGC CATACCGTCA AGAAGCAGGC CGTGATCGAC
CGGCTGCTCG CCTTCTTCGA CCGCTACTTC GGGTTGGCCT GA
 
Protein sequence
MNLDELGVPG SAPHFEPISV SDESTVVAEF VPDPSKESTY QSEAELEAAF IRLLQEQAYE 
YLHLTSAAAL EANLRVQLEA VNGITFSEAE WQRFFTETIA GPNKGIVEKT ALVQEDHVQI
LRRDDGTTKN VYLIDKRNIH NNRLQVINQY EAAGAHATRY DVTILVNGLP MVHVELKRRG
VDIREAFNQI NRYARDSFWA GSGLFEYVQL FVISNGTLTK YYSNTVRDSH LADQRQGKRS
RRKTSNSFEF TSWWADANNR PIADVVGFTK TFFSKHTVLN ILTKYCVFDV DRKLLVMRPY
QIVAAERILQ RIETSTTYKQ FGSLAAGGYI WHTTGSGKTL TSFKAAQLAS RLPFVEKVLF
VVDRKDLDYQ TMREYERFEK GAANSNTSTS VLAKQLEGPN ARIIITTIQK LARFVSYNRQ
HSIYSSHVVV IFDECHRSQF GDMHLAITNV FRRYHLFGFT GTPIFAENAG TSGSPRLRTT
EQAFGEKLHT YTIVDAINDR NVLPFRVDYV NTLKPAEKLT DAQVAAIDTE RALLAPERIN
QIVGYIREHF DQKTKRSSAY RLGDRRLAGF NSLFAAASID AAKRYYAEFA RQQSDLASEQ
RLKVGLIFSF TANGEEADGL LAEEEFETGD LDATSRDFLE GAIRDYNALF GTSFDTSADK
FQNYYKDLSE RLKSRELDLV IVVNMFLTGF DATTLNTLWL DKNLRSHGLI QAYSRTNRIL
NSVKTYGNIV SFRDLEDATN DALALFGNRD ARGIVLLRPY ADYYAEYENA VAELTEVFPL
GEQIIGEAAQ KQYIALFGVI LRLRNILTSF DEFAGHELLT ERDYQDYQSI YLNLYAEFRG
AKEAEKESIN DDVVFEIELI KQVEVNVDYV LMLVEKWRDA RGNGADREMD ALMKIQRAID
SSVTLRNKRD LIMDFVETMT VTGDVNDDWR RFVAAKRAEE LTSIITEENL KPDETHAFVE
AAFRDGAIPT MGTAITRILP PISRFSPSGG HTVKKQAVID RLLAFFDRYF GLA