Gene Noca_1588 details

Gene Information

Locus tag: Noca_1588
Symbol:
ID: 4596496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1686416
End bp: 1689700
Gene Length: 3285 bp
Protein Length: 1094 aa
Translation table: 11
GC content: 64%
IMG OID: 639776187
Product: type III restriction enzyme, res subunit
Protein accession: YP_922789
Protein GI: 119715824
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
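
The lengths listed above follow from the coordinates. The short Python sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not translated. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1686416
end_bp = 1689700

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3285 1094
```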


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTAGCC CGAAGATCAG CACCTACCGC GAGATCGTCC CTCTCATCTA CTCGTGGCGG 
ACTCCTGACG TCCCCAAGTA CGCGGGCTGG GAGAAGATCG GCTACACCGA GCAGGACAGC
GCGGAAAAGC GCATCGACCA GCAGGCCAGC CAGATGTCCA TCACCAAGGA GAAGGTCTGG
TCTCGACGCG CGCTGTACAC CACGGAGGCC GGCGGGCGGT TCACGGACAA GGACTTCCAC
GAGTACCTGC GCCAGCAGGG GGTCCAGCGA GAAACCACCC CCAAGCGGAC CGAGTGGCAC
AACCTCGCGG CAGCGCCGAA GAGGTCGCTG GACTACTTCA ACGAGTTCGC AGGGCAGGAC
TTCGCTGACT TCCAGGCCGA TGGAGTGGAC GACTACGTCC TGCGCCCTGA GCAGCAGGCC
GCCGTCGATC AGGCGCTGGC CGCCTTCGCC GCGCACGACG AGGTTCTGTG GAACGCGAAG
CCCCGGTTCG GCAAGACGCT GACCACCTAC AACCTGATGC GGAAGCTCGA CGTCCACCGG
GTCCTCATCG TCACCAACCG TCCTGCGATC GCGAACTCCT GGTACGACGA CTTCATGCGC
TTCATCGGCC ACCGGACGAC CTTCAAATTC GTCTCGGAGT CGCCGTCTCT GGAGAACCGC
TCTCCGATGA GTCGGGAGCA GTGGCGCGCC CACTCACGCG AGCACGGCGA TGCGGACCCG
CGCATCGTCG AGTTCGTCTC GCTGCAGGAC CTCAAGGGGT CGCAGTACTT CGGCGGCAAC
TACCCCAAGC TCAAGCACGT CGCCGACTTC GAGTGGGACC TGCTGGTCAT TGACGAGGCC
CATGAGGGCA TCGACACCGA CAAGACCGAC GTCGCCTTCG ACCAGATCAC GCGCGCCCGC
ACGCTGCACC TGTCGGGCAC ACCGTTCAAG GCGCTGGCCA AGGGCAAGTT CGGCAAGGAC
CAGATCTTCA ACTGGACATA CGAGGACGAG CAGACCGCCC GCCAAGAGTG GGCCGACGCC
TCCCAAGAGA ACCCGTACCA GGCACTGCCG AAGCTCAACC TGCTGACCTA CCAGATCTCG
CGGATGATCA CCGACCGCCT GGCCGAGGGC GTCGCGCTCG AGGAGGACGA GGCGAACATC
GACTTCACCT TCGATCTCAA CGAGTTCTTC GCGACCAAGG ACAACGGCTA CTTCGAGCAC
GAAGCCGAAG TGATCCGGTT CCTCGACTGC CTAGCGTCCA ACGAGAAGTA CCCGTTCTCC
ACCCCAGAGC TGCGCGATGA GATCCGACAC TCGTTCTGGC TGCTCAACCG GGTCGCCTCC
GCCAAGGCGC TGCAGAGGCT GCTCAAGCAG CACGAGGTGT TCAAGGACTA CACGGTGGTT
CTGGCTGCCG GCGACGGACG CTCCAACGAC GACTCCGACC TGGTCGCGGT CGGCAAGTCG
CTGGACAAGG TGCGCACCGC CATCGCGGAG GCCGAGGAGT GCGATGGGAA GACGATCACG
CTGTCGGTCG GTCAGCTGAC CACAGGCGTC ACGGTGCCCG AGTGGACCGC AGTGATCATG
CTGTCCGACA TGTCCTCGCC GGCGCAGTAC ATGCAGGCGG CGTTTCGTGC GCAGAACCCG
TGCACGTTCG AGCGCGCCGA GCACGTGTTC CAGAAGCAGA ACGCCTACAT CTTCGACTTC
GCGCCTGAGC GGACGCTGAA GATCTTCGAC GCCTTTGCCA ACAACCTGCA CCCGAACCCG
TCTGGCGACC CGGGGGTGCG GCTGGAGAAC ATCCGTACGT TGCTGAACTT CTTCCCGGTC
CTGGGCGAGG ACTCCGAGGG CCGGATGATC GAGCTCGACG CGTCCCAGGT CCTGACGTTC
CCACAGGTGT TCAAGGCCCG CGAGGTGGTG CGTCGCGGAT TCCTGTCGAA CCTGCTGTTC
GCCAACGTGG CTGGCATCTT TCGCTACTCC GCGCACGTCA AGGAGATCCT CGACAAGCTG
CCCACCGCGA AGCAGGGCAA GACCAAGAAG GGGGAGCCGA TCGAGATCCC GCACCCGCCG
CCGGTCACCG ACCCCGACGG CAACGTCCTC CCGACCGGCC TCGCGTCCGT CATCAACCCC
AAGGTCGAGG AGCTCGGCAA GCCGGTCTGG CGGACCGAGG ACATCCCGAT GCCGGAGCCA
GACGTACCCG TGCACACCAT GGCCACGAAG ATCGCGAAGG CGGTCACCGA GCAGAGCCAC
GACAAGCGCG AGGAGCTTAA GGAAGCCTAC GGACTCACCG CGGCCCAGGT CGACCGCGAC
GAGAAGCGCA CCGAGCAGGC CGTGAAGGCG CAGGTCGAGC GTGCTTACAC CGAGCACAAC
ATTGCGAGCA AGCACCTTGA GGACGAACTG GAGAACGCTG CCACCGAGGC GGAGGCCGAG
GCGATCCAGG CCAAGAAGGT CGAGCAGGAC GAGGCGTTCA AAGCGAACAT CCTGTCCATC
GTCACCGAGA CGATGGACTC GATCGTGCCC GAAGTGGTCA CCCGCGAGGA GATCAAGTCC
GAGCAGAAGC GGGCGAACCG GACGATGGAC GACGCCCGCT CACACCTGCG CGGCTTCGCC
CGGACGATCC CGATGTTCCT GATGGCATAC GGCGACCGCG ACATCCGGCT GGCGAACTTC
GACGACTACA CGCCCGACGA CGTCTTCCAC GAGATCACCG GGATCACCGA GGCGGAGTTC
CGCGTCCTGC GCGACGGCCA AGAGATCACC GAGGAAGACG GCACCGTCAC GAAGATCCCC
GGGATGTTCG ACGAGGCCGT CTTCGACCAG GCCGTCCAGG AGTTCCTGAA CAAGAAGGAA
GCACTGGCCG ACTACTTCGA CGACGCGCAG ACCGAGAACA TCTTCGCCTA CATCCCGCAG
CAGAAGACGT CCCTGGTGTT CACGCCGCAG CACGTGGTCA AGACGATGGT CGACATCCTC
GAGGCCGAGG ACCCAGGTGT CTTCGCCGAC CCGGACAAGA CCTTCGCCGA CCTGTTCTCG
ACTGCCGGGC TGTACCTGAT GGAGCTGGTG CGGCGCCTCG ACACCGGCCT TGCCAATGCC
CACCCTGACC AGGACGAGCG GATCAAGCAC ATCCTGACCT CACAGATCTT CGAGATGAGC
CACAACGAGA TCCTCCACCG CATCACCATC GAGGCAGTGT CGGGGGGTGT TCCCGAACGC
AAGGAGTGGA TCGAGAGGTC CGGACACTTC CGCGTAGGAA ACCTCGCCAG GATGACGGCG
GATGAGCGCA AGAAGGCCGT GAACGACATG TTGGGAGAGA ACTGA
 
Protein sequence
MPSPKISTYR EIVPLIYSWR TPDVPKYAGW EKIGYTEQDS AEKRIDQQAS QMSITKEKVW 
SRRALYTTEA GGRFTDKDFH EYLRQQGVQR ETTPKRTEWH NLAAAPKRSL DYFNEFAGQD
FADFQADGVD DYVLRPEQQA AVDQALAAFA AHDEVLWNAK PRFGKTLTTY NLMRKLDVHR
VLIVTNRPAI ANSWYDDFMR FIGHRTTFKF VSESPSLENR SPMSREQWRA HSREHGDADP
RIVEFVSLQD LKGSQYFGGN YPKLKHVADF EWDLLVIDEA HEGIDTDKTD VAFDQITRAR
TLHLSGTPFK ALAKGKFGKD QIFNWTYEDE QTARQEWADA SQENPYQALP KLNLLTYQIS
RMITDRLAEG VALEEDEANI DFTFDLNEFF ATKDNGYFEH EAEVIRFLDC LASNEKYPFS
TPELRDEIRH SFWLLNRVAS AKALQRLLKQ HEVFKDYTVV LAAGDGRSND DSDLVAVGKS
LDKVRTAIAE AEECDGKTIT LSVGQLTTGV TVPEWTAVIM LSDMSSPAQY MQAAFRAQNP
CTFERAEHVF QKQNAYIFDF APERTLKIFD AFANNLHPNP SGDPGVRLEN IRTLLNFFPV
LGEDSEGRMI ELDASQVLTF PQVFKAREVV RRGFLSNLLF ANVAGIFRYS AHVKEILDKL
PTAKQGKTKK GEPIEIPHPP PVTDPDGNVL PTGLASVINP KVEELGKPVW RTEDIPMPEP
DVPVHTMATK IAKAVTEQSH DKREELKEAY GLTAAQVDRD EKRTEQAVKA QVERAYTEHN
IASKHLEDEL ENAATEAEAE AIQAKKVEQD EAFKANILSI VTETMDSIVP EVVTREEIKS
EQKRANRTMD DARSHLRGFA RTIPMFLMAY GDRDIRLANF DDYTPDDVFH EITGITEAEF
RVLRDGQEIT EEDGTVTKIP GMFDEAVFDQ AVQEFLNKKE ALADYFDDAQ TENIFAYIPQ
QKTSLVFTPQ HVVKTMVDIL EAEDPGVFAD PDKTFADLFS TAGLYLMELV RRLDTGLANA
HPDQDERIKH ILTSQIFEMS HNEILHRITI EAVSGGVPER KEWIERSGHF RVGNLARMTA
DERKKAVNDM LGEN
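
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The Python sketch below checks the first 60 bases of the gene sequence against the first 20 residues of the protein sequence. The `translate` helper and the constant names are hypothetical, written for this illustration only, and the amino acid string is the standard NCBI codon ordering (bases in the order T, C, A, G).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating alternative start codon is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_60 = "GTGCCTAGCC CGAAGATCAG CACCTACCGC GAGATCGTCC CTCTCATCTA CTCGTGGCGG"
print(translate(first_60))  # MPSPKISTYREIVPLIYSWR
```

Passing the full gene sequence to the same helper should stop at the terminal TGA codon.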