Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1105 |
Symbol | |
ID | 8446701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1227836 |
End bp | 1231036 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645040242 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_003200501 |
Protein GI | 258651345 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.147445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTTAC ACGGGGGAAT GAGCGAGAGC GAATGGGAAC AGCTCGCTCT GGACACCTTG GGCGAACTCG GCTGGCAACC AGTGCACGGC AAGGACGTCG CACCGGGCAG TGGAGAGCGG GAGAAGTGGG AGGACCTGCA CATCCCGTCC CGCATGCTGG CCGCGATGCG AAGGCTGAAC CCGGAGGTGC CCGAGCAGTA TCTGCAGCAG GCGCTTGCCG AGATCGTCAC GCCGACGTCA AATGACGCGA TCACCGAGAA CTACCGGCTG CACATGGCGG TCGCCGAGGG ATACCGGGGG ATCACCTACA TCGATCACGA TGGCGCCGAA CAGAATCCGA CGATCCGCCT GGTCAGTCAG GACCCCGAGG CCAACGACTG GCTCGCCGTC GGCCAGGTCA CCATTTTCCA GGGGGAGTTC GAGCGTCGCT TCGACATCGT GCTGTACTGC AACGGAATGC CGGTGTCCAT CGTCGAACTC AAACGGGCCG GCAGCAAGTA CGCCGATCTG GCGTCCGCTC ACACGCAATT GCAGAACTAT CTCGGGCTGT TCCCGATGGC CTTCCGTTTC TGCGTCTTCA CCATCGTCTC CGACGGCATC ACCGCGAAGT ACGGCACCCC CTTCACCCCG CTGCACCACT TCTCGCCGTG GAACGTCGAC GAGCACGGCG CCGTCGTCAA TCCTGGCGAC CGCGACGCGC AGGGCAACGC CGACACCGCC ATCGAGGTCG CCCTGCACGG CCTCTACACC CACGACCGCT TCCTTGACCT GCAACGCAGT TACACCGCGT TCGACGAGGG CGCCGAAGGC TTGAGCAAGC GCATCGCCAA GCCGCATCAG TACGTCGCGG TCAGCAAGGC GCTCGCCAAG ACAATCGACG CGGTCGAGAG CGACGGGAAA GCAGGCGTCG TCTGGCACAC CCAGGGCTCC GGAAAGTCCA TGGAGATGGA GCTTTACACC AACCTGGTCA TCACCGCGCC GAAGCTGCTC AATCCCACCG TCATCGTCAT CACCGATCGC ACCGAACTGG ACGGCCAGCT GTTTCAGAGC TTCCGGGTCA GCCGCCTGCT GCCCGAAGAA CCCCAGCAGA TCCGTCGGCG ATTGGAACTC CGCGACGAAC TGTCCAACCG GATCAGCGGC GGCATCTACT TCACCACCCT GCAGAAATTC AGCAAGACCG GCGACGAGAA GCTGTCCGGT TCGGATCACC CCCTGCTGTC CAATCGCCGC AACATCATTG TGATGGTCGA CGAGGCGCAC CGCAGCCACT ACGACGACCT TGACGGCTAC GCCCGGCACC TGCGCGACGC GCTGCCGCAC GCCACCCTGA TCGCGTTCAC CGGCACCCCC ATCTCCTTCG ACGACCGCAA CACCCGCGAC GTGTTCGGCG ACTACGTCGA CATCTACGAC CTGAGCCGAG CGGTCCAGGA CGGCGCCACC GTGCCGGTGT ACTTCGAACC TCGGCTGATC AAGGTCGGCC TGGCCGAGGG CGTCACCCAG GAGCAGCTCA ACGAAGCCGC CGACCAGGCC ACCGAAGGCC TGGACGACGT CGAGCGGACC AAGATCGAAC AAGGCGTGGC GGTGATCAAC GCGGTCTACG GCGCGCCGGC CCGGCTGCGT GCGCTCGCCG GCGACCTGGT CGACCACTGG GAGCAACGGC GCACCGCGAT GCGACCGTTG ATCGGCGCGC ACGGCAAGGC GATGATCGTC GGCGGCACCC GGGAGATCTG CGCCCGGCTG TACGAGGAGA TCATCGCGCT GCGCCCGAAC TGGCACTCGC CCGACCTGCA CCAGGGCGTC ATCAAGGTCG TCTACTCGTC GGACTCGTCC GACACGGGAC TGATCGCGAA GCACCGACGG CGCGCGAGTG ACAACGCGAC GATCAAGCAG CGGCTCCGCG AGGTCGACGA TGAGCTCGAA TTGGTCATCG TCAAGGACAT GATGCTGACC GGTTACGACT CGCCGCCCCT GCACACCCTG TACCTGGACC GCCCGCTCAA GGGAGCGCTG CTCATGCAGA CCCTGGCCCG GGTCAACCGC ACATTCAAGG AGAAGGACGC TGGGCTCCTG GTCGGTTATG CGCCGCTGGC CGACAATCTG CAGCAAGCTC TGTCCGAATA CACCCAGCAG GATCAGCAGA ACAAACCGCT CGGCCGGGAC ACCGGAGAAG CCGTTGTGCT GGTCAAGGAA CTGCTGCAGG GCATCGAGAC GATGCTGGCC GGCTTCGACT GGCGCAGACT CATCGTTCCT CGCCAGCCCA AGTCCTACGC CATCGCGGCG ACCGCCACGG CGAACTACCT GCGGAGTCCC GAGACCCCAG GCAATCAGGT CGCCGAGGGT GAAGAACCAC TGCGGTCGCA GTTCCGGCAA ACGGCCGGGC GGCTGACCCG GGCTTGGGCC TTGGCGGCCG GCGACCCCGG ACTGGCGGAC CAGAAGTGGG ACGTGCAGTT CTACGAAGAG GTTCGCATCT ACATGGGCAA GTTCGACGCC GACGAACGCC AGGCCAACGG CGAGCCGGTC CCCGAGGAGA TCGAGCTGCT GCTCTCGTCG CTGATCGCCG AGTCGACCGC ATCGGGCGAA ATCATGGACA TTTACCAGGC GGCCGGGATT CCGAAGCCGT CGCTGTCCGA CCTGACCCCG GATTTCGCCC GAAAGGCGCA GCAGGCCCGC AACCCGCACC TGGCCATCGA GGCGCTGCGT AAGCTGGTCG CCGAGGAATC GGGCAAGGTG TCCCGGCATA ACCTGGTGCG CCAGCGGGCG TTCGGCGAAC GAATCGCCGA GCTGATGATC CGGTATGCCA ACCAGCAGCT GACCTCGGCC GAGGTGATCG CCGAACTGGT CGCCATGGCT CAGGAGATCA AGGCCGAAGC GGGGCGCGGG CAGCACTTCT CGCCCCCGCT GGACGAGGAT CAACTCGCGT TCTATGACGC GGTCGCGCAG AACCCCTCCG CGATCGACGT GCTCGGCGAG GGCAAGCTGG CCGACATCGC CCGGGACTTG GTCACGGTCA TGCAACGCGA CATCCGCACC GACTGGACCG TTCGCGAGGA CGTTCGGTCC CGCCTGCGGT CTTCGATCCG CCGGCTGTTG GCGATTCACA AGTACCCCCC GGATAAGGCC CAGGGGGCCA TCGAGTTGGT CATGGAGCAG ATGGAGTCGA TGGCTCCGAG GTACTCGGAG CAGCGTCGGG CCGGCTCGTA G
|
Protein sequence | MGLHGGMSES EWEQLALDTL GELGWQPVHG KDVAPGSGER EKWEDLHIPS RMLAAMRRLN PEVPEQYLQQ ALAEIVTPTS NDAITENYRL HMAVAEGYRG ITYIDHDGAE QNPTIRLVSQ DPEANDWLAV GQVTIFQGEF ERRFDIVLYC NGMPVSIVEL KRAGSKYADL ASAHTQLQNY LGLFPMAFRF CVFTIVSDGI TAKYGTPFTP LHHFSPWNVD EHGAVVNPGD RDAQGNADTA IEVALHGLYT HDRFLDLQRS YTAFDEGAEG LSKRIAKPHQ YVAVSKALAK TIDAVESDGK AGVVWHTQGS GKSMEMELYT NLVITAPKLL NPTVIVITDR TELDGQLFQS FRVSRLLPEE PQQIRRRLEL RDELSNRISG GIYFTTLQKF SKTGDEKLSG SDHPLLSNRR NIIVMVDEAH RSHYDDLDGY ARHLRDALPH ATLIAFTGTP ISFDDRNTRD VFGDYVDIYD LSRAVQDGAT VPVYFEPRLI KVGLAEGVTQ EQLNEAADQA TEGLDDVERT KIEQGVAVIN AVYGAPARLR ALAGDLVDHW EQRRTAMRPL IGAHGKAMIV GGTREICARL YEEIIALRPN WHSPDLHQGV IKVVYSSDSS DTGLIAKHRR RASDNATIKQ RLREVDDELE LVIVKDMMLT GYDSPPLHTL YLDRPLKGAL LMQTLARVNR TFKEKDAGLL VGYAPLADNL QQALSEYTQQ DQQNKPLGRD TGEAVVLVKE LLQGIETMLA GFDWRRLIVP RQPKSYAIAA TATANYLRSP ETPGNQVAEG EEPLRSQFRQ TAGRLTRAWA LAAGDPGLAD QKWDVQFYEE VRIYMGKFDA DERQANGEPV PEEIELLLSS LIAESTASGE IMDIYQAAGI PKPSLSDLTP DFARKAQQAR NPHLAIEALR KLVAEESGKV SRHNLVRQRA FGERIAELMI RYANQQLTSA EVIAELVAMA QEIKAEAGRG QHFSPPLDED QLAFYDAVAQ NPSAIDVLGE GKLADIARDL VTVMQRDIRT DWTVREDVRS RLRSSIRRLL AIHKYPPDKA QGAIELVMEQ MESMAPRYSE QRRAGS
|
| |