Gene Namu_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1105 
Symbol 
ID8446701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1227836 
End bp1231036 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content65% 
IMG OID645040242 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_003200501 
Protein GI258651345 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.147445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTAC ACGGGGGAAT GAGCGAGAGC GAATGGGAAC AGCTCGCTCT GGACACCTTG 
GGCGAACTCG GCTGGCAACC AGTGCACGGC AAGGACGTCG CACCGGGCAG TGGAGAGCGG
GAGAAGTGGG AGGACCTGCA CATCCCGTCC CGCATGCTGG CCGCGATGCG AAGGCTGAAC
CCGGAGGTGC CCGAGCAGTA TCTGCAGCAG GCGCTTGCCG AGATCGTCAC GCCGACGTCA
AATGACGCGA TCACCGAGAA CTACCGGCTG CACATGGCGG TCGCCGAGGG ATACCGGGGG
ATCACCTACA TCGATCACGA TGGCGCCGAA CAGAATCCGA CGATCCGCCT GGTCAGTCAG
GACCCCGAGG CCAACGACTG GCTCGCCGTC GGCCAGGTCA CCATTTTCCA GGGGGAGTTC
GAGCGTCGCT TCGACATCGT GCTGTACTGC AACGGAATGC CGGTGTCCAT CGTCGAACTC
AAACGGGCCG GCAGCAAGTA CGCCGATCTG GCGTCCGCTC ACACGCAATT GCAGAACTAT
CTCGGGCTGT TCCCGATGGC CTTCCGTTTC TGCGTCTTCA CCATCGTCTC CGACGGCATC
ACCGCGAAGT ACGGCACCCC CTTCACCCCG CTGCACCACT TCTCGCCGTG GAACGTCGAC
GAGCACGGCG CCGTCGTCAA TCCTGGCGAC CGCGACGCGC AGGGCAACGC CGACACCGCC
ATCGAGGTCG CCCTGCACGG CCTCTACACC CACGACCGCT TCCTTGACCT GCAACGCAGT
TACACCGCGT TCGACGAGGG CGCCGAAGGC TTGAGCAAGC GCATCGCCAA GCCGCATCAG
TACGTCGCGG TCAGCAAGGC GCTCGCCAAG ACAATCGACG CGGTCGAGAG CGACGGGAAA
GCAGGCGTCG TCTGGCACAC CCAGGGCTCC GGAAAGTCCA TGGAGATGGA GCTTTACACC
AACCTGGTCA TCACCGCGCC GAAGCTGCTC AATCCCACCG TCATCGTCAT CACCGATCGC
ACCGAACTGG ACGGCCAGCT GTTTCAGAGC TTCCGGGTCA GCCGCCTGCT GCCCGAAGAA
CCCCAGCAGA TCCGTCGGCG ATTGGAACTC CGCGACGAAC TGTCCAACCG GATCAGCGGC
GGCATCTACT TCACCACCCT GCAGAAATTC AGCAAGACCG GCGACGAGAA GCTGTCCGGT
TCGGATCACC CCCTGCTGTC CAATCGCCGC AACATCATTG TGATGGTCGA CGAGGCGCAC
CGCAGCCACT ACGACGACCT TGACGGCTAC GCCCGGCACC TGCGCGACGC GCTGCCGCAC
GCCACCCTGA TCGCGTTCAC CGGCACCCCC ATCTCCTTCG ACGACCGCAA CACCCGCGAC
GTGTTCGGCG ACTACGTCGA CATCTACGAC CTGAGCCGAG CGGTCCAGGA CGGCGCCACC
GTGCCGGTGT ACTTCGAACC TCGGCTGATC AAGGTCGGCC TGGCCGAGGG CGTCACCCAG
GAGCAGCTCA ACGAAGCCGC CGACCAGGCC ACCGAAGGCC TGGACGACGT CGAGCGGACC
AAGATCGAAC AAGGCGTGGC GGTGATCAAC GCGGTCTACG GCGCGCCGGC CCGGCTGCGT
GCGCTCGCCG GCGACCTGGT CGACCACTGG GAGCAACGGC GCACCGCGAT GCGACCGTTG
ATCGGCGCGC ACGGCAAGGC GATGATCGTC GGCGGCACCC GGGAGATCTG CGCCCGGCTG
TACGAGGAGA TCATCGCGCT GCGCCCGAAC TGGCACTCGC CCGACCTGCA CCAGGGCGTC
ATCAAGGTCG TCTACTCGTC GGACTCGTCC GACACGGGAC TGATCGCGAA GCACCGACGG
CGCGCGAGTG ACAACGCGAC GATCAAGCAG CGGCTCCGCG AGGTCGACGA TGAGCTCGAA
TTGGTCATCG TCAAGGACAT GATGCTGACC GGTTACGACT CGCCGCCCCT GCACACCCTG
TACCTGGACC GCCCGCTCAA GGGAGCGCTG CTCATGCAGA CCCTGGCCCG GGTCAACCGC
ACATTCAAGG AGAAGGACGC TGGGCTCCTG GTCGGTTATG CGCCGCTGGC CGACAATCTG
CAGCAAGCTC TGTCCGAATA CACCCAGCAG GATCAGCAGA ACAAACCGCT CGGCCGGGAC
ACCGGAGAAG CCGTTGTGCT GGTCAAGGAA CTGCTGCAGG GCATCGAGAC GATGCTGGCC
GGCTTCGACT GGCGCAGACT CATCGTTCCT CGCCAGCCCA AGTCCTACGC CATCGCGGCG
ACCGCCACGG CGAACTACCT GCGGAGTCCC GAGACCCCAG GCAATCAGGT CGCCGAGGGT
GAAGAACCAC TGCGGTCGCA GTTCCGGCAA ACGGCCGGGC GGCTGACCCG GGCTTGGGCC
TTGGCGGCCG GCGACCCCGG ACTGGCGGAC CAGAAGTGGG ACGTGCAGTT CTACGAAGAG
GTTCGCATCT ACATGGGCAA GTTCGACGCC GACGAACGCC AGGCCAACGG CGAGCCGGTC
CCCGAGGAGA TCGAGCTGCT GCTCTCGTCG CTGATCGCCG AGTCGACCGC ATCGGGCGAA
ATCATGGACA TTTACCAGGC GGCCGGGATT CCGAAGCCGT CGCTGTCCGA CCTGACCCCG
GATTTCGCCC GAAAGGCGCA GCAGGCCCGC AACCCGCACC TGGCCATCGA GGCGCTGCGT
AAGCTGGTCG CCGAGGAATC GGGCAAGGTG TCCCGGCATA ACCTGGTGCG CCAGCGGGCG
TTCGGCGAAC GAATCGCCGA GCTGATGATC CGGTATGCCA ACCAGCAGCT GACCTCGGCC
GAGGTGATCG CCGAACTGGT CGCCATGGCT CAGGAGATCA AGGCCGAAGC GGGGCGCGGG
CAGCACTTCT CGCCCCCGCT GGACGAGGAT CAACTCGCGT TCTATGACGC GGTCGCGCAG
AACCCCTCCG CGATCGACGT GCTCGGCGAG GGCAAGCTGG CCGACATCGC CCGGGACTTG
GTCACGGTCA TGCAACGCGA CATCCGCACC GACTGGACCG TTCGCGAGGA CGTTCGGTCC
CGCCTGCGGT CTTCGATCCG CCGGCTGTTG GCGATTCACA AGTACCCCCC GGATAAGGCC
CAGGGGGCCA TCGAGTTGGT CATGGAGCAG ATGGAGTCGA TGGCTCCGAG GTACTCGGAG
CAGCGTCGGG CCGGCTCGTA G
 
Protein sequence
MGLHGGMSES EWEQLALDTL GELGWQPVHG KDVAPGSGER EKWEDLHIPS RMLAAMRRLN 
PEVPEQYLQQ ALAEIVTPTS NDAITENYRL HMAVAEGYRG ITYIDHDGAE QNPTIRLVSQ
DPEANDWLAV GQVTIFQGEF ERRFDIVLYC NGMPVSIVEL KRAGSKYADL ASAHTQLQNY
LGLFPMAFRF CVFTIVSDGI TAKYGTPFTP LHHFSPWNVD EHGAVVNPGD RDAQGNADTA
IEVALHGLYT HDRFLDLQRS YTAFDEGAEG LSKRIAKPHQ YVAVSKALAK TIDAVESDGK
AGVVWHTQGS GKSMEMELYT NLVITAPKLL NPTVIVITDR TELDGQLFQS FRVSRLLPEE
PQQIRRRLEL RDELSNRISG GIYFTTLQKF SKTGDEKLSG SDHPLLSNRR NIIVMVDEAH
RSHYDDLDGY ARHLRDALPH ATLIAFTGTP ISFDDRNTRD VFGDYVDIYD LSRAVQDGAT
VPVYFEPRLI KVGLAEGVTQ EQLNEAADQA TEGLDDVERT KIEQGVAVIN AVYGAPARLR
ALAGDLVDHW EQRRTAMRPL IGAHGKAMIV GGTREICARL YEEIIALRPN WHSPDLHQGV
IKVVYSSDSS DTGLIAKHRR RASDNATIKQ RLREVDDELE LVIVKDMMLT GYDSPPLHTL
YLDRPLKGAL LMQTLARVNR TFKEKDAGLL VGYAPLADNL QQALSEYTQQ DQQNKPLGRD
TGEAVVLVKE LLQGIETMLA GFDWRRLIVP RQPKSYAIAA TATANYLRSP ETPGNQVAEG
EEPLRSQFRQ TAGRLTRAWA LAAGDPGLAD QKWDVQFYEE VRIYMGKFDA DERQANGEPV
PEEIELLLSS LIAESTASGE IMDIYQAAGI PKPSLSDLTP DFARKAQQAR NPHLAIEALR
KLVAEESGKV SRHNLVRQRA FGERIAELMI RYANQQLTSA EVIAELVAMA QEIKAEAGRG
QHFSPPLDED QLAFYDAVAQ NPSAIDVLGE GKLADIARDL VTVMQRDIRT DWTVREDVRS
RLRSSIRRLL AIHKYPPDKA QGAIELVMEQ MESMAPRYSE QRRAGS