Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_3845 |
Symbol | |
ID | 4030180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 4222057 |
End bp | 4224993 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637972234 |
Product | DNA modification methyltransferase-related protein |
Protein accession | YP_579008 |
Protein GI | 92119279 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0841462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGAAC GGGTCGAGCA GATCGAGGCA TTTGTTGCCT ATGCGAAAAC GTTAAAGGGT GACGAGAAGG GCGAAGCACA GGTGTTCTGT GATCGCCTTT TCCAAGCTTT TGGCCACGAA GGTTATAAGG AAGCCGGCGC GGAACTGGAG AGTCGGGTGA AGAAGGCGTC CGGAAAGGGC GTCAACTTCG CAGACTTGAT CTGGAAACCC CGGGTTCTGA TCGAAATGAA GAAAAGCAGC GAAAAGCTGC ATCTTCATTA CCAGCAAGCC TTCGATTACT GGCTGAACGC GGTCCCTAAC CGCCCGCGAT ATGTGGTGCT CTGCAATTTC AAAGAGTTCT GGATTTACGA CTTTGATAAG CAATTAAACG AGCCAGTAGA CGTCGTCCGG CTTCAAGACC TGCCCGCCCG GTACACGGCG CTAAACTTTC TTTTTCCAGA CAATCCAGAC CCGCTGTTTG GCAACGATCG CGAAGAGGTC TCGCGTGTAG CGGCCTCAAA GGTCGCGCAG TTATTTCGGT CGATGGTCGC TCGCGGCATT CCGCGAGAGC AGGCACAACG ATTTGTACTG CAGGCCGTGG TGGCGATGTT TGCTGAAGAT ATCGACATGA TGCCGGCCGG GACGACCCTG CGGCTAGTGC AGGACTGCCT GGAGCACGGC CAAAATTCGT ACGACGTGTT CGGTGGCCTG TTTCTCCAAA TGAACAATAA GGCGGCGGCG CAGGGCGGCC GCTACAAGGG AGTTCCTTAT TTTAACGGCG GGCTATTTGC GACGGTCCAG CCGATCGAAT TGACTACGGA CGAGCTAGAG TTGCTCGGCA AGAAGGATGA AGGTGCTGCT TGGCAAAACT GGGCCAAGAT CAACCCTGCC ATCTTCGGCA CCATTTTCCA ACAGAGCATG GACAAGGGGG AGCGGCATGC GTTCGGCGCG CACTTCACCC ATGAGGCCGA CATTCAGCGG ATTGTCGGGC CCACGATTGT GCGTCCCTGG CGCGAACGCA TCGATGCAGC GAAGACCATG GCGGAGCTGC TGGAGATTCG CAAAGCGCTT CTCAATTTCC GCGTCCTCGA TCCCGCCTGC GGAAGCGGCA ATTTTCTGTA CGTGGCCTAC AGAGAGATGG TGCGTCTCGA AATCAAGCTC ATGGCCAGAC TGGACAAGGA GTTTAGCTGG AAGACCGTAC AAAAGCAGGC TCAGGCCACA TCGCTCATCA GCCCTCGCCA GTTTTTTGGT GTCGAGCGGG ATTCGTTCGG CGTCGAGTTG ACCAAGGTCA CCCTAATGCT GGCAAAAAAG CTGGCCCTAG ACGAGGCCGC CGATGTTTTG GAGCGCGACC AGATTGAGTT GCCATTGGCG GAGGATGAGG CGCTCCCACT GGACAACCTC GATGGCAACA TTCTTTGCCG CGATGCGCTC CTATCGGACT GGCCCGAAGT AGACACCATT ATCGGAAATC CCCCGTACCA AAGCAAAAAC AAGGCACAGC AAGAGTTCGG GCGTGCCTAT CTGAACAAGA TTCGATCGGT TTTCCCGGAG ATTGACGGAA GGGCCGATTA TTGCGTCTAC TGGTTTAGAA AAGCGCACGA CCAGCTGAAG CAAGGCCAAA GAGCTGGTCT CGTCGGCACC AATACGATCC GGCAAAACTA TTCCCGAATC AGCGGGCTGG ATTACATAGC CAAGCACAAC GGTACGATTA CGGAAGCGGT CTCTACCATG CCGTGGTCGG GCGACGCGGT CGTGCACGTT TCCATCGTCA ACTGGGTGAA AGGCGAGGAT GACGGCAAGA AACGCCTGTA CATTCAGTCA GGCAATGATC CGGCCGGCGG CTGGGATTAC AAGGACCTCG ACGAAATCAA CACCTCGCTT TCGTTTTCAA CGGATGTGAG CCAGGCGCAA CGCATCAATG CGAACGCTGA AAAGGGCGGT TGCTATCAGG GCCAGACACA CGGGCATAAG GGTTTTCTCC CGGAACCGGC CGAAGCGAAG GCGATGATCA AGGCCAGCAA GGCAAACGCT AAGGTCCTCT TCCCATTTTT GATCGCCGAC GATTTCTTGG GTGCGGTAGA CAAACTCGAA TGCAGATACG TCATCGATTT CCAAACCCGC GACCTCCTCC AGGCCAAGGC GTTCAAAAGA CCGTTTGAGC ATCTTGAAAA GACGGTCCTT CCTACCCGAA AGGAAGCTGC AAAGAAGGAA AAGGATCGAA ACAAGGAAGC TTTGGACGCC GACCCGGAAG CCAAGGTCAA CAAGCACCAC GAAAACTTTC TAAAGCGCTG GTGGCTGATG TCTTACGCGC GCGAGGACCT GATGCAGACG TTGGCTCCTT TGAGCCGCTA CATCGTTTGC GCACGCGTTA CGCACAGGCC AATCTTTGAA TTCGTCTCGA CAGCCATTCA TCCGAATGAC GCACTGAGCG TTTTCGCCTT GGAGGATGAT TACTCCTTTG GAATCCTTCA ATCGGGCATC CATTGGGAGT GGTTTATCAA TCGATGCTCG ACCCTCAAGG CTGACTTTCG CTACACTTCG GATACTGTCT TTGATAGTTT TCCGTGGCCC CAGGAACCCA GTGCCGATGC GGTGCGCCTG GTCGCGAAGC GAGCTGTCGA GGTTAGGCAA CTTCGGTCTA AGCTGAAGGT CAAACATCAC CTGTCGCTAA GGGAGTTGTA TCGAGCAATC GAAGGTCCTG GAGAACACGC TCTCAAGAAA GCCCACAAGC TTCTGGACGA GGCCGTGCGC GGAGCTTACG GCATGTCTAA GAAGGCGGAT GTATTAGAAA CATTACTGGA ACTGAACGAG ACCGTAGTAG CTGCGGAGGC CGACGGAAAA CAAGTCGTCG GCCCTGGAAT CCCGCCTTCG GCCTCGAAGC TAAAGAACCT CGTCACTACT GATAAGCTGA CGATCTCGCC GACGAGTTGG GCCAATAATG CTCCTGTAAA AACGTGA
|
Protein sequence | MSERVEQIEA FVAYAKTLKG DEKGEAQVFC DRLFQAFGHE GYKEAGAELE SRVKKASGKG VNFADLIWKP RVLIEMKKSS EKLHLHYQQA FDYWLNAVPN RPRYVVLCNF KEFWIYDFDK QLNEPVDVVR LQDLPARYTA LNFLFPDNPD PLFGNDREEV SRVAASKVAQ LFRSMVARGI PREQAQRFVL QAVVAMFAED IDMMPAGTTL RLVQDCLEHG QNSYDVFGGL FLQMNNKAAA QGGRYKGVPY FNGGLFATVQ PIELTTDELE LLGKKDEGAA WQNWAKINPA IFGTIFQQSM DKGERHAFGA HFTHEADIQR IVGPTIVRPW RERIDAAKTM AELLEIRKAL LNFRVLDPAC GSGNFLYVAY REMVRLEIKL MARLDKEFSW KTVQKQAQAT SLISPRQFFG VERDSFGVEL TKVTLMLAKK LALDEAADVL ERDQIELPLA EDEALPLDNL DGNILCRDAL LSDWPEVDTI IGNPPYQSKN KAQQEFGRAY LNKIRSVFPE IDGRADYCVY WFRKAHDQLK QGQRAGLVGT NTIRQNYSRI SGLDYIAKHN GTITEAVSTM PWSGDAVVHV SIVNWVKGED DGKKRLYIQS GNDPAGGWDY KDLDEINTSL SFSTDVSQAQ RINANAEKGG CYQGQTHGHK GFLPEPAEAK AMIKASKANA KVLFPFLIAD DFLGAVDKLE CRYVIDFQTR DLLQAKAFKR PFEHLEKTVL PTRKEAAKKE KDRNKEALDA DPEAKVNKHH ENFLKRWWLM SYAREDLMQT LAPLSRYIVC ARVTHRPIFE FVSTAIHPND ALSVFALEDD YSFGILQSGI HWEWFINRCS TLKADFRYTS DTVFDSFPWP QEPSADAVRL VAKRAVEVRQ LRSKLKVKHH LSLRELYRAI EGPGEHALKK AHKLLDEAVR GAYGMSKKAD VLETLLELNE TVVAAEADGK QVVGPGIPPS ASKLKNLVTT DKLTISPTSW ANNAPVKT
|
| |