Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_1841 |
Symbol | |
ID | 3997502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | - |
Start bp | 1937996 |
End bp | 1940857 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637959585 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_566474 |
Protein GI | 91773782 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAC AATCGGAAGC TGTATTGGAA GAGCAGTTGG TCGGGCAACT TTGTGAATTG GGTTATTCTT TAGTGAGGAT AAAGGATGAG GCGGAGCTGA TTGCTAATTT GAAGGGGCAA TTGGAGAAGC ATAATGAGAT CGTGTTCTCG GATAATGAGT TCAAAAAAGT GATGAACTTA CTTAGCAAGG GTAGTGTGTT CGAGAAGGCT AAGAACTTGC GGGAAAAGCA ACATATTCTG CGTGATAATG GGGATGACCT TTATTTTGAC TTCCTGAACA CGGAACATTG GTGTCAGAAT GAATATCAGG TGACGCATCA GGTAGCGATG GAAGGGAAAT ATAAGAACCG CTATGATGTG ACGTTATTGG TCAATGGTCT GCCATTGGTA CAGATAGAAC TCAAAAGGCG TGGACTTGAA CTAAAAGAGG CGTTCAATCA GATACAACGC TATCAAAAGC ACTCATTTGG CTCACACGCT GCATTATTCC AGTATATTCA GATATTTGTC ATCAGCAACG GTGTGAACAC AAAATACTAC GCCAACAATC GTAAGCAATC CTTCAAGCAG ACATTCTACT GGACAGACGA AGAGAACCAG CGTCTCAGCA ATATCCTTAA CGGCTTCACA AGGGATTTTC TGGACCCTTG CAACCTCTCC AAAATGATAT GTCGCTACAT CGTACTGAAC GAAGCCCAGA AAATACTCAT GGTACTTCGC CCCTATCAGT TCTATGCAGT AGAAGCCCTC ATTGACAGAG TAAAACACAG CCATAAGAAC GCTTACATCT GGCACACAAC CGGCTCAGGA AAAACACTGA CATCCTTTAA AGCCAGTCAG ATATTAACAA AACTGCCCAC AGTCAAAAAA GTAGTCTTTG TAGTCGATAG AAAAGACCTC GACTATCAAA CAACAAAAGA ATTCAACAGT TTTAGCAAAG GCTGCATCGA TGGTACGGAC AACACAAAAC AACTGGTCAA ACAACTTAGT GACGACACAG CACTCATTGT CACCACCATT CAAAAACTAA ATGTTGCCAT CAGCAAAAAA CAATATCTTG AAAAAATGGA GCAGCTCAGA GACCAGCGTA TAGTCTTCAT CTTTGACGAA TGCCACCGCT CACAATTTGG CGAAACCCAC AAACGCATAA ATCAATTCTT CAAGAATCCT CAAATGTTCG GTTTCACAGG CACACCCATA TTTGCCGAAA ACGCCATCAA GAACGAATCT GGAAGACGCA CCACCAAAGA TCTCTTTGGC GAATGTCTTC ATAAATACGT CATAACCGAT GCAATAGCCG ATGAGAACGT CCTGAAATTC TCCGTAGAAT ACGTAGGACG CTACAAACAA AAAGAAGGTC GGGAAACCAG CATTGACATA GAGGTAGAGG ACATCGATAC AAATGAGTTA ATGGAATCCC CTGAGCGTCT GGAGAAAATA TCAGATTATA TAATAGCCCA TCACTCCCGT AAAACACACA ACAGGGAATT CACCGCAATA TTTACCGTAA GCAGCATAAA AACCCTGATA AATTACTATG ATATACTGCA AAGAAAGAAA GCAGAAGGCA AGCATGATCT AAAAATTGCG ACAATCTTCA GTTATACCGC CAACGAAGAA GATCCCGATG CTAACGGCTT CTTCAGTGAT GACACTCAAA TATTAGATCC AAAGGCAAAA TACGGCAGCA CAATATCCAA ACACAGCCGT GAAAAACTGG ATGAATATAT CGAAGACTAC AACAAGCTCT TTAACAGTAA ATTCTCCACC AAAGACAGCC AATCCTTCTA CAACTACTAC AATGATATCT CTAAAAAAGT AAAAGAGAAA AAAATCGACA TTCTGATAGT CGTGAACATG TTCCTCACAG GTTTTGATAG TCCATTCCTG AACACTATCT ATGTAGATAA GAACCTGAAA TATCACGGAC TTATTCAGGC ATATTCACGT ACCAACCGGA TACTCAACGA ACAGAAATCA CAGGGTAATG TAGTAGCATT CAGGAACCTG AAAAAAGCAA CAGATGCTGC TATTACATTA TTTAGCAACA CCGAAGCCAT CGAAGTCATT ATAATGCAAC CCTACGAGGA CTATATTGCA AAATTCGATG AAAAATATGA TGCACTTCAA AGAATAACAC CAACCGTAGA AAGTGTAAAT ACATTATCCT CTGAAAATGA AGAACTGGAA TTCATTACCA CATTCAGAGA AATAATGCGT ATATTAAATA TCACTAAAGC ATTTGCAGAC TTCAAATGGT CAGACCTTTC AATGACAGAG CAACTTTTCG AAGACTACAA AAGTAAATAT CTCGACCTTT ACGACAAAGT AAAAAGCGAT CATCAAAAAG AGAAAGTTTC CATCTTAGAA GATGTGGATT TCGAACTGGA ATTAATTCAC CGTGATGAAA TAAACGTTAA TTACATCATC CAGTTACTCA TCCAGCTAAA ATTCGATGCT CAAAAAGATG TGGATCAAGT CGAAAAGGAA ATATCCAGAG TATTAAGTTC TGAATCAACT CTCAGAAGCA AAAAAGAATT GATAGAGAAA TTCATTCAAG AGAATCTCCC TGATATCCAT ACAAGCGATA GTGTTTCAGA AGAATTTGAT CAATTCTGGA ACAATGAACA GCAAAAGGCC TTTGAGCAAC TGGTAAAAGA CGAAGTTCTA TCAGAAGAGC GAACCCAAAA ACTCATAGAA AACTATCTAT ACGCTGAACA AGAACCACTA TTAGATGAGA TATTGGAATT AATAGAAGGT GCACAACCAT CCGTACTGAA GCGCAAAAAG ATAGGTGCTA GAATATTGGA AAAGATAAAG GATTTTATTG ATACTTTCAT TGATGGTATG GATTCAAATT GA
|
Protein sequence | MSTQSEAVLE EQLVGQLCEL GYSLVRIKDE AELIANLKGQ LEKHNEIVFS DNEFKKVMNL LSKGSVFEKA KNLREKQHIL RDNGDDLYFD FLNTEHWCQN EYQVTHQVAM EGKYKNRYDV TLLVNGLPLV QIELKRRGLE LKEAFNQIQR YQKHSFGSHA ALFQYIQIFV ISNGVNTKYY ANNRKQSFKQ TFYWTDEENQ RLSNILNGFT RDFLDPCNLS KMICRYIVLN EAQKILMVLR PYQFYAVEAL IDRVKHSHKN AYIWHTTGSG KTLTSFKASQ ILTKLPTVKK VVFVVDRKDL DYQTTKEFNS FSKGCIDGTD NTKQLVKQLS DDTALIVTTI QKLNVAISKK QYLEKMEQLR DQRIVFIFDE CHRSQFGETH KRINQFFKNP QMFGFTGTPI FAENAIKNES GRRTTKDLFG ECLHKYVITD AIADENVLKF SVEYVGRYKQ KEGRETSIDI EVEDIDTNEL MESPERLEKI SDYIIAHHSR KTHNREFTAI FTVSSIKTLI NYYDILQRKK AEGKHDLKIA TIFSYTANEE DPDANGFFSD DTQILDPKAK YGSTISKHSR EKLDEYIEDY NKLFNSKFST KDSQSFYNYY NDISKKVKEK KIDILIVVNM FLTGFDSPFL NTIYVDKNLK YHGLIQAYSR TNRILNEQKS QGNVVAFRNL KKATDAAITL FSNTEAIEVI IMQPYEDYIA KFDEKYDALQ RITPTVESVN TLSSENEELE FITTFREIMR ILNITKAFAD FKWSDLSMTE QLFEDYKSKY LDLYDKVKSD HQKEKVSILE DVDFELELIH RDEINVNYII QLLIQLKFDA QKDVDQVEKE ISRVLSSEST LRSKKELIEK FIQENLPDIH TSDSVSEEFD QFWNNEQQKA FEQLVKDEVL SEERTQKLIE NYLYAEQEPL LDEILELIEG AQPSVLKRKK IGARILEKIK DFIDTFIDGM DSN
|
| |