Gene Mbur_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_1841 
Symbol 
ID3997502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp1937996 
End bp1940857 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content39% 
IMG OID637959585 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_566474 
Protein GI91773782 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACAC AATCGGAAGC TGTATTGGAA GAGCAGTTGG TCGGGCAACT TTGTGAATTG 
GGTTATTCTT TAGTGAGGAT AAAGGATGAG GCGGAGCTGA TTGCTAATTT GAAGGGGCAA
TTGGAGAAGC ATAATGAGAT CGTGTTCTCG GATAATGAGT TCAAAAAAGT GATGAACTTA
CTTAGCAAGG GTAGTGTGTT CGAGAAGGCT AAGAACTTGC GGGAAAAGCA ACATATTCTG
CGTGATAATG GGGATGACCT TTATTTTGAC TTCCTGAACA CGGAACATTG GTGTCAGAAT
GAATATCAGG TGACGCATCA GGTAGCGATG GAAGGGAAAT ATAAGAACCG CTATGATGTG
ACGTTATTGG TCAATGGTCT GCCATTGGTA CAGATAGAAC TCAAAAGGCG TGGACTTGAA
CTAAAAGAGG CGTTCAATCA GATACAACGC TATCAAAAGC ACTCATTTGG CTCACACGCT
GCATTATTCC AGTATATTCA GATATTTGTC ATCAGCAACG GTGTGAACAC AAAATACTAC
GCCAACAATC GTAAGCAATC CTTCAAGCAG ACATTCTACT GGACAGACGA AGAGAACCAG
CGTCTCAGCA ATATCCTTAA CGGCTTCACA AGGGATTTTC TGGACCCTTG CAACCTCTCC
AAAATGATAT GTCGCTACAT CGTACTGAAC GAAGCCCAGA AAATACTCAT GGTACTTCGC
CCCTATCAGT TCTATGCAGT AGAAGCCCTC ATTGACAGAG TAAAACACAG CCATAAGAAC
GCTTACATCT GGCACACAAC CGGCTCAGGA AAAACACTGA CATCCTTTAA AGCCAGTCAG
ATATTAACAA AACTGCCCAC AGTCAAAAAA GTAGTCTTTG TAGTCGATAG AAAAGACCTC
GACTATCAAA CAACAAAAGA ATTCAACAGT TTTAGCAAAG GCTGCATCGA TGGTACGGAC
AACACAAAAC AACTGGTCAA ACAACTTAGT GACGACACAG CACTCATTGT CACCACCATT
CAAAAACTAA ATGTTGCCAT CAGCAAAAAA CAATATCTTG AAAAAATGGA GCAGCTCAGA
GACCAGCGTA TAGTCTTCAT CTTTGACGAA TGCCACCGCT CACAATTTGG CGAAACCCAC
AAACGCATAA ATCAATTCTT CAAGAATCCT CAAATGTTCG GTTTCACAGG CACACCCATA
TTTGCCGAAA ACGCCATCAA GAACGAATCT GGAAGACGCA CCACCAAAGA TCTCTTTGGC
GAATGTCTTC ATAAATACGT CATAACCGAT GCAATAGCCG ATGAGAACGT CCTGAAATTC
TCCGTAGAAT ACGTAGGACG CTACAAACAA AAAGAAGGTC GGGAAACCAG CATTGACATA
GAGGTAGAGG ACATCGATAC AAATGAGTTA ATGGAATCCC CTGAGCGTCT GGAGAAAATA
TCAGATTATA TAATAGCCCA TCACTCCCGT AAAACACACA ACAGGGAATT CACCGCAATA
TTTACCGTAA GCAGCATAAA AACCCTGATA AATTACTATG ATATACTGCA AAGAAAGAAA
GCAGAAGGCA AGCATGATCT AAAAATTGCG ACAATCTTCA GTTATACCGC CAACGAAGAA
GATCCCGATG CTAACGGCTT CTTCAGTGAT GACACTCAAA TATTAGATCC AAAGGCAAAA
TACGGCAGCA CAATATCCAA ACACAGCCGT GAAAAACTGG ATGAATATAT CGAAGACTAC
AACAAGCTCT TTAACAGTAA ATTCTCCACC AAAGACAGCC AATCCTTCTA CAACTACTAC
AATGATATCT CTAAAAAAGT AAAAGAGAAA AAAATCGACA TTCTGATAGT CGTGAACATG
TTCCTCACAG GTTTTGATAG TCCATTCCTG AACACTATCT ATGTAGATAA GAACCTGAAA
TATCACGGAC TTATTCAGGC ATATTCACGT ACCAACCGGA TACTCAACGA ACAGAAATCA
CAGGGTAATG TAGTAGCATT CAGGAACCTG AAAAAAGCAA CAGATGCTGC TATTACATTA
TTTAGCAACA CCGAAGCCAT CGAAGTCATT ATAATGCAAC CCTACGAGGA CTATATTGCA
AAATTCGATG AAAAATATGA TGCACTTCAA AGAATAACAC CAACCGTAGA AAGTGTAAAT
ACATTATCCT CTGAAAATGA AGAACTGGAA TTCATTACCA CATTCAGAGA AATAATGCGT
ATATTAAATA TCACTAAAGC ATTTGCAGAC TTCAAATGGT CAGACCTTTC AATGACAGAG
CAACTTTTCG AAGACTACAA AAGTAAATAT CTCGACCTTT ACGACAAAGT AAAAAGCGAT
CATCAAAAAG AGAAAGTTTC CATCTTAGAA GATGTGGATT TCGAACTGGA ATTAATTCAC
CGTGATGAAA TAAACGTTAA TTACATCATC CAGTTACTCA TCCAGCTAAA ATTCGATGCT
CAAAAAGATG TGGATCAAGT CGAAAAGGAA ATATCCAGAG TATTAAGTTC TGAATCAACT
CTCAGAAGCA AAAAAGAATT GATAGAGAAA TTCATTCAAG AGAATCTCCC TGATATCCAT
ACAAGCGATA GTGTTTCAGA AGAATTTGAT CAATTCTGGA ACAATGAACA GCAAAAGGCC
TTTGAGCAAC TGGTAAAAGA CGAAGTTCTA TCAGAAGAGC GAACCCAAAA ACTCATAGAA
AACTATCTAT ACGCTGAACA AGAACCACTA TTAGATGAGA TATTGGAATT AATAGAAGGT
GCACAACCAT CCGTACTGAA GCGCAAAAAG ATAGGTGCTA GAATATTGGA AAAGATAAAG
GATTTTATTG ATACTTTCAT TGATGGTATG GATTCAAATT GA
 
Protein sequence
MSTQSEAVLE EQLVGQLCEL GYSLVRIKDE AELIANLKGQ LEKHNEIVFS DNEFKKVMNL 
LSKGSVFEKA KNLREKQHIL RDNGDDLYFD FLNTEHWCQN EYQVTHQVAM EGKYKNRYDV
TLLVNGLPLV QIELKRRGLE LKEAFNQIQR YQKHSFGSHA ALFQYIQIFV ISNGVNTKYY
ANNRKQSFKQ TFYWTDEENQ RLSNILNGFT RDFLDPCNLS KMICRYIVLN EAQKILMVLR
PYQFYAVEAL IDRVKHSHKN AYIWHTTGSG KTLTSFKASQ ILTKLPTVKK VVFVVDRKDL
DYQTTKEFNS FSKGCIDGTD NTKQLVKQLS DDTALIVTTI QKLNVAISKK QYLEKMEQLR
DQRIVFIFDE CHRSQFGETH KRINQFFKNP QMFGFTGTPI FAENAIKNES GRRTTKDLFG
ECLHKYVITD AIADENVLKF SVEYVGRYKQ KEGRETSIDI EVEDIDTNEL MESPERLEKI
SDYIIAHHSR KTHNREFTAI FTVSSIKTLI NYYDILQRKK AEGKHDLKIA TIFSYTANEE
DPDANGFFSD DTQILDPKAK YGSTISKHSR EKLDEYIEDY NKLFNSKFST KDSQSFYNYY
NDISKKVKEK KIDILIVVNM FLTGFDSPFL NTIYVDKNLK YHGLIQAYSR TNRILNEQKS
QGNVVAFRNL KKATDAAITL FSNTEAIEVI IMQPYEDYIA KFDEKYDALQ RITPTVESVN
TLSSENEELE FITTFREIMR ILNITKAFAD FKWSDLSMTE QLFEDYKSKY LDLYDKVKSD
HQKEKVSILE DVDFELELIH RDEINVNYII QLLIQLKFDA QKDVDQVEKE ISRVLSSEST
LRSKKELIEK FIQENLPDIH TSDSVSEEFD QFWNNEQQKA FEQLVKDEVL SEERTQKLIE
NYLYAEQEPL LDEILELIEG AQPSVLKRKK IGARILEKIK DFIDTFIDGM DSN