Gene Mbur_0920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_0920 
Symbol 
ID3998663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp991167 
End bp993476 
Gene Length2310 bp 
Protein Length769 aa 
Translation table11 
GC content45% 
IMG OID637958697 
ProductHef nuclease 
Protein accessionYP_565613 
Protein GI91772921 
COG category[L] Replication, recombination and repair 
COG ID[COG1111] ERCC4-like helicases
[COG1948] ERCC4-type nuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCAAG CAAATTTTGA ACCTATAGTA AATATGAGCG AACACATTAA GCACCCACTG 
GTCAAACCCA ATACAGTCGA GCAGCGATTG TACCAGCTTG ACCTTGCCGG AAAGGCACTA
TCAGCACCAA CACTGGTAGT GCTGCCAACA GGTCTTGGAA AAACGATAGT CGCCCTTCTT
GTAATAGCTT CACGCCTGCA AAAAACAGGC GGGAAAGCAC TCATACTGTC CCCCACAAAA
CCTCTTGTAG AGCAACATGC CGCTTTTTTA AGGTCCACAC TGAACATCCC CGAAGATGAG
ATACTGACCT TTACAGGTGC AGTAGCGCCT GACAAGAGAG AAGAGCTCTG GAAGAAGGGA
AAAGTGATCA TCTCCACCCC GCAGGTCATT GAGAACGATA TACTTACCAA GCGCATAAGC
CTGGAGGATG TCACTCACAT TACATTTGAT GAGGCTCATC GAGCTGTTGG CAATTATGCA
TATACATATA TAGCCGAAAG ATACTTTGAG GATGCCAAAA AACCACACTG CCTTGCGATA
ACTGCAAGTC CGGGAAGTTC TGATGAGAAG ATAAGTGAGG TATGCACGAA CCTCTACATC
AGATCGGTTG CGATAAAAAC AGAAACAGAC CCTGACGTAA CACCCTATAT CCATAAAAAA
GAGGTCGAAT GGAACCATGT GATACTGCCT TCTGAGATGA GGGAGTTGAA AGACCTTCTG
GAAAAGGTCC TTGAGGATAG ATTCCAAAAA CTAACAGAAC TTGGATATTC CATCCAATAC
GGCAAGAAAG CTTCAAAAAT GGACCTGCTC GGACTTCAGA AAAAGTTACA GGGACAGATC
AGAGAAATGG CCGAACCTGC AGTCTACAGC GCCCTATCGA TCCTTGCAGA GGTTATGAAG
GTAAGTCATG CTGTGGAGAT AGTAGAAACA CAGGGCATCG AAGCACTTAA AAAATATACA
GCCCGACTTG AGAACGAAGC CACCTCCAGA ACAGGAAGTA AAGCTTCAAA AAGGCTTTCT
GATGACCTCT ATATGAGACA GTTATACAAA AGGCTCGAGG AATGCACCAC AGAACATCCG
AAACTTGCCG TTGTAAAAGA TATCGTCTCA AAGGAACTAA ATGGCAAACC CGACTCCCGT
GTAATCGTCT TTACGAACTA TCGCGATACC TCTGAAATGG TGACGAACGC CCTTTCCGAA
ATAAAAGATA TAAGGCCTGT CAAATTTGTG GGACAAAGTT CCAAGTTCAA GGACAAGGGG
CTTACCCAAA AGCAACAGGT TGAGATCATT GAGAAGTTCA AGGCCGGAGA ATATAATGTC
CTTGTGGCAA CATCCGTTGC AGAAGAAGGA CTTGATATCC CAGCTACCGA CCTGGTAGTT
TTTTATGAAC CGGTCCCCTC CGAGATCAGA AGCATACAGA GAAAAGGCAG GACAGGAAGG
AAGCATGAAG GACGTGTCGT AGTACTTGTT ACAAAAGGAA CCCGGGACGA AGCATATTAC
TGGAGCTGTG CACATAAAGA AAAGCGTATG CAAAGCAATA TGCAGCAATG GCAGGAGAAT
ATGTCAGAGT TGAATAGAGC AAACAATGAA AATGACAAGA CCGACATTGC AAGTGAGTTT
AGGAGTGAGG AGGAACAACA GACCGGGCTT TCAGACTTCT CTGACGAGGA AGTGACGGTA
ATCCTCGACC AGAGAGAGAT CAGAAGCACC GTTGCACGCA GTCTTGAGAA ACTTGGATTC
AACATCGTTG TAAAAACACT TGAAGTAGGA GATTATATTG TAAGCGACCG GGTGGCTATC
GAGCGCAAGA GTACCGAAGA CTTTGTCAAT TCCTTACTTG ATCGCCATAT ATTCAGACAG
ATATCCGATC TTGCAGGAGC CTATGAAAAA CCCATACTCA TCATCGAAGG AGAAGGTTTG
TTCACCACAA GGATGGTGAA CCCAAACGCC ATACACGGAA TGCTTGCTTC ACTATCACTG
GATTTTGGAG TGTCAATACT TCACACAAGA GATGCAGAGG ACACTGCATC CCTGATCGGC
ATACTGGCAA AGAGAGAACA GATCGATGAA AAACGCAGTA CCAGTGTCCA CGGAAAAAAA
TCTTCAATGA TGCTATCACA ACAGCAGGAA TATATCGTAT CATCCATCAG TAACATCGGA
CCGAATGCTG CAAAGAATCT CCTTGACCAC TTCGGAACCG TGGAAAATGT TATGAAAGCA
GAGCTAGATG AATTAAAGGA AGTAAAGAAC ATTGGACCGA AGACCGCAGG AAAGATGCGG
GAAATACTTA GCAGCAAATA TAAAAATTGA
 
Protein sequence
MPQANFEPIV NMSEHIKHPL VKPNTVEQRL YQLDLAGKAL SAPTLVVLPT GLGKTIVALL 
VIASRLQKTG GKALILSPTK PLVEQHAAFL RSTLNIPEDE ILTFTGAVAP DKREELWKKG
KVIISTPQVI ENDILTKRIS LEDVTHITFD EAHRAVGNYA YTYIAERYFE DAKKPHCLAI
TASPGSSDEK ISEVCTNLYI RSVAIKTETD PDVTPYIHKK EVEWNHVILP SEMRELKDLL
EKVLEDRFQK LTELGYSIQY GKKASKMDLL GLQKKLQGQI REMAEPAVYS ALSILAEVMK
VSHAVEIVET QGIEALKKYT ARLENEATSR TGSKASKRLS DDLYMRQLYK RLEECTTEHP
KLAVVKDIVS KELNGKPDSR VIVFTNYRDT SEMVTNALSE IKDIRPVKFV GQSSKFKDKG
LTQKQQVEII EKFKAGEYNV LVATSVAEEG LDIPATDLVV FYEPVPSEIR SIQRKGRTGR
KHEGRVVVLV TKGTRDEAYY WSCAHKEKRM QSNMQQWQEN MSELNRANNE NDKTDIASEF
RSEEEQQTGL SDFSDEEVTV ILDQREIRST VARSLEKLGF NIVVKTLEVG DYIVSDRVAI
ERKSTEDFVN SLLDRHIFRQ ISDLAGAYEK PILIIEGEGL FTTRMVNPNA IHGMLASLSL
DFGVSILHTR DAEDTASLIG ILAKREQIDE KRSTSVHGKK SSMMLSQQQE YIVSSISNIG
PNAAKNLLDH FGTVENVMKA ELDELKEVKN IGPKTAGKMR EILSSKYKN