Gene Mboo_2156 details

Gene Information

Locus tag: Mboo_2156
Symbol:
ID: 5410132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2227311
End bp: 2228963
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 56%
IMG OID: 640869401
Product: DNA ligase I, ATP-dependent Dnl1
Protein accession: YP_001405313
Protein GI: 154151695
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID: [TIGR00574] DNA ligase I, ATP-dependent (dnl1)
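
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2227311
END_BP = 2228963
GENE_LENGTH_BP = 1653
PROTEIN_LENGTH_AA = 550


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: the span covers 1653 bp, which is 551 codons, or 550 residues plus a stop codon.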


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.521071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTTCT CTGAGTTCGC TCAGGTTTGC GAAGAACTGG AACATCTCTC CGGCCGTCTG 
GATATGATCG AGGTGATCAG CCGGGCGCTG CCGGATCTCG CACCGGATGA GCTCCCGGTC
TTTGTCCGGT TTGTCATGGG CAGGATCTTC CCCGACTGGA GTGCCCAAAA ACTGGGAATA
GGGCCGAATC TCCTCTATGA GGCAGTCCGC CAGGCCGCCG GGGTAAAACT GGAGACAGTC
ATCACCCGGA TTAATCAGCA GGGGGACGTT GGAAGAGCCG TTGAGGATAT CCTGGCAAAA
AAAACCCAGG TTTCCTGGTC ACACCAGGAT CTCGAACTGG TCTATGTCTA CAACAAACTA
ACGGGAATAT CCTCCCGTGG AGGAGTTACA TCCCAGAAAG AGAAGATCCG TATTGCCATG
CTCCTCTTAG GTGATGCCTC ACCACTTGAA GGACGGTACC TCGCCCGGAT CATGCTTGAA
GAGCTCCGGA TAGGGGTGGG TGAGGGAACC GTCCGTGAAG CAATCGCAAA AGCGTTTATG
GTTGACTCTG CACTGGTCGA GCATGCGATG CAGGCCATCA ACGATCTCGG GGAAGTGGCA
CGGCTTGCAA AGAAAGGGCC CACTGCCCTT TCGGATGTAC ATATAACGCC CTTCCATCCG
GTGAAGATGA TGCTTGCCCA GCAGGGAACG ATTGCCGGCA TGATCGAAGA TCACGGCGAG
ATTGCAGCAG AGTACAAGTA CGACGGGTCC CGTTTCCAGT TCCACAAGGA GGGGGCAAAA
GCGAGGATGT ACTCGCGCCG GCTCGAAGAT GTCAGCGAGG CGCTCCCCGA TGTGATTGAC
CTTCTTTCCA AAGCAACCTC CCATGATGTG ATCCTTGATG GTGAGGTGAT CGCGATCAAG
GACGACCGGC CCATGCCGTT CCAGTCGGTA CTCCGCAGGT TCCGGCGCCG GCATGATATC
GCCGAGGCAC AAGAGGCAAT ACGCATGGTG CCAAACGTCT TCGATATCCT CTACCTCGAT
GGCGAAACCC TCATCGATCT GCCATTCTTT GAACGGCGTA AGAAACTGGA AACCGTGGTC
GGGAAGTTCG TGGCACCACA AGTGGTAAGC ACAGATCCGC AGACAATAGA GCAGACATAC
CACGACGCAC TCGCCGCGGG GCATGAGGGA ATTATGCTTA AGGTACCGGC GTCCCCCTAC
ACCCCGGGCC AGCGGGGAAA GAACTGGATC AAGATCAAGC CAGAAGTGGA CACACTCGAC
CTCGCCGTTA TCGGGGCCGA GTGGGGCGAA GGGAAACGGG CCCATGTCTT TGGGTCGTTT
CTTGTTGCCT GCCAGGACCA GGGTAAGCTG ATCCCGCTCT CCCGGGTGGC CACCGGGTTT
TCAGACGAGC AGCTGACCGA GGTATACGAT CTCCTCAAGG ATGCAGTGAT CTCGCGCACT
GGAAAAGAGG TACGCTTTGA ACCGGAGCTG GTTTTTGAGG TAGGATATGC CGAACTCCAG
GTCAGCCCGA CCTATGATGC CGGATTTGCC CTCAGGTTCC CCCGGTTTAT CCGGATCCGC
GATGACAAGG ATACCACCGA GATCGAGACC CTGGAGAGCA TCAGGGGCCG GTACCAGCGG
CAAGCAAAAT CGGCACAGGC ATACACAAAA TAG
 
Protein sequence
MLFSEFAQVC EELEHLSGRL DMIEVISRAL PDLAPDELPV FVRFVMGRIF PDWSAQKLGI 
GPNLLYEAVR QAAGVKLETV ITRINQQGDV GRAVEDILAK KTQVSWSHQD LELVYVYNKL
TGISSRGGVT SQKEKIRIAM LLLGDASPLE GRYLARIMLE ELRIGVGEGT VREAIAKAFM
VDSALVEHAM QAINDLGEVA RLAKKGPTAL SDVHITPFHP VKMMLAQQGT IAGMIEDHGE
IAAEYKYDGS RFQFHKEGAK ARMYSRRLED VSEALPDVID LLSKATSHDV ILDGEVIAIK
DDRPMPFQSV LRRFRRRHDI AEAQEAIRMV PNVFDILYLD GETLIDLPFF ERRKKLETVV
GKFVAPQVVS TDPQTIEQTY HDALAAGHEG IMLKVPASPY TPGQRGKNWI KIKPEVDTLD
LAVIGAEWGE GKRAHVFGSF LVACQDQGKL IPLSRVATGF SDEQLTEVYD LLKDAVISRT
GKEVRFEPEL VFEVGYAELQ VSPTYDAGFA LRFPRFIRIR DDKDTTEIET LESIRGRYQR
QAKSAQAYTK
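
The correspondence between the gene and protein sequences can be spot-checked by translating a few codons. The sketch below is illustrative only: it assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons), and it translates just the first and last lines of the gene sequence rather than the full record. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0; '*' marks a stop codon."""
    dna = clean(dna)
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )


# First and last lines of the gene sequence in this record.
FIRST_LINE = "ATGCTGTTCT CTGAGTTCGC TCAGGTTTGC GAAGAACTGG AACATCTCTC CGGCCGTCTG"
LAST_LINE = "CAAGCAAAAT CGGCACAGGC ATACACAAAA TAG"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MLFSEFAQVCEELEHLSGRL
    print(translate(LAST_LINE))   # QAKSAQAYTK*
```

Both results match the start and end of the listed protein sequence, with the final TAG codon read as the stop.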