Gene Mpal_2781 details

Gene Information

Locus tag: Mpal_2781
Symbol:
ID: 7272662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2907964
End bp: 2909634
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 62%
IMG OID: 643571366
Product: DNA ligase I, ATP-dependent Dnl1
Protein accession: YP_002467759
Protein GI: 219853327
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID: [TIGR00574] DNA ligase I, ATP-dependent (dnl1)
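
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2907964
end_bp = 2909634

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1671 556
```

Under these assumptions the result agrees with the listed gene length (1671 bp) and protein length (556 aa).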


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.543529
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGTTCT CGATCTTTGC TCAGACCTGC GCCGCCCTCG AAGCACAGAA CGGCCGCCTC 
GAAATGAAGC ATGCGATCAG CGTCATTCTC CCGTCCCTCT CTGGAGAGGA CCTACCGATC
TTCATCCGGT TCCTGATGGG AAAGATCTTC CCTGACTGGT CGCCGCAGAA GCTCGGGATC
GGTCCGAACC TGCTCTACGA GGCGGTGGCC TACGTGGCAG GGACGAAGAA GACGGCCCTG
GTCGACCTGA TCAACAGAAC CGGAGATGCA GGGCTTGCTA TCGAACAGTT CCTTGCCACT
AAGGAGCAGA CGGCCTTCTT CACCGAGGAC CCGAGTCTCG CCGAGGTCTA CGCGGCCTGC
ACCCGGATCG CTGCCTCTGC AGGAGGCCGG TCCCAGCGCG AACGGTTGCT GGTCCTCCGA
CAACTCTTCG GTAACGTATC CCCCTTCGAG GCGCGGTATC TGGCCAGACT GATCCTCGGC
GAGCTTCGGA TCGGTATCGG GGAGGGGACG GTCCGGGATG CCATCGCTGA GGCCTACACG
GTCGAACCGG CGCAGGTCGA GCATGCGATG CAGGCGCTCA ATGACCTCGG CGAGGTGGCC
CTTCGGGCCA GAGAGGGTGA GGAGGGACTG ATCCACCTGA GCATCGCCCC GTTCAGGCCG
GTGAAGATGA TGCTCGCTCA GGCAGGGACG ACGATCCCAG AGATGCTGGC TGCTCACGGC
GAGGTGGCCG TGGAGTTCAA GTACGACGGA ACCCGGTTCC AGTTCCATAA AGAGGGGAAG
ACCTGCCGGA TCTACTCCCG GAAACTCGAG GAGGTGACCG ATGCCGTTCC CGAAGTCGGC
GAGGCCCTGC TCGGAGCCAC AGACCATGAC GTGATTCTGG ACGGCGAGGT GATCGCTATC
GGGGCTGACG GCCGGCCGCT CCCGTTCCAG ACCGTGCTCC GGCGTTTCCG GCGCAAGCAC
GGGATCGCGG CCGCCCGGGA GGCGATCACC CTGGTTCCAA GGGTCTTCGA CATCCTGTAC
CGCGATGGCG AGACCCTGAT CGACCTGCCA TTCCAGAGCC GGCGTGCGAT CCTATCAGCA
ACGATCGGTC CGGAGTACCT CGCCCCGCAG CAGGTGCTCT CCAGTGCCGA GGCGGTCGAC
CTTCTCTACC TGGAGGCTAT GGCCGAGGGG CACGAAGGGG TGATGCTCAA GGATCTCCTT
TCCCTGTACT CCCCAGGAGT CCGGGGGAAG CACTGGGTGA AGATCAAACC CGAGGTGGAG
ACCCTCGACC TTGTCGTCAT CGGTGCTGAA TGGGGCGAAG GACGGCGCGC CAGGACCTTC
GGTTCGTTTC TACTTGCCTG TCTGGATCAG GGGGTCTTCC GGGCGGTCAG CAAGGTGGCC
ACTGGCATAT CAGACGAGCA GCTGCAGGAG CTGTACACGC TCTTCAAAGA CCAGGTGATT
GCTGAATCAG GGAACACCGT CACCTTTGAA CCAACGGTGA TCTTCGAGGT CGGCTACGCC
GAGATCCAGA AGTCCCCGTC GTACGAGAGT GGGTACGCCC TCAGATTCCC GCGTTTTGTC
CAGGTCAGAG ACGACAAGGC GGTAGAGGAG ATCGAGACCC TTGAGAGCCT CACCACTCGG
TACCTGGCGC AGAAGACCCA GGCGAACGGA CAGCCTGAGT TCACGTTATA G
 
Protein sequence
MQFSIFAQTC AALEAQNGRL EMKHAISVIL PSLSGEDLPI FIRFLMGKIF PDWSPQKLGI 
GPNLLYEAVA YVAGTKKTAL VDLINRTGDA GLAIEQFLAT KEQTAFFTED PSLAEVYAAC
TRIAASAGGR SQRERLLVLR QLFGNVSPFE ARYLARLILG ELRIGIGEGT VRDAIAEAYT
VEPAQVEHAM QALNDLGEVA LRAREGEEGL IHLSIAPFRP VKMMLAQAGT TIPEMLAAHG
EVAVEFKYDG TRFQFHKEGK TCRIYSRKLE EVTDAVPEVG EALLGATDHD VILDGEVIAI
GADGRPLPFQ TVLRRFRRKH GIAAAREAIT LVPRVFDILY RDGETLIDLP FQSRRAILSA
TIGPEYLAPQ QVLSSAEAVD LLYLEAMAEG HEGVMLKDLL SLYSPGVRGK HWVKIKPEVE
TLDLVVIGAE WGEGRRARTF GSFLLACLDQ GVFRAVSKVA TGISDEQLQE LYTLFKDQVI
AESGNTVTFE PTVIFEVGYA EIQKSPSYES GYALRFPRFV QVRDDKAVEE IETLESLTTR
YLAQKTQANG QPEFTL
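
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The sketch assumes that translation table 11 assigns amino acids to codons the same way as the standard code, and that the first codon (GTG here) is read as methionine because it is the initiator.

```python
# Compact codon table: bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is the initiator and is read as 'M',
    even when it is an alternative start codon such as GTG.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "GTGCAGTTCT CGATCTTTGC TCAGACCTGC GCCGCCCTCG AAGCACAGAA CGGCCGCCTC"
print(translate(first_line))  # MQFSIFAQTCAALEAQNGRL
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and GC content.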