Gene Hlac_0878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0878 
Symbol 
ID7401248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp869148 
End bp870932 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content71% 
IMG OID643707943 
ProductDNA ligase I, ATP-dependent Dnl1 
Protein accessionYP_002565546 
Protein GI222479309 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID[TIGR00574] DNA ligase I, ATP-dependent (dnl1) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0334376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCG CAGCGTTCGC CGACCGGGCG GAGGAGCTGG CGGCCGAGCC CGCCGACATC 
GGAACGACTC GCCTCGTCAC CGACCTACTC GGGGCCGCCG GGGGAACCGA CGAAAACGAC
GACGATCTCG CGACGGTCAC CCGGTTCCTG CTCGGCCGCG TCTTCCCGGC CCACGACACC
CGGACGCTTG ACGTGGGACC GGCCCTCTGT CGCGAGGCGA TCGCCCGCGC CGCCGGGCCG
AACGTGACCG CCGACGACGT GGAGGACCGA CTGGCCGAGG AAGGCGAGAT CGGTGCGGTC
GCAGCAGGCT TCGAGTTCGG CGGCCAGCGA GGGCTCGCCG CCTTCGGCGA GGGGCGGGAC
CGGCTCACCG TCGCCGCGGT CGATGCGGAG CTACGCAGGT TGGCGGCCGC CGCCGGCGAC
GGTAGCGAGT CCCACAAGCG CGACGCCCTG TTCGGGCTGT TCAACCGGTG TGAGCCTGCC
GAGGCGAAGG TTATCGCCCG GCTCGTGCTC GGCGAGATGC GGCTCGGCGT CGGCGAGGGG
GCCGTCCGTG ACGCGATCGC AGAGGCGTTT CTCGCGGGGA ACCCGGAAGG CGACGAACGT
GACGAGAGCG ACACCGACGA CGATCCGATT CTCCGCGCCG GCGACGAGGC GGTCGTAGCC
GTCGAGCGCG CGCTTCAGGT GACCAACGAC TACGGCCGCG TCGCGGTCCT CGCCCGCGAT
GAGGGGCTCA ACGGGCTGCG CGCTGAGGGG CTGGCGGTCG GCCGACCGGT ACAGGCGATG
CTCGCGCAGG CCGGGACGGC GACCGACGCG GTCGAGGCGT TCGGCGAGGT CGCCGTCGAG
ACCAAGTTCG ACGGTGCGCG GGTGCAGGTC CACTACGTGC CCGAGTCGGC CGCCGAGGGC
GACGACGCTG CTGGCGGCAC CGAACTCGGC CCGCGGATCT ACTCGCGGAA CATGGACGAC
GTGACCGACG CCCTCCCGGA GGTCGTCGAG TACGTCGAAG CGCGCGTCTC GGTCCCTGTC
ATCCTCGACG GGGAAGTTGT GGCGGTCGAC GACGACGGCG ATCCCCTCCC GTTTCAAGAG
GTGCTGCGAC GCTTCCGGCG AAAACACGAC GTTGACCGGA TGCGGGAGGA GGTCGGGCTC
CGGCTCCACG CGTTCGACTG CCTCCACGCG GACGGCGAGG ACCTGCTCGA CGAACCGTTC
CGCGCCCGCC ACGACCGACT CGCCGAGGTT CTGTCCGACG CGGCCGCGAG CGTCGAATTC
GCGGGCGATC CGGCGGCGAT CGAGGCGGCG GAGGCGGCCG CGCTCGGCGC GGGCCACGAG
GGCGTGATGC TGAAGAATCC CGAGGCGGCG TACACTCCGG GTAACCGCGG TCGCGACTGG
CTGAAGCGCA AGCCCGACGT GGAGACGCTC GACGCGGTCG TGGTCGGCGC GGAGTGGGGC
GAGGGGCGCC GAGCGGAGCT GTTCGGGACG TTCCTGCTCG GCGTGCGGGC GGGAGACGAC
GAACTCGCCA CAATCGGCAA GGTCGCGACC GGACTCACCG ACGAGGAGCT CGCCGACCTC
ACCGAGCGGC TGGAGCCCCA CGTGGTGAGT GAGGACGGAA CCGAGATCGA AATCCGCCCT
GAGGTCGTCC TCGAAGTCGG GTACGAGGAG ATCCAGACCT CGCCGACCTA CTCGTCGGGC
TACGCCCTTC GATTCCCGCG GTTCGTCGGC GTGCGCGACG ACAAGTCGGT CGATGACGCC
GACTCATTGG AGCGCGTCGT GCGCCTCGCC GGGGATGAGA AGTGA
 
Protein sequence
MEFAAFADRA EELAAEPADI GTTRLVTDLL GAAGGTDEND DDLATVTRFL LGRVFPAHDT 
RTLDVGPALC REAIARAAGP NVTADDVEDR LAEEGEIGAV AAGFEFGGQR GLAAFGEGRD
RLTVAAVDAE LRRLAAAAGD GSESHKRDAL FGLFNRCEPA EAKVIARLVL GEMRLGVGEG
AVRDAIAEAF LAGNPEGDER DESDTDDDPI LRAGDEAVVA VERALQVTND YGRVAVLARD
EGLNGLRAEG LAVGRPVQAM LAQAGTATDA VEAFGEVAVE TKFDGARVQV HYVPESAAEG
DDAAGGTELG PRIYSRNMDD VTDALPEVVE YVEARVSVPV ILDGEVVAVD DDGDPLPFQE
VLRRFRRKHD VDRMREEVGL RLHAFDCLHA DGEDLLDEPF RARHDRLAEV LSDAAASVEF
AGDPAAIEAA EAAALGAGHE GVMLKNPEAA YTPGNRGRDW LKRKPDVETL DAVVVGAEWG
EGRRAELFGT FLLGVRAGDD ELATIGKVAT GLTDEELADL TERLEPHVVS EDGTEIEIRP
EVVLEVGYEE IQTSPTYSSG YALRFPRFVG VRDDKSVDDA DSLERVVRLA GDEK