Gene Hlac_1613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1613 
Symbol 
ID7399562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1632802 
End bp1633779 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content69% 
IMG OID643708679 
Productflap endonuclease-1 
Protein accessionYP_002566268 
Protein GI222480031 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR03674] flap structure-specific endonuclease 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.817409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.647176 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAACG CGGACTTGCG CGACCTCGCA GCGATTCGCG ACATCTCCTT TGCGGAGATC 
GAGGGAAGCG TCGTCGCCGT CGACGCGCAC AACTGGCTGT ACCGGTACCT CACGACGACG
GTGAAGTGGA CGGCTGACGA GACGTACACC ACCACCGACG GCGTCGAGGT TGCCAACCTG
ATTGGGATCG TCCAGGGGCT CCCGAAGTTC TTCGAACATG ACCTCATTCC CGTGATGGTG
TTCGACGGGG CGGTGACCGA GCTAAAAGCC GACGAGGTCG CCGACCGCCG CGAGAAGCGC
GAACAGGCGG AAGAGCGCCG GGTGGCCGCC AAGGAGCGCG GCGATGCGGT CGAGGCCGCG
CGACTGGAGG CCCGCACGCA GCGGCTCACC GACACGATTC AGGAGACGAC TCGGGAGCTG
CTCCGGCTGC TCGACGTGCC GATCGTCGAG GCGCCGGCCG AAGGCGAGGC GCAGTGCGCG
CACATGGCGG CGACCGGAAC CGTCGACCAC GCCGGCAGCG AGGACTACGA CACGCTGCTT
TTCGGTGCGC CGACGACGCT CCGCCAGCTC ACGAGCAAGG GCGATCCGGA GCTGATGGAT
CTGGCGGCGA CGCTCGACGA CCTCGGCTTC GACCGACAGG GGCTCGTCGA CGCCGCGATG
CTCTGTGGCA CCGACTTCAA CGAGGGCGTC CGCGGGATCG GGCCGAAGAC GGCGGTAAAA
GCGGTGCGAG AGCACGGCGA CCTGTGGGGC GTCCTCGACG CGCGGGGCGT CGAGATCCCG
AACGCCGAGG CGATCCGCGA GCTGTTCATG GACCCGCCAG CGACGGACGT GGACGTGGAC
ACGGCGGTGA ACCCCGACGT GGACGCCGCC CGCGAGTACG TCGTCGACGA GTGGGGCGTC
GCCGCCGACG AGGTCGAACG CGGGTTCGAA CGCATCGCGG AGTCGCAGGT TCAGACCGGG
CTCGACCGGT GGACGTGA
 
Protein sequence
MGNADLRDLA AIRDISFAEI EGSVVAVDAH NWLYRYLTTT VKWTADETYT TTDGVEVANL 
IGIVQGLPKF FEHDLIPVMV FDGAVTELKA DEVADRREKR EQAEERRVAA KERGDAVEAA
RLEARTQRLT DTIQETTREL LRLLDVPIVE APAEGEAQCA HMAATGTVDH AGSEDYDTLL
FGAPTTLRQL TSKGDPELMD LAATLDDLGF DRQGLVDAAM LCGTDFNEGV RGIGPKTAVK
AVREHGDLWG VLDARGVEIP NAEAIRELFM DPPATDVDVD TAVNPDVDAA REYVVDEWGV
AADEVERGFE RIAESQVQTG LDRWT