Gene Hlac_2404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2404 
Symbol 
ID7400522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2395621 
End bp2397567 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content68% 
IMG OID643709477 
Productphosphoesterase RecJ domain protein 
Protein accessionYP_002567049 
Protein GI222480812 
COG category[L] Replication, recombination and repair 
COG ID[COG1107] Archaea-specific RecJ-like exonuclease, contains DnaJ-type Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA ACACCGCCGG GGATTCCGGC ACGCGGGGCG ATTCGAGCCC CGAACCCGAC 
GCCGACACGG CGGAGGGGGC GGCTGACGAC CGGCCGACTG TCTACGATCT GGCATCGGAC
TGCACCCTTG AAGACGCCGA AGTCGATGCG CTGTACCACG CTGAAGTCAA CTGCGTCGTC
GACTACGGCA TCTTCGTCGA CGTCTCGGAC GCCCTCTCGG GGCTCGTCCA CGAGTCGAAT
CTCGACGGTG ACTACGCCGT CGGCGACCGA CTGGTGGTCC GGCTGACCGA GGTGAAGGAG
AACGGCGACG TGGCGTTCGA CGACGAGAAC CTCGACGACT ACCGCACGGA GACGGTCGTC
CACGAGCCGA CCGTCTCCCG CGTCCGCGGG CTCACGCCCG GCGACGAGGT CACCGTCGAA
GGAGAGGTCG TGCAGGCGAA ACAGACAGGT GGGCCGACGA TCTTCGCCGT CGCGGACGCC
TCCGGCGTCG TCTCGTGTGC CGCCTTCGAG GAGGCCGGCG TCCGCGCGTA CCCCGAGGTC
GAGGTCGGCG ACATGGTCCA CGTCAGCGGG ACGGTTGAGA CCCGCGAGAA CGCTCTCCAG
CTGGAGGTCG ACTCGCTGAA GCGGCTCCCC GAGGGGCGAG CGGCCGAAGC CCGCGAGCGC
TTCGAGGCGG CTCTCGACGA GCGCGCCGAG CCCGCCGACG TCGACCCGCT CGTCGAGTGG
GAGGCGTTCG AGCCGATCCA CGACGACCTC CGGGAACTGG CGCGGCTGCT CCGCCGCACC
GTGCTCGCCG GCCGTCCGAT CCGCGTCCGA CACCACGCCG ACGGCGACGG GATGTGTGCC
GCGATCCCAG TCCAGCTCGC CTTGGAGAAC TTCGTCTGCG ATGTCCACGA GGACCCGGAC
GCGGCCCAAC ACCTGTTCAA GCGGCTCCCG AGCAAGGCGC CGTACTACGA GATGGAGGAC
GTGACCCGCG ATCTGAACTT CGCGCTGGAG GGGCGTGCCC GCCACGGCCA GAAGCTCCCC
TTCCTACTGA TGCTCGACAA CGGTTCGACC GAGGAGGACG TGCCCGCCTA CGAGAACCTC
GCGCACTACG ACATCCCCAT CGCAGTCGTC GACCACCACC ACCCCGATCC CGAGGCCGTC
GAACCGCTCC TCGACGCCCA CGTCAACCCG TACCTCCACG ACGAGGATTA CCGAGTCACC
ACGGGGATGA TGTGCGTCGA ACTCGCCCGG CTGATCGACC CGTCGATCAC GGGCGAACTC
GAACACGTCC CGGCGGTCGC CGGGCTCTCC GACCGCTCGA AGGCGGAGAC GATGGACGAC
TACGTCGCGC TCGCCGAGGG CGCCGGCTAC GACGAGTCCG ACCTGCTCGA CATCGGCGAA
GCGCTCGACT ACGCCGCCCA CTGGCTGCGC TACTCCGAGG GGAAGACCCT CGTCAACGAC
GCCCTCAACG TGGGCTGTGA GGACGAAGCA CGCCACGAGG AGCTGGTCGA GTTCCTCTCC
GAGCGAGCCG ACCGCGACGT GCAACGCCAG CTCGACGCCG TCGACGACCA CGTCGAACAC
GAGCGGCTCG CCTCCGGCGC GCACCTCTAC CGCATCGACC TCGACGAGTA CGCCCACCGG
TTCACCTACC CCGCGCCCGG GAAGACGACC GGCGAGCTCC ACGACACTCG CGTCAAGGAA
ACCGGTGACC CCGTCATCAC CATCGGCTAC GGTCCCGATT TCTGCGTGCT TCGCTCCGAT
GGCGTTCGGC TCGACATCCC GAACATGGTG ACCGAACTGA ACGAGGAACT TCCCGAAGCG
GGCGTCTCGG GCGGCGGTCA CCTCGTCGTC GGTTCGATCA AGTTCGTGAA GGGCCGCCGT
AGCGCCGTTA TCGAGACGCT CGTCGAGAAG ATGGCCGACG CGGAGATCGA CGAGGCGCTC
TCGTCGACGG TCGCGATCGA CGACTGA
 
Protein sequence
MSDNTAGDSG TRGDSSPEPD ADTAEGAADD RPTVYDLASD CTLEDAEVDA LYHAEVNCVV 
DYGIFVDVSD ALSGLVHESN LDGDYAVGDR LVVRLTEVKE NGDVAFDDEN LDDYRTETVV
HEPTVSRVRG LTPGDEVTVE GEVVQAKQTG GPTIFAVADA SGVVSCAAFE EAGVRAYPEV
EVGDMVHVSG TVETRENALQ LEVDSLKRLP EGRAAEARER FEAALDERAE PADVDPLVEW
EAFEPIHDDL RELARLLRRT VLAGRPIRVR HHADGDGMCA AIPVQLALEN FVCDVHEDPD
AAQHLFKRLP SKAPYYEMED VTRDLNFALE GRARHGQKLP FLLMLDNGST EEDVPAYENL
AHYDIPIAVV DHHHPDPEAV EPLLDAHVNP YLHDEDYRVT TGMMCVELAR LIDPSITGEL
EHVPAVAGLS DRSKAETMDD YVALAEGAGY DESDLLDIGE ALDYAAHWLR YSEGKTLVND
ALNVGCEDEA RHEELVEFLS ERADRDVQRQ LDAVDDHVEH ERLASGAHLY RIDLDEYAHR
FTYPAPGKTT GELHDTRVKE TGDPVITIGY GPDFCVLRSD GVRLDIPNMV TELNEELPEA
GVSGGGHLVV GSIKFVKGRR SAVIETLVEK MADAEIDEAL SSTVAIDD