Gene Hlac_1644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1644 
Symbol 
ID7399594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1666189 
End bp1667190 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content55% 
IMG OID643708711 
Productintegrase family protein 
Protein accessionYP_002566299 
Protein GI222480062 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.104877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGAC AAATCGAACC CGAACGCGCA GTAGAACGGT ACCTAAACGA ACGCCGGGCG 
GACATTTCTG AGTCCACCTA TTACAACCAC TCATCCCTGC TGTCACAGTT CATTGAGTGG
TGTGAGGCCG AAGGTCTGGA CTACGTCAAC GAACTTGACG GGTTCCACAT CTCCGACTTC
AAAATCCATC GCCGGGACGA GGATGGTATC AACAAAGTCA CACTCTACAA TCAGATGACT
GTCCTTCGCG TCTTCCTCCG GTGGTGTGAA TCACGCAGTC TGGTTGAAGA CCTCGCGGAG
AACATCTTGA TGCCGGTTCC CGAAGATGAC TCTCGGGACA CGATGATCGA CTCGGAGACG
TCCGCACAGA TCCTCCAATA CCTCCAAAAG TACGAATATG GGACGTTGAA ACACACGGTA
TTCTCGCTCC TGTGGGACAC CGGGTTCCGT GTGGGAACTC TCCGAGCGGT CGATCTTGGA
GATTACCATT CAGAGAAACA GTTCATTGAG GTGGAACACC GTGCGGAGAC TGGTACACCG
CTCAAGAACA AGTACGGAGC CGAACGTGAA GTGAATCTCC ATGAATGGGT GTGTGACGTG
ATCGACGACT ACGTCGAAAT GTACAGGCAC GACATAACCG ATGACCACGG ACGGGAACCA
CTAATCACGA CGGAACAAGG TCGTCCTGTT CGGTCGAACA TACGTGGCCA CATTAACTCC
ATGACGCGCC CCTGCGTGTA CGCGGGCAGG TGCCCCCACG ATAGGGATCC AGATAGTTGC
GAAGCCGCGC AGCGACGGGA CGCAGCCGCA CGGTGTCCTG GTTCGGTTCC TCCTCACGCA
ATTCGTCGGT CCGCGATCAC AGCATGGCTC AACGATGGCC ACACAAAGGA ACTCCTCTCC
GATAGGATGA ACGTCTCCGT GAAGACGCTG GAGAAGCATT ACGATGCCCG GACGGAAAGC
GAAAAGCGGG AACTTCGCCG CGAGGAGTTC GGGATGGAGT AG
 
Protein sequence
MTRQIEPERA VERYLNERRA DISESTYYNH SSLLSQFIEW CEAEGLDYVN ELDGFHISDF 
KIHRRDEDGI NKVTLYNQMT VLRVFLRWCE SRSLVEDLAE NILMPVPEDD SRDTMIDSET
SAQILQYLQK YEYGTLKHTV FSLLWDTGFR VGTLRAVDLG DYHSEKQFIE VEHRAETGTP
LKNKYGAERE VNLHEWVCDV IDDYVEMYRH DITDDHGREP LITTEQGRPV RSNIRGHINS
MTRPCVYAGR CPHDRDPDSC EAAQRRDAAA RCPGSVPPHA IRRSAITAWL NDGHTKELLS
DRMNVSVKTL EKHYDARTES EKRELRREEF GME