Gene Hlac_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0414 
Symbol 
ID7401031 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp434253 
End bp435528 
Gene Length1276 bp 
Protein Length424 aa 
Translation table11 
GC content59% 
IMG OID643707478 
ProductProtein of unknown function DUF1225 
Protein accessionYP_002565087 
Protein GI222478850 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.859835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.270687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGTG CCAACACTTT TGAGGTCGTG CCACAGACCG AGAACGACAA AGAGTGCCTC 
CTACGGCTAC TCGATGCATC CGCTTCTCTG TGGAACGAAC TGACCTACGA ACGTCGTCAG
AACTACTTCG GTGACGGCGA CGTGTGGGAC ACTCCCGAGT ACCGAGGACG CTACAACGGC
GTCGTCGGAA GCGCGACTGT TCAACAGGTC ACGCGCAAGA ACAGCGAAGC GTGGCGGTCG
TTCTTCGCCC TCAAGGAGAA AGGCGAGTAC GCCAACCCAC CGTCGTACTG GGGCAACGAG
GAGGACGGAC GCGAACTCCG TACCTACATC CGATGCAACC AGTACACGAT TGAGTGGGGG
AAACGTAGCC GTCTCGAAAT CCCTGTCGGG CAAGAACTGA AAGACGAATA CGGACTCGGC
TACCACGAAC GACTCCGCCT CGAAGTCCGA GGCAACCCGA AGTGGGACGG CAAACAGGGT
CGTCTGGAAC TTGAGTACGA CGAGGTTAGC GACACGTTCA GGGCTTTTCA ACCAGTCACC
GTACCTGATT CTCGACTGGA TTCACCACTG GCTTCGGAAG AAGCCGCCCT CGACGTTGGA
GCGAACAATC TCGTCGCGTG TTCCACGACT ACTGGGAACC AGTACCTCTA CGACGGTCGT
GAGTTGTTCG GACGGTTCCG CGAGACGACA GACGAAATCG CCCCGCCTAC AGTCGAAACT
CCGAGAGGGT CGCTACTCCT CGAATCGGAT TCGACGGCTG TACCGACAGC GGACGAAGCG
TCGTGACCAT GCACAGAACG CGCTGGTGCG CGACCTCGTT GAACGGCTGT ACGATGAGGG
CGTGGCGACG GTGTACGTGG GCGACCTGAC AGACGTGCTG GAAGCGCATT GGTCGGTCAG
GGTGAACGAG AAGACGCACA ACTTCTGGGC GTTCAAGAAG TTCATCCACC GTCTCGCGTG
CGTCTGTGAG GAGTACGGCA TCAGCCTCGA AACCGAGTCG GAAGCGTGGA CGAGTCAGAC
GTGTCCCGAG TGTGGCGACC ACGAGAAGAC GGTTCGCCAC GAGGATACGC TGACGTGTCC
ATGTGGCTTC GAGGGGCACG CCGACCTCAC GGCGTCAGAG ACGTTCCTTC GGGAAAACAG
CAATTGCGAA ATCAGGCCGA TGGCACGGCC CGTGCGATTC GAGTGGGACG ACCACGACTG
GTCGGGGAAA CTATACCCTC ACGAAAGTCC CAAAGAAGTG CGCACGAACC CGCAAGTTGC
CTCCGTGGGT CGGTAG
 
Protein sequence
MKRANTFEVV PQTENDKECL LRLLDASASL WNELTYERRQ NYFGDGDVWD TPEYRGRYNG 
VVGSATVQQV TRKNSEAWRS FFALKEKGEY ANPPSYWGNE EDGRELRTYI RCNQYTIEWG
KRSRLEIPVG QELKDEYGLG YHERLRLEVR GNPKWDGKQG RLELEYDEVS DTFRAFQPVT
VPDSRLDSPL ASEEAALDVG ANNLVACSTT TGNQYLYDGR ELFGRFRETT DEIARLQSKL
REGRYSSNRI RRLYRQRTKR RDHAQNALVR DLVERLYDEG VATVYVGDLT DVLEAHWSVR
VNEKTHNFWA FKKFIHRLAC VCEEYGISLE TESEAWTSQT CPECGDHEKT VRHEDTLTCP
CGFEGHADLT ASETFLRENS NCEIRPMARP VRFEWDDHDW SGKLYPHESP KEVRTNPQVA
SVGR