Gene Hlac_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1556 
Symbol 
ID7401488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1576225 
End bp1577637 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content70% 
IMG OID643708622 
Producttype III restriction protein res subunit 
Protein accessionYP_002566213 
Protein GI222479976 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTCC GGCTGACCTA CGAGGACGGG ACGATCCGGG TCGTCGCTGG CGACGCCACT 
GGCAACGGTG CGAACGATGC CGACGGCGAC GCTCTGGAGT CGCTCCCGCC GCTCCCCGGC
GTCGAGAGCG ACCCGCGATC GGGGACCGGG CGCGCCCCGG CCTACCGCTA CGCCGCGATC
CGACGAGCCT TGGAGGTCGC CGGCGTGAGC GTCGAGGATC ACGTGCTGGA CGCGAGCGAC
CGCGCGGGAG CGGCAGCCGG GCTCGACACC GGCCTTTCGA CCGACTACGA TCTCCGGGAG
TACCAGCGTG AGGCGCTCGA CGCGTGGCGC GACGCCGGCG ACCGCGGCGT GCTCGAACTC
CCGACCGGGG CCGGCAAGAC CGTGATCGCA ATACGCGCGA TGGTCGAGCT AGGCGTGCCG
ACCCTCGTCG TAGTGCCCAC GGTCGATCTC CTCAATCAGT GGCAGCGGGA GCTAGAAGCG
GAGTTCGACG TACCAATCGG GCGGTTCGGC GGCGGCGAAC AGCGCCAAGA GGCGATCACG
GTGTCGACGT ACGACTCTGC GTACCTGAAA GCCGAGGATA TCGGCGACGC CTTCGAGTTC
GTCGTCTTCG ACGAGGTCCA CCACCTCGGC GGCGAGGGGT ATCGTGACGT GGCGCGGCTG
CTCGCGGCGC CCGCCCGGCT CGGGCTCACC GCCACCTTCG AGCGCCCCGA CGACGCGCAC
GAGACCGTCG CAGAGCTGAT CGGCGACCGC GTGTACGCGC TCGACGTGGA CGACCTCGCG
GGCGACCACC TCGCCTCCTA CGACATCCGA CGGATCGAGG TGGAGCTGAC GCCCGACGAG
CGCGAGCGCT ACGACGCGAA GCAGGGCACC TTCGTCGAGT ACGTCCGGGA CGCGGGGATC
ACGTTCACGA GCGGGAGCGA CTATCAGGAA CTCGTCAAGC GCTCCGGCAA CGACCCGGCC
GCGAGGGAGG CGCTCCTCGC GAAACAGGAC GCCCGCGAGA TCATGATGAA CGCGCGCCGG
AAGATCGACC GCTTGGAGTC GATCCTCGAC CGCCACCGCG ACGACCGCGT GATCGTGTTC
ACCGCCCACA CCGACCTCGT CTACCGGCTT TCCGAGCGAT TCCTGCTGCC CGCGATCACC
GCCGAGACGG GCGCGAAGGA GCGCCGCGAG ATTCTGGAGC GCTTCCGCGA GGGGACCTAC
GGTCGGGTCG TCGCCGCCAA CGTCCTCGAC GAGGGCGTCG ACGTGCCCGA CGCGAACGTC
GCGGTCGTGC TCTCCGGCTC GGGGAGTGAA CGAGAGTTCA CCCAGCGGCT CGGGCGGGTG
CTCCGTCCCA AAGACGACGG TGGGCGGGCG ATCCTCTACG AGGTCGTCAG CACGGAGACC
GCGGAGGAGC GGGTGGCGAG CCGGCGGCGG TGA
 
Protein sequence
MDVRLTYEDG TIRVVAGDAT GNGANDADGD ALESLPPLPG VESDPRSGTG RAPAYRYAAI 
RRALEVAGVS VEDHVLDASD RAGAAAGLDT GLSTDYDLRE YQREALDAWR DAGDRGVLEL
PTGAGKTVIA IRAMVELGVP TLVVVPTVDL LNQWQRELEA EFDVPIGRFG GGEQRQEAIT
VSTYDSAYLK AEDIGDAFEF VVFDEVHHLG GEGYRDVARL LAAPARLGLT ATFERPDDAH
ETVAELIGDR VYALDVDDLA GDHLASYDIR RIEVELTPDE RERYDAKQGT FVEYVRDAGI
TFTSGSDYQE LVKRSGNDPA AREALLAKQD AREIMMNARR KIDRLESILD RHRDDRVIVF
TAHTDLVYRL SERFLLPAIT AETGAKERRE ILERFREGTY GRVVAANVLD EGVDVPDANV
AVVLSGSGSE REFTQRLGRV LRPKDDGGRA ILYEVVSTET AEERVASRRR