Gene Hlac_3185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3185 
Symbol 
ID7399314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp418762 
End bp420162 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content63% 
IMG OID643706985 
Producttype III restriction protein res subunit 
Protein accessionYP_002564607 
Protein GI222476086 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAATCG AGTTCGACGA CGGAACCCTC CTGCTCCGCG ATGCACCGGA TAACGTCCCT 
TACGCGGAGT GGGACGACCG CGTCGACGAG TACCGCACCC AAGCGTATCG ATATCGAGCG
CTCCTCGAAT GGGCCGGTGC ATGGTCGGAC GGAGACGAAC AAGCAACGCT GCACGACGGC
TTCGCTCCCG CGCTCGAAGA CGCCGCTCGG GCCTACCCCG ATCTCGACCT CACGCCAGCG
CTCCACATCG AACCACGAGA CTACCAGCAG GCCGCCCTCG ACGCCTGGAT CGACCACGAC
CGCCGGGGCA GCGTCGTCCT CCCCACGGGC AGCGGGAAGA CGTTCCTCGG CCTCCAAGCC
ATCGCTGACG CCGGCGTCAG CGCGCTCGTC GTCACGCCGA CGATCGATCT GATGAATCAG
TGGCACGCCA CGCTCACCAA CGCCTTTGGT GAGCAGCTCC CGGAGCCGGT CGGCGTCCTC
GGCGGTGGCA GCCACAACGT CACCGCGATC ACCGTCACCA CCTACGACAG CGCCTATCGG
TACATCAACG AGTATGGCGA CCAGTTCGGG CTACTCGTCG TCGATGAGGA ACACCACCTG
CCAGCGCCGA CCTACCGTCA AATCCCTGAG ATGACCATCG CGCCCTATAG GTTGGGACTG
ACCGCGACCT ACGAGCGCCC CGACGGGAAA CACGAACTGC TCGAAGAGCT CCTCGGGCCA
GTTGTCTACC GCGAAAACGT CGACGAACTC GCCGGGGAGT ATCTCAGCGA GTACGAGACG
ATCCACATGT CGGTCGACCT CACGGCCGAC GAGGGCGAGA CGTACGACGA GGAGTACCAG
CTCTACCGCG ATTACGTTGA CAGCCACGAG TTCGACCTCT GGAAGGAGGA CGGCTATGCG
GAGTTCCTCA AACGCACGTC CTACGACCCG CAAGGGCGGC GGGCGCTCAT CGCCAAGCAA
CGTGCCGAGC GAATCGCCCG AACCGCCGAA AAGAAACTCG ACACGCTCGA CAATCTCATC
AAACGGCATC ACGATGACCG CGCCATCATC TTCACCGCGA ACAACGACTT CGCCTACGAC
ATCTCTCAAG AATTCATCGT CCCCTGCATC ACCCACCAGA CCAAGACTGA CGAACGCACC
GAGATTCTGG AACGGTTCCG GACGGGGGAG TACTCGATGC TGGTGACTTC GCAGGTGCTT
GACGAGGGGA TCGACGTGCC TGCGGCGAAC GTCGGGATCA TCCTCTCGGG GAGTGCCTCG
AAACGCCAGT ACGCCCAGCG GCTCGGCCGG ATATTGCGGC CCACGGATGA TCGCCAACCG
GCACGCCTCT ACGAAATCAT CACGGACGAG ACGATGGAGA CCTATGTTTC CCAGCGCCGC
CGTGAGGGGG TGAGTGCGTA G
 
Protein sequence
MRIEFDDGTL LLRDAPDNVP YAEWDDRVDE YRTQAYRYRA LLEWAGAWSD GDEQATLHDG 
FAPALEDAAR AYPDLDLTPA LHIEPRDYQQ AALDAWIDHD RRGSVVLPTG SGKTFLGLQA
IADAGVSALV VTPTIDLMNQ WHATLTNAFG EQLPEPVGVL GGGSHNVTAI TVTTYDSAYR
YINEYGDQFG LLVVDEEHHL PAPTYRQIPE MTIAPYRLGL TATYERPDGK HELLEELLGP
VVYRENVDEL AGEYLSEYET IHMSVDLTAD EGETYDEEYQ LYRDYVDSHE FDLWKEDGYA
EFLKRTSYDP QGRRALIAKQ RAERIARTAE KKLDTLDNLI KRHHDDRAII FTANNDFAYD
ISQEFIVPCI THQTKTDERT EILERFRTGE YSMLVTSQVL DEGIDVPAAN VGIILSGSAS
KRQYAQRLGR ILRPTDDRQP ARLYEIITDE TMETYVSQRR REGVSA