Gene Hlac_3083 details


Gene Information

Locus tag: Hlac_3083
Symbol:
ID: 7399054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 338222
End bp: 339622
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 62%
IMG OID: 643706887
Product: type III restriction protein res subunit
Protein accession: YP_002564509
Protein GI: 222475988
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
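
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: the function name `cds_lengths` is hypothetical, and it assumes inclusive 1-based coordinates and a single untranslated terminal stop codon.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are inclusive,
    and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1    # inclusive span
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp as listed for Hlac_3083.
print(cds_lengths(338222, 339622))  # (1401, 466)
```

Under these assumptions the result agrees with the listed Gene Length (1401 bp) and Protein Length (466 aa).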


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGTCG AATTCGACGA CGGGACGCTC CTGCTCCGTA ATGCTCCTGA CGATGTTCCC 
TATGGGGAGT GGGACGACCG CGTCGACGAG TACCGAACGC GAGCATATCG ATATCGAGCC
CTGCTCGAGT GGGCCGGTAA GTGGACGGAC GGAAACGAGC AGGCAACGTT GCAAGAAGGC
TTCGCTCACA CTCTCGAAGA CACCGCGCGG GCCTACCCCG ATCTCGATCT CACGCAAGCG
CTCCACATCG AACCGCGTGA CTACCAGCAA GCGGCGCTCG ACGCCTGGAT CGACCACGAT
CGGCGAGGGA GTGTCGTACT CCCCACGGGC AGCGGGAAGA CGTTCCTCGG GCTGCAGGCC
ATCGCCGACG CTGGCGTCAG TACTCTCGTC GTGACGCCGA CGATTGACCT CATGAACCAG
TGGCACGCCA CGCTCACCAA CGCCTTCGGC GACCAACTCA CGGAACCGGT CGGCGTCCTC
GGCGGCGGCA GCCACGACGT CACCGCGATC ACCGTCACCA CCTACGACAG CGCCTACCGC
TACGTCAACG AGTACGGCGA TCAGTTCGGC TTGCTCGTCG TCGACGAGGA ACACCACCTG
CCAGCCCCGA CCTACCGGCA GATCCCCGAG ATGATTATCG CCCCGTATCG CCTCGGGCTG
ACCGCCACCT ACGAGCGGCC CGATGGTAAG CACGAACTTC TTGAGGACCT CCTCGGCCCG
GTCGTCTACC GGAAGGACGT CGACGAACTC GCCGGCGAAT ACCTCAGCGA GTACGAAACG
ATCCACATGT CGGTCGACCT CACGGCTGAC GAACGTGAGG AGTACGACGA GGAGTACCAG
ATCTATCGCG ACTACGTCGA CAGCCACGAG TTTGACCTCT GGAAAGAGGA CGGCTACGCA
GAGTTCCTCA AACGCACGTC CTACGACCCG CAAGGGCGGC GGGCGCTCAT CGCCAAGCAA
CGTGCCGAGC GAATCGCCCG AACCGCCGAA AAGAAACTCG ACACGCTCGA CAACCTATTG
AAACGTCATC ACGATGATCG AACAATTATT TTCACCGCCA ACAACGACTT CGCCTACGAC
ATCTCCCGGG AGTTCATCGT CCCCTGTATC ACTCACCAGA CCAAGACTGA CGAACGCACC
GAAATCCTCG ACCGCTTCCG GAGCGGGGAG TACTCGATGC TCGTCACGTC ACAGGTGCTC
GACGAGGGCA TCGACGTCCC GGCGGCAAAC GTCGGGATCA TCCTCTCGGG GAGCGCCTCG
AAACGCCAGT ACGCGCAACG GCTTGGCCGC ATCCTGCGAC CCACGGACGA CCGCCAGCCC
GCGCGGCTCT ACGAGATCAT CACCGAGGAT ACGATGGAGA CGTACGTCTC CCAACGCCGC
CGTGAGGGGG TGAGTGCGTA G
 
Protein sequence
MQVEFDDGTL LLRNAPDDVP YGEWDDRVDE YRTRAYRYRA LLEWAGKWTD GNEQATLQEG 
FAHTLEDTAR AYPDLDLTQA LHIEPRDYQQ AALDAWIDHD RRGSVVLPTG SGKTFLGLQA
IADAGVSTLV VTPTIDLMNQ WHATLTNAFG DQLTEPVGVL GGGSHDVTAI TVTTYDSAYR
YVNEYGDQFG LLVVDEEHHL PAPTYRQIPE MIIAPYRLGL TATYERPDGK HELLEDLLGP
VVYRKDVDEL AGEYLSEYET IHMSVDLTAD EREEYDEEYQ IYRDYVDSHE FDLWKEDGYA
EFLKRTSYDP QGRRALIAKQ RAERIARTAE KKLDTLDNLL KRHHDDRTII FTANNDFAYD
ISREFIVPCI THQTKTDERT EILDRFRSGE YSMLVTSQVL DEGIDVPAAN VGIILSGSAS
KRQYAQRLGR ILRPTDDRQP ARLYEIITED TMETYVSQRR REGVSA
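
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the nucleotide sequence, and compute a GC percentage for comparison with the listed fields. The helper names `translate` and `gc_percent` are hypothetical. The codon table is the standard one; translation table 11 uses the same sense-codon assignments and differs only in which codons may act as starts, so the sketch does no start-codon handling.

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGCAAGTCG AATTCGACGA CGGGACGCTC"))  # MQVEFDDGTL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein Length and GC content fields.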