Gene Hlac_3609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3609 
Symbol 
ID7402523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp363070 
End bp366078 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content56% 
IMG OID643710146 
Productplasmid replication protein RepH 
Protein accessionYP_002567712 
Protein GI222481476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTCAG AACCCAATAA TCAAGATCGA CACCCCGATA TTCGGCAGGT TGCGCCTGAT 
GGATCACTTT CGACAGCTGG CATCCTTGTC AGTGATGTCT CTCCAAGAAT TACTGAGTGG
GTTCCAGGGC TGCTCGACGA ACTCCTCCCA AGCAGTGTTC GAACTCTTCG AGAATACATT
GCTCACACTG ATCCAGGAAT CCTCTCGGAC GGGCGATACG GTTCGCTGGT GGGCAAAATG
CAGGCCGATA CCTTGGGGGT CACGCATCGA GAATGGGGGT CGACGACCGA CACGTGGAGT
GATGATGAGG CTGAAGCCGT CGAATACCTC CATTCGCTAT TTAATCGCGC TGTCAAGTAC
CACGATCAGT CGACCGACGA ACTATCGAAC TACCACCAGC AGCGTGAGAA AAATATCGAC
ACTGCACTGA CCAAGATCGG GACTGGCAAA GGTCCACTCA ACGGTGGCCT CGAAGCACTC
GCTAAAGGCC CCGTTGCACT TCACAGTGAG TTCGACGAGG CTCCACAGCC CATTACGCTC
ATCCTCGATG GGCAGTCCTG GACGGACCTC GACGACCGGT CAACTGGTGT ACGGGCTCTC
GCTGCAATTG CAGTCCTCGG GTCAGCCTTC GATGTTCGTC TCGTTATTTC GCCTGGGCTA
GACCAGCATC TCCGTCGACG GTATTCTACG TGGTACGATA CACATCTGGG TCTTACTGAG
AGTGCTGATA GGTCCACACA AGAAACAGTG GCTACGGCTG GTCGCTCATC AGCGACGACC
CGTCGACGCG CGTGGGAAGT ACTGCAGGAG CTCCCAGAAG ACTCTGGACG TCTCCGGTTG
CTTGGGAATC TCCCAGTCGA CGGCTCACGC GACTATCGGG ACCTCAAACA AGACGACGAA
ATCGGCGTCA CACCGGGGAC TGTTGGCCGC TACGTCCTTG ACCTCGAAGA GTTGGGTCTC
GCCTCTATTG ACCGCTCTGG ACAGTACAAC AGTGCCTCTC TCACGTCGCT TGGCCAGCTT
GCTGTCGAGG AGTTTCTGAC CGCTGATTAC CGGACGGTCC ACCCCTCCCA GTCGAGATTG
CAGACGGCTC TTACGCCGAC CCCTCAGTCC GATGCAGGTA CAGTGTATAG AGCGCAAGCA
AGCACGGAAG GAGGGGAGGG GACACCTCCG ACAGCAACCG CCGAGGAGTG GATGGCCGCA
ACGGGCTCCC CCACCAACGG CGACAGCTAC GTCCAGTGGC TCAACGGCCC CTCTGAGATC
CTCGACGCAT GGGGAATGCA CCACCGATAC GCCGCTGGCC GCCGAAATCG CGGAGTAAAC
CTCGTTGACG ACCGGCTCGC GAAGTTTGAC GACGGTCGCA TCTCCTATCT CAGTTGTTTC
GACGACGATC TGCTCGTTAT GTCTCAGTGG GGTGGCCCTC TCCCAACACT GGGACGTATT
GCGGGAGCCT TACTGAGCGA TAAAGCACTG AGTAAGATAC TGACTCCATC TGCCCTTGGC
AGCGAGTTCG AGGCAATCGA CGATGGTGTC GTCGACAAAC TCGATCAAAA GGTAGGCGAT
ATCATCCGTT GGGGTCACCA GATCGGCTGG TTTAGCGAAG ACGAGGAACA ATACGATGAT
TGGAGAGAAC GTATCGGCAC CGTTCGAAGC CTTTGTCTCG AAAAAGTCGG TGAGCTCACC
AACAGCGACG ATATCGAAGC TCGGAAAGAA CTGTTTCGAG ATCTCCAGGG GTTGATTGCT
TCGGCAACAC AGCTCTACTA CGCTATCGGC GTCGACGTAA CGATCAACAT CCGAATGCCG
GACACAGGGA TGCTGATTCG GGATGAGAAA CGCCTGAATG ACTTTCTGAA TTTCACTCGG
TACACCGTCC CCAAACAGTC TGTCTACGGG ATCCACTCGG GATACCGAAT GCTCCTGGAA
GATCGCGAAG AGAAGCTGAA AAAGCGACTG TCGTACGATG TCGATAGTTC CGACCCAGCG
ATGCATCTGA CCGCCTCATG GGTGTTTTCG GGACCAACAA TGACGGATCT CAAACCACAA
ATTGAGAAGG CAATCGAACA GGAAGCAAGC GAGATCCGAG AGGCTATTGC CGATGGAACG
GAGGCGGCAC CGGTGATGGA GATCCCCGTC CAGATCTCGA ACACCTATAC TGCTACCCGA
GAGCTCGTCG AAGAATTTGC GACGGCGAAA GGCTACGAGG TCTCCCATCG GGGTGATATC
CATGAGCGAA ACGACGACTT AGAGCGTCTG GTTCGGCTGT TCCTCCGGGT GTTAGGGACT
GCTGATCGGC CACACCGGGC TTGTCCATCG GATGTCGCCG AAGCCATGTT GCATATCGCT
CGGTCGACTC AGAGCTTCGA TTTCATCAGT ATTCGGGACA TCGCCTACGG GCTGTCACAA
CTGCCAGCAG AGCGGTTACT GCCCGATCTG CCGCCGACAG CGACGAAACT GCTGAAAACA
CTGCTGGATG CAGACGAGCC GATGCGACGC TCCAAAATTA TTGAGGCCGC TGGGATCTCT
GGGAGTAGCT ACGACCGGTA TATCAACGAG CTCGCAGCGT GGGACATCAT TGAGCCGACC
GAATCTGAGG GTCGACGGCG GTGGGAAGGG CACTTAGAGA CATGGTGGAG TCCCCAGAGT
CATCGCGAAG AACCGTTTGG AGAACCAGAG CCAGATACGG CAATCATCGA CGCAGAGTTC
GCCCGTGATG TAGGGAGCCG AGTACTGTGC CACTACATAA CGCACTATGA TCTCCCCGAG
TTAGAAGAGG TGTATATGAG TGGACTCTGT CCGATAGCTC CAGACGATGA TATCGAGGCG
TTGTTCGGAC GGCATCGTCG GTTATCACGG TGGTGGGCGT TCCTGTGGGG GGCGTATGCA
GATGAAGACG AGATCGCGAA CGCTGAAGCT GTGACAGGAT CGGAGACTGC AGTTCGGATT
GGACAGTTGC CGGGCTCGGT GGAGGATTCA CAACAGAGCC TCGGTGAGTG TAAATCTGTG
TCGACCTAG
 
Protein sequence
MHSEPNNQDR HPDIRQVAPD GSLSTAGILV SDVSPRITEW VPGLLDELLP SSVRTLREYI 
AHTDPGILSD GRYGSLVGKM QADTLGVTHR EWGSTTDTWS DDEAEAVEYL HSLFNRAVKY
HDQSTDELSN YHQQREKNID TALTKIGTGK GPLNGGLEAL AKGPVALHSE FDEAPQPITL
ILDGQSWTDL DDRSTGVRAL AAIAVLGSAF DVRLVISPGL DQHLRRRYST WYDTHLGLTE
SADRSTQETV ATAGRSSATT RRRAWEVLQE LPEDSGRLRL LGNLPVDGSR DYRDLKQDDE
IGVTPGTVGR YVLDLEELGL ASIDRSGQYN SASLTSLGQL AVEEFLTADY RTVHPSQSRL
QTALTPTPQS DAGTVYRAQA STEGGEGTPP TATAEEWMAA TGSPTNGDSY VQWLNGPSEI
LDAWGMHHRY AAGRRNRGVN LVDDRLAKFD DGRISYLSCF DDDLLVMSQW GGPLPTLGRI
AGALLSDKAL SKILTPSALG SEFEAIDDGV VDKLDQKVGD IIRWGHQIGW FSEDEEQYDD
WRERIGTVRS LCLEKVGELT NSDDIEARKE LFRDLQGLIA SATQLYYAIG VDVTINIRMP
DTGMLIRDEK RLNDFLNFTR YTVPKQSVYG IHSGYRMLLE DREEKLKKRL SYDVDSSDPA
MHLTASWVFS GPTMTDLKPQ IEKAIEQEAS EIREAIADGT EAAPVMEIPV QISNTYTATR
ELVEEFATAK GYEVSHRGDI HERNDDLERL VRLFLRVLGT ADRPHRACPS DVAEAMLHIA
RSTQSFDFIS IRDIAYGLSQ LPAERLLPDL PPTATKLLKT LLDADEPMRR SKIIEAAGIS
GSSYDRYINE LAAWDIIEPT ESEGRRRWEG HLETWWSPQS HREEPFGEPE PDTAIIDAEF
ARDVGSRVLC HYITHYDLPE LEEVYMSGLC PIAPDDDIEA LFGRHRRLSR WWAFLWGAYA
DEDEIANAEA VTGSETAVRI GQLPGSVEDS QQSLGECKSV ST