Gene Hlac_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0203 
Symbol 
ID7402132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp218701 
End bp220545 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content67% 
IMG OID643707266 
ProductUvrD/REP helicase 
Protein accessionYP_002564878 
Protein GI222478641 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.451743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC CCACGGTGAC GCGACTGTTC GGCGGTCCGG GCAGCGGGAA GACCACGGCG 
CTCCTGGACC GCGTCGAGGG GATCCTCGAC GACGGGGACG CCGACGTCCG TGACGTGCTC
GTCGTCTCGT ACACGCGCGC AGCCGCCGCC GAGATCCGCG AACGACTCGC AGAGCGGCTC
GACATCTCCC CGCGCAGCCT CCAGGGGAAC GTCTGTACCA TGCACGCGAA GGCGTACGAG
CTGCTCGATC TGTCGCGCGG CGACGTGGTC GGCGAGGACG ACAAAGAGGA GTTCTGCGAG
GAGTACGGCA TCGAATTCGA GGACCAGCAC GGCGGCGCCG GGCGGCGGAC CGCGCGGTCG
ACGACGATCG GTAACAAGAT CATCGCCACC TCGCAGTGGC TCCAGCGCAC CGAGCGTGAC
GTGTCCGACT GGTACGACGT GCCCTTCCAG TGGGACGTCG AGGAGGTTCG GCTCCCGCCC
GAGGAGGACC CCAACGCTCA GGAAGGGAAC AAGTACACCC CGACGTGGCC CTCCGACGAC
GAGCGGATCG ACATCCCCGA GACGATCCGC GCGTGGCGCG GCTACAAGGG CGACAACGAT
CTGGTCGGCT TCGCGGACAT GCTCGAACGC GTCGCGCAGC GCTCGCTGGT GCCGAACGTC
GACTACCTGA TCATCGACGA GTTTCAGGAC ATCACGACGC TGCAGTACAA CGTCTTCGAG
GAGTGGGAAC CGCACATGCG GAAGGTGCTC ATCGCCGGCG ACGACGACCA GGTCGTCTAC
GCGTGGCAGG GCGCCGACCC CGACCTCCTC TTGGACACCG ACGTCGACGA GGACGTGGTC
CTCCCGAACT CCTACCGGCT GCCCTCCGAG ATCCTCAACG TCGTCAACGC CGAGATCCGT
CACATCGACA AGCGGCAGGA GAAGGACCTC CACCCCCGCA AGGAGGGCGG CAGCGTCGAA
GCGATCCAGT CGCCGTCGAT GCTCGAACTC GTCCGGAACG TCCGGTACAC CGTCGATGAC
GACGAGGGCA GCGTGATGTG TCTGTTCCGC GCGCGCTACC AGATGTTCGA CTTCATCGAC
GAGTTCATCG ACCACGGGAT CCCGTTCACG ATGCTGACCG ACGGCCGGAT GTGGACGGAC
CGCGTGCAGG ACTACGTCAG CGCCATCGAG AAGTCCGACG CGGGCGATCC CGTGAACGGA
CTGGAAGCCC GGCGGCTCGC GGACATGCTG CAGGACTCGG CGTTCGGCAC CCACGAGCGC
GACGAGTTCT ACGACTTCCT CGACGACCGC GAGGAGGCGG CCGACGCCGA CGACATCTCG
CTCATCGAGG TGACGACCGA CGAGCTCGAC GCCCACATCC CGTTCATGCC CGACGCGAAC
AGCGCCGACG ACATGGTCCG GAAGGTGACG AGCTTCCAGC GGAAGTCGAT GGGCGCGTAC
TTCGGCGGCG ACTACGAGGG GGCGGACCCG ACCCGCGTCC GCGTCGGCAC TATCCACTCC
GCGAAGGGCC GCGAGGCCGA TCACGTGTTC GTCGCGACGG ATCTCACCGA GAAGGTGGTC
GAGCAGATGG CCGCCTCCAT CGACGACCCG ACCGACGTGG ACGGCATCGA GGAGTTCACG
AAGACCACGA GCCCGGTCCC CGTCCTCACC GACAACGAGC GCCGGGTCTT CTACGTCGGG
ATGTCCCGCG CCCGCGAGCG GCTCGTGATC ATGGAGAGCC TCATCAGTGG GGCGCCGACG
CTCCCGATCA GCGTCCTGCT CTTTAACGAA CTCCGCGACG AGCCCGCACA GGAGCTCGTC
GATGAGGTGC AGGCAGAGCT GGCCGTCCCC GAACCCGAGC CGTGA
 
Protein sequence
MTDPTVTRLF GGPGSGKTTA LLDRVEGILD DGDADVRDVL VVSYTRAAAA EIRERLAERL 
DISPRSLQGN VCTMHAKAYE LLDLSRGDVV GEDDKEEFCE EYGIEFEDQH GGAGRRTARS
TTIGNKIIAT SQWLQRTERD VSDWYDVPFQ WDVEEVRLPP EEDPNAQEGN KYTPTWPSDD
ERIDIPETIR AWRGYKGDND LVGFADMLER VAQRSLVPNV DYLIIDEFQD ITTLQYNVFE
EWEPHMRKVL IAGDDDQVVY AWQGADPDLL LDTDVDEDVV LPNSYRLPSE ILNVVNAEIR
HIDKRQEKDL HPRKEGGSVE AIQSPSMLEL VRNVRYTVDD DEGSVMCLFR ARYQMFDFID
EFIDHGIPFT MLTDGRMWTD RVQDYVSAIE KSDAGDPVNG LEARRLADML QDSAFGTHER
DEFYDFLDDR EEAADADDIS LIEVTTDELD AHIPFMPDAN SADDMVRKVT SFQRKSMGAY
FGGDYEGADP TRVRVGTIHS AKGREADHVF VATDLTEKVV EQMAASIDDP TDVDGIEEFT
KTTSPVPVLT DNERRVFYVG MSRARERLVI MESLISGAPT LPISVLLFNE LRDEPAQELV
DEVQAELAVP EPEP