Gene Hlac_2604 details


Gene Information

Locus tag: Hlac_2604
Symbol:
ID: 7399830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2581499
End bp: 2583496
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 71%
IMG OID: 643709676
Product: DNA mismatch repair protein MutS domain protein
Protein accession: YP_002567245
Protein GI: 222481008
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
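
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2581499, 2583496)
print(length)                     # 1998
print(protein_length_aa(length))  # 665
```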


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0194376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.689485
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTTGG AGGCGATCCC CGGCGTCGGC GCGAAGACGG CCGCGGCGTT GCACGAGCTC 
GACGACCCGG TCGCGACCGT CGAGTCCGGC GACGTCGCCG CGATCGCCCG CGCGCCCGGT
GTCAACGAGG CGCGCGCGGC CCGCATCGCT CGCGGAGCGA TCCGTCGCCG ACACGACGAC
GACGGGCGCG TGCTGGCGAC CGACCGCGCC CGCGAGGTGT ACCGCTCGGC GATCGACCTG
CTGCGCGAGC GCACCGTCAC CGACTACGCC GCCAAGCGGC TGGAGACGTT CTACCCGAGC
GAATCGACCT CGCGGATCGC GGAGGCGCAG GCGTTCGTCG AGGGGGCGAT GGAGCGCGAG
CCCGATCCCG CCGTCCACGA GGCGCTCGCC GGGGTCAAAC CCCTGATCGA CCCGCCGACG
GTTCGAGTGC GCGACCGCTG TCTCGCGACG GCCGACGCCG AGGCGCTCGC CCGCGCCGAG
TCGGCGGTGC CGGAGCTGTC GGTCGAGACC GTCGAGAACG CCCGTGACAT CTCGGAACTC
GCGCGGTCGT ACGCGACCGT GATCGTCTTA GACGAATCGT TCGCCGGACT CGACGTCGAG
GGTGACGTAC ACGTCCGCCC GGATGCGTTG GACAAGCCCG CGGAGACCGT CCCCGAGCGT
CTGCTCGCCT TCTTCGCGGC GAACCGCGAG CGACTGGAAG CGGCCGCGGC GGTCCACGAG
ACGGCGAACC TCTCCCCCGC GGCCGACCTC GACCGCCTCC GCGACGCCCT CGCGCGGCTC
GACGACGACG GGACGGTCGT CGGCGACGGG GAGCTCGAAC GACTGACCGC CGCCGTCGAC
GACCTCGATG CCGCGGTGTC GACGGCGGAG TCAGTGGCCG ACGACCGGCT CAGAGAGGCG
ATCCGCGAGC GCGACGTGAC CATCGAGGGG ACCGACTTCC TCTCGCTGGT CGAGCAGGGC
GCCCGGGTCG ACTCGCTCTT AGACCGCGAG CTGGCCGACG AGTACGACGC GGCGATGGCC
CGCGCTCGCG AGCACCTCGC GGACGCGCTC CGCTTGGAGC CCGAGGAGGC GGAACTCGCC
GACCGGGTGT TCGGCGACGA CCCCTCTTTC CCGGTCGAGC ACGATGAGAG CGCGGTCTCG
CGGCTCCGCA CCGAGCTTGC GGCGGCGCGC GACCGCCGGG CTGCCCGCCT GAAGGCGGAA
CTCGCGAGCG ACCTCGGCGA CCTGCGGGAG CCCGTGGAGG AACTCGTCCG GGACGCCCTC
GAACTCGACG TGGAACTGGC GATCGCGCGA TTCGCGCGCG ACTTCGACTG CGTCATGCCC
GAGGTGGTTG ATTCGGACGG GGACGGCTCT CCCGGACCCT CCGGCTTTCG GATTGTGGGC
GGCCGCTCCT CACTTCTCGA CGTTGATTTC GAGAACGTGG AGCCGATCGA CTACGCGGTG
TCGGGCGCGA CGCTCCTCTC GGGAGTCAAC TCCGGCGGGA AGACCTCGAC CCTCGATCTC
GTGGCGCTCG TCGTCGTCCT CGCGCAGATG GGGATGCCCG TCCCCGCCGA GTCCGCCACC
GTCGAGCGCT TCGAGGAGAT CCACTACTAC GCCAAATCGC AGGGAACCCT CGACGCGGGC
GCGTTCGAGG CGACCCTGCG GGACTTCGGC GACCTCGTCG AGGGCGCGGA CGGGCGGCTC
GTCTTGGTCG ACGAGCTTGA GTCGATCACG GAGCCGGGCG CCTCCGCGAA GATCATCGCG
GGCATCCTCG AAGCGCTCGA CGAGCAGGAC GCCACCGCCG TCTTCGTCTC CCACCTGGCC
CGCGAGATCC GGGACGCGGC CGACTTCGAA GTCGCCGTCG ACGGAATCGA GGCCGCCGGG
CTCGTCGACG GCGAGCTACG GGTGAATCGC TCACCGCGGA AGGGTCACCT CGCGCGGTCG
ACCCCGGAGC TCATCGTCGA GAAGCTCGCG GGCGACCGCG ACACCGACTT CTACGGGGAC
TTACTGGAGA AGTTCTGA
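
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and only the first row of the sequence is pasted in for illustration; the full sequence would need to be supplied to compare against the reported value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence from the record, for illustration only.
first_row = "ATGGAGTTGG AGGCGATCCC CGGCGTCGGC GCGAAGACGG CCGCGGCGTT GCACGAGCTC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```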
 
Protein sequence
MELEAIPGVG AKTAAALHEL DDPVATVESG DVAAIARAPG VNEARAARIA RGAIRRRHDD 
DGRVLATDRA REVYRSAIDL LRERTVTDYA AKRLETFYPS ESTSRIAEAQ AFVEGAMERE
PDPAVHEALA GVKPLIDPPT VRVRDRCLAT ADAEALARAE SAVPELSVET VENARDISEL
ARSYATVIVL DESFAGLDVE GDVHVRPDAL DKPAETVPER LLAFFAANRE RLEAAAAVHE
TANLSPAADL DRLRDALARL DDDGTVVGDG ELERLTAAVD DLDAAVSTAE SVADDRLREA
IRERDVTIEG TDFLSLVEQG ARVDSLLDRE LADEYDAAMA RAREHLADAL RLEPEEAELA
DRVFGDDPSF PVEHDESAVS RLRTELAAAR DRRAARLKAE LASDLGDLRE PVEELVRDAL
ELDVELAIAR FARDFDCVMP EVVDSDGDGS PGPSGFRIVG GRSSLLDVDF ENVEPIDYAV
SGATLLSGVN SGGKTSTLDL VALVVVLAQM GMPVPAESAT VERFEEIHYY AKSQGTLDAG
AFEATLRDFG DLVEGADGRL VLVDELESIT EPGASAKIIA GILEALDEQD ATAVFVSHLA
REIRDAADFE VAVDGIEAAG LVDGELRVNR SPRKGHLARS TPELIVEKLA GDRDTDFYGD
LLEKF