Gene Hlac_0629 details


Gene Information

Locus tag: Hlac_0629
Symbol:
ID: 7401764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 646790
End bp: 649093
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 71%
IMG OID: 643707695
Product: DNA mismatch repair protein MutL
Protein accession: YP_002565301
Protein GI: 222479064
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
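
The coordinate and length fields above agree with each other, and a short sketch can check this. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(646790, 649093)
    print(length, protein_length(length))
```

With the values from this record, the sketch gives 2304 bp and 767 aa, matching the Gene Length and Protein Length fields.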


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCCGC CGAACATCGA GCGGTTGGAT GAGCGGACCG TCCAGCGCAT CGCGGCCGGT 
GAAGTCGTCG AGCGTCCGGC CAGCGTCGTC AAGGAGCTGA TCGAGAACAG CCTCGATGCG
GGCGCGACGC GCGTGGCCGT CTCGGTCGAG GCGGGCGGCA CCGAGGGGAT CCGCATCCGA
GACGACGGCG TCGGCATCCC CGCAGACCAA CTGGAGGCGG CCGTCGCGGA ACACGCTACC
TCGAAGATCG GCAAGATTGA GGACCTCGAT CACGGCGTCG GCACCCTCGG CTTCCGCGGC
GAGGCGCTGT ACACCGTCGG CGCGGTCTCG CGGCTCACCG TCCGGTCGCG CCCACCAGGA
GCCGACGCGG GCTCAGAGAT TACCGTCGAG GGCGGCGACG TGGGAGACGT CCGTCCCGCC
GGCTGTCCCG AGGGGACGAC CGTGGAGGTC GACGGCCTCT TCTACAACAC CCCCGCCCGC
AAGAAGTTCC TCAAGCGGAC CGCCACCGAG TTCGACCGCG TGAATGCGGT CGTCACCGGC
TACGCGCTCG CGAACCCCGG CGTCGCCGTC TCGCTGGAAC ACGACGGACG GGAGACGTTC
GCGACGGAGG GGAACGGTGA CCTCCGGTCT GCCGTCCTCG CGGTCTATGG CCGCGAGGTC
GCGGACGCGA TGGTGGACAT GGAGTGGGAA CCGGGTAATT CGGACACCGA CTCGCCCGTT
CACAGTGTCA CCGGCCTCGT CTCCCACCCG GAGACGACCC GGTCGAGCCG CGAATACCTC
GCGACCTACG TCAACGGTCG GTACGTCACG GCGGGCGCCC TCCGCGACGC CGTCCTCGAC
GCGTACGGCG GCCAACTCGC ACCCGATCGG TATCCCTTCG CGGTGCTCTT CGTCGAGGTC
CCGCCGGGCG ACGTCGACGT GAACGTCCAT CCGCGCAAGC TCGAAGTCCG GTTCGACGAG
GAGCCGGCGG TACGCGCCGC GGTCGAGGAG GCGGTCGAGG CCGCGCTGCT CGACCACGGG
CTGATCCGCT CGACCGCACC GCGAGGGCAG TCGGCCCCCG ATCAGACAGA GATCAACCCC
GAAGGGCCGG AGACCGAGGC CATCGGCGGC GCCGGAACCG ATCACGAGCG CGCCGCGCTC
GAAGACCGCG AGAGCGGCGA CAGGGCCGGT GAGAGCCGGG ACGGCAACGA CGATTCGGCC
GCCGGATCCG CGGCCGACGC GTCCGAACTG GACCCCACGG ACGATGACGC GTGGGCAGTC
GGCGACGTGA GTTCGGACGA CACCGCTGAT CCGGGCGGCC CGCCCGCCGA CCGCACCGGC
GAGTCAGCCG GGCCCACCGC TCCCGACGGT TCCAACAGTT CCGCCGGTTC CGCGTCCGAC
CGCCCCTCGC CGCGGAGCTG GCAGTCGGAG CCGGACGACG CCGAGGACGG TACGGAGGAG
GGCGACACTG GCGCGGTCGC CGGCGTCGAG GCCGACACCG AGGGCGACGC CGGAGAGGCC
GGCGGACTCG ACCGATTCGG CGGCTCGGCG ACCGACGACA ACGAGGATTC TGGCGCCACC
GACACCTCAC CTGATCCCAC CGCCGACGCG TCGGGCGGAC GACGGGAGCC GACGGCCCAA
CCGCGCTCGA CCGCGACCGC ACAGCGGACC CTCGATGGCG AGCCGACGAG CGCGGAGCGC
ACCTACGATT CGCTCCCGCC GCTACGGGTA CTCGGTCAAC TCCACGAGAC GTACGTCATC
GCGGAAGCGC CAGACGGGCT CGTGTTGATC GACCAGCACG CCGCCGACGA GCGAGTGAAC
TACGAGCGCC TGCAGACCGC CTTCGCAGAC GGTGCCGACG CGCAGGCGCT CGCGGAACCA
GTTCGGATCG AACTCACCGC CCGGGAGGCC GCGCTGTTCG AGGAGTTCGT CGATGACCTC
GCGGGGGTCG GATTCCGAGC CGAGCGCGCG GACGAGCGCG AGGTGGTCGT CGAGTCGGTC
CCGGCGGTGT TCGACGCCGC GCTCGATCCC GAACTCCTCC GAGACGTGCT CTCCGCGCTC
GTCGGCGACG CGACCGCGGG CGACGAGCCG GTGACGGACG TGGTCGACGA ACTGCTCGCG
GATCTCGCGT GTTACCCCTC CGTGACCGGG AACACCTCGC TGACGGAGGG GTCGGTCGTC
GACCTGCTCG ACCGGCTCGA CGACTGCGAG AACCCCTACG CCTGCCCGCA CGGTCGGCCA
GTCGTGATCC GGCTCAACCG CGAGGAGATC GGCTCCCGGT TCGAGCGTGA CTACCCCGGT
CACGCGGGTC GACGCACAGA GTAG
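
The gene sequence above is printed in space-separated blocks of ten bases. The sketch below shows one way to strip that layout and recompute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and the record does not say how its own figure was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = ("ATGGAGCCGC CGAACATCGA GCGGTTGGAT "
                  "GAGCGGACCG TCCAGCGCAT CGCGGCCGGT")
    print(len(clean_sequence(first_line)), gc_percent(first_line))
```

To check the whole gene, pass all the sequence lines to `gc_percent` as one string.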
 
Protein sequence
MEPPNIERLD ERTVQRIAAG EVVERPASVV KELIENSLDA GATRVAVSVE AGGTEGIRIR 
DDGVGIPADQ LEAAVAEHAT SKIGKIEDLD HGVGTLGFRG EALYTVGAVS RLTVRSRPPG
ADAGSEITVE GGDVGDVRPA GCPEGTTVEV DGLFYNTPAR KKFLKRTATE FDRVNAVVTG
YALANPGVAV SLEHDGRETF ATEGNGDLRS AVLAVYGREV ADAMVDMEWE PGNSDTDSPV
HSVTGLVSHP ETTRSSREYL ATYVNGRYVT AGALRDAVLD AYGGQLAPDR YPFAVLFVEV
PPGDVDVNVH PRKLEVRFDE EPAVRAAVEE AVEAALLDHG LIRSTAPRGQ SAPDQTEINP
EGPETEAIGG AGTDHERAAL EDRESGDRAG ESRDGNDDSA AGSAADASEL DPTDDDAWAV
GDVSSDDTAD PGGPPADRTG ESAGPTAPDG SNSSAGSASD RPSPRSWQSE PDDAEDGTEE
GDTGAVAGVE ADTEGDAGEA GGLDRFGGSA TDDNEDSGAT DTSPDPTADA SGGRREPTAQ
PRSTATAQRT LDGEPTSAER TYDSLPPLRV LGQLHETYVI AEAPDGLVLI DQHAADERVN
YERLQTAFAD GADAQALAEP VRIELTAREA ALFEEFVDDL AGVGFRAERA DEREVVVESV
PAVFDAALDP ELLRDVLSAL VGDATAGDEP VTDVVDELLA DLACYPSVTG NTSLTEGSVV
DLLDRLDDCE NPYACPHGRP VVIRLNREEI GSRFERDYPG HAGRRTE
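
The protein sequence can be derived from the gene sequence by codon-wise translation. This minimal sketch uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which start codons they allow). The function and variable names are hypothetical, and this is not necessarily how the database produced its translation.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    print(translate("ATGGAGCCGC CGAACATCGA GCGGTTGGAT"))
```

The first three blocks of the gene sequence translate to MEPPNIERLD, matching the first block of the protein sequence. The final codons ACA GAG TAG give TE followed by a stop, matching the end of the protein.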