Gene Hlac_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1501 
Symbol 
ID7400329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1508221 
End bp1510077 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content71% 
IMG OID643708563 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_002566159 
Protein GI222479922 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTGG AGGACTACTG GGGGATCGGC CCGAAGACGA GCGAGCGGCT CACGGAGTCG 
CTCGGGACCG AGCGGGCGAT CGAGGCGATC GAGGCGGCCG ACGTCCGGGC GCTCGTCGAC
GCCGGGCTCC ACCGCGGGCG AGCCACCCGG ATCCTCCGCC GCGCGAACGG CGAGGCCGGC
ATGGGTGTCC TCGCGACTGG CGACGCACGC TCGGTGTACG ACGACCTCCT CACGGTAGCG
GCCGGCCACG CGCTGACGGA CCACGCCGCC GACCGAATCC GGGTGTTGAC GCCCCTCACC
GAGCGGAGCG CGATTGAGTC GCGGCTCGAT GAGGTGGTCG CCGCTCGCGA CGCGTGGGCA
GCACTCGACG ACGGCGAGCG CGACCGCGTC GTCGCCGCGT TCGACGACTA CGACGCGGCC
GAGGGCTCCG ATCTGGCGGC CGTCGAGACC GCGGTCGCGC TGCGCGATGT GGGTCTCACG
GAGACGCCCT TCGAGGATAT CGGGGCGCTG GATGGCGACA GCCTGCGCGA CGCCGCCGAC
GCCCTCGCCG ACGTGCGGGG TGCGATCGAC CCGACGGGCG TCGACGGCGA CGGCGAGATC
GAGGTCGCGC GCGGTGCGGA CGACGAGCTT GACCGCCTGC GTGAGCAGTT CGACGCGGCG
GAGGAGCTGG CGAACTCCGC GTTCGACGTG CTTGATACGG TTCGGGACGG CTCCCTGCGC
GACTTCGAGG CGCTGGAGGC AGCGACGATC GACCACGTCG CTCGCGAGAC CGGCGTCGAT
CCGGCGACGG TGCGCTCGGT CGCGCCGGAC GACGCGATCG ACGCCGCCGA CTTCGTCTCC
GCCACGCTCC GCGATCTGGT GACGGAACTG GAGGCGGCGG TCGCGGAGCG CGAGGAGACC
GTCGCGGCCG ACATCCGCGA GCGGATCGGC GGGATGCGAG TTGGGGATGA GGGAGACAAA
GGTGAGAAAG ATGACGAAGC TGACGAAGCG GCGACCGGAA CCGTCGCCGG CGCGGTCGCG
GCCGTCTCCG ACGCCGCGTT CCTGCTGTCG CTCGCGCGGT TCGCGGTCGC GTACGATCTG
ACTCGACCGA CCCTCGTCGA CGACGGCGTC GCGGTGCGAA ACGCTCGCAA CCTCTTCATC
GACGGCGAGG TCCAGCCGGT GTCGTACGCG ATCGGCTCAC ACTCGCTTGC GGGCGAACCC
GGCGTCGCGA GCGTCGACGC GCCGCCGACC GGCGACCGCG TGAGCGTCCT CACGGGGGCG
AACTCGGGCG GGAAAACCAC CCTGTTGGAG ACGCTGTGTG CGGTGGCACT GCTGGCGTCG
ATGGGGCTTC CGGTGCCGGC CGAGGAGGCG GAGGTCGGTG CGTTCGATCG GATCGTGTTC
CACCGACGGC ACGCCTCCTT CAACGCCGGT GTGTTGGAGT CGACGCTGAA GTCGGTCGTC
CCGCCGCTGG TCGAGGACGG GCGGACGCTG ATGCTCGTCG ACGAGTTCGA GGCGATCACG
GAGCCGGGCC GAGCCGCCAA CCTGCTGAAC GGGCTCGTGA CGCTCACCGT GGACCGCGGC
GCCCTCGGCG TGTACGTCAC GCACCTCGCG GAGGACTTGA GCCCGCTGCC CGAGGCCGCC
CGGATCGACG GTATCTTCGC CGAGGGACTC ACGAACGACT TGGACCTCCG CGTCGACTAC
CAGCCGCGGT TCGGTACCGT CGGGAAGTCG ACGCCGGAGT TCATCGTCTC GCGGCTCGTG
GCGAACGCGA AAGACCGCGG CGTCCGCGCC GGGTTCGAGC ACCTCGCCGG CGCGGTCGGC
GAAGAGGCGG TCCAGCGCAC CCTCTCGGAC GTGGAGTGGT CGGAAGGCGA TGACTGA
 
Protein sequence
MRLEDYWGIG PKTSERLTES LGTERAIEAI EAADVRALVD AGLHRGRATR ILRRANGEAG 
MGVLATGDAR SVYDDLLTVA AGHALTDHAA DRIRVLTPLT ERSAIESRLD EVVAARDAWA
ALDDGERDRV VAAFDDYDAA EGSDLAAVET AVALRDVGLT ETPFEDIGAL DGDSLRDAAD
ALADVRGAID PTGVDGDGEI EVARGADDEL DRLREQFDAA EELANSAFDV LDTVRDGSLR
DFEALEAATI DHVARETGVD PATVRSVAPD DAIDAADFVS ATLRDLVTEL EAAVAEREET
VAADIRERIG GMRVGDEGDK GEKDDEADEA ATGTVAGAVA AVSDAAFLLS LARFAVAYDL
TRPTLVDDGV AVRNARNLFI DGEVQPVSYA IGSHSLAGEP GVASVDAPPT GDRVSVLTGA
NSGGKTTLLE TLCAVALLAS MGLPVPAEEA EVGAFDRIVF HRRHASFNAG VLESTLKSVV
PPLVEDGRTL MLVDEFEAIT EPGRAANLLN GLVTLTVDRG ALGVYVTHLA EDLSPLPEAA
RIDGIFAEGL TNDLDLRVDY QPRFGTVGKS TPEFIVSRLV ANAKDRGVRA GFEHLAGAVG
EEAVQRTLSD VEWSEGDD