Gene Hlac_2227 details

Gene Information

Locus tag: Hlac_2227
Symbol:
ID: 7399935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2211745
End bp: 2213499
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 74%
IMG OID: 643709299
Product: MutL dimerisation
Protein accession: YP_002566874
Protein GI: 222480637
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
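
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons; if the stop codon is
    counted in the gene length, it is subtracted from the codon count.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(2211745, 2213499)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values from this record, the result agrees with the listed Gene Length of 1755 bp and Protein Length of 584 aa.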


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000451621
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.617767
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGTG AGGAGCGCAC TGCCGGCGGG GGAGCGGGAA CCCCGGACCG CGTCCGCCGG 
CTCGATCCCG CCACCGTCGA CCGGATCGCC GCCGGCGAGG TAGTCACTCG TCCCGCCCGA
GTCGTCGGCG AACTGATCGA CAACGCGCTC GACGCCGGCG CCTCGCGGGT CGAGGTCGCG
GTCGAGGGCG ACGGCACCGA CCGGATCCGG GTCGACGACG ACGGCCGCGG AATGAGTCGC
GAGGACGCCC GACTCGCCGT CGAGCGCCAC GCGACGAGCA AGCTCGCACC CGACGGCGAC
CCGGTCGGCG TCGGCTCGCT CGGCTTCCGC GGCGAGGCGC TGGCGGCGAT CGCCGAGGCC
GCGCGGCTCG AACTCGTCAC CAGCGACGGC GACCCGGTCG GGACGCGGGT GGTCGTCGGC
GGGGCGGCGA GCGACCTCGA TTCGGACGGT CCCGCCGTCA CCGACGCCGG ACGCGCCCGC
GGCACCACCG TCGTCGTCGA GGATCTGTTC GCGACCCGAC CCGCGCGTCG GGAGTCGCTG
GCGGGGTCGG CGGCCGAGTT CGCGCGGGTC TCCTCGCTGG TCGCTGACTA CGCGCTCGCG
AACCCCGGAG TGGCGTTCAC GCTGGATCAC GACGGGTCGC GGACGCTGTC GACGCCCGGG
TCCGGGATCA CGGAGGCGCT GCTCGGCGTG TACGACCGCG AGACCGCGAG CCGCTCGACG
GAGATCGCGG CGAGCGTCGA GATCGAGAGA GAGGGGGGAG GCGTCGGAGG AGACGGGGCC
GCGTCGACCG CGGTCGAGGT CGCCGGCGCC CTCGCGTATC CCTCGGTAAC GCGTTCTTCT
CGGGACCACG TTCGAGTCTC GGTGAACGGG CGCCCGGTCC GGAACGACCG GCTCGCGGCC
GCGGTCCGAG CGGGATACGG CCGCCTGCTC CCTGACGGCC GCGAGCCGGT GGCAACCATC
GACGTGTCGC TGCAGCCCGC CCGCGTGGAC CCGAACGTTC ACCCGGCGAA GCGGGAGGTC
GGGCTCCGCG ACGCCGACGC GGTCGCGGAC GCTGTCGAGT CGGTCGTCGC CGACGCGCTC
ACGGGTGCCG ACCTCCGGCG GAAAGCCGAG GTCGAGACGG GACTCGACGG GGCGTTAGAG
CCGGTCGGTG GCGAGGAGGG CGAGCGCCTG GCGACGTTTG CGGACGCGGA CCCGATCGGG
ACGTTCTGCG ACCTCTACCT CCTCGTCGAG GCCGGCGACG AGCTGCTCGT GGTCGACGGC
CACGCCGCCC ACGAGCGCGT GAACTACGAG TGGCTCGCCC GGGCGTTCGA TGGCGAGGCG
GTGCCGACCG CCGACCTCGA CCCGCCGGTG GCCGTCTCGC TGTCGGCCGA CGAGGCGGCG
GCCGCGGAGG CGCACGCCGA CGCCCTCGCC GCACTCGGGT TCGAGACGGA GCCGTTCGGC
GGGGGGACCG TCAGGCTCCG GACCGTCCCG GCGCCGTTCG GGCACGCGGT CGACGCGACC
GCGTTCCGCG ATGCGCTCGC GACCCTCTCA GGCGGGGCGT CGCCGCGGGA CGCCCGCGAA
ACGCTGCTCG CCGATCTGGC GTGTCACCCG TCGCTGAAGC GCGGCGAGAT CGGGGATCTT
GATGCGGAGG AGCTGCGATC GCTGCTCGAT CGGCTCGGCG AGTGCGACCG CCCGTACGCG
TGTCCGCACG GTCGGCCGAC CGTTCTTTCG GTGGACGAAG CGACGTTCGC AGCCGGGTTC
GGGCGAGACC GATAG
 
Protein sequence
MTGEERTAGG GAGTPDRVRR LDPATVDRIA AGEVVTRPAR VVGELIDNAL DAGASRVEVA 
VEGDGTDRIR VDDDGRGMSR EDARLAVERH ATSKLAPDGD PVGVGSLGFR GEALAAIAEA
ARLELVTSDG DPVGTRVVVG GAASDLDSDG PAVTDAGRAR GTTVVVEDLF ATRPARRESL
AGSAAEFARV SSLVADYALA NPGVAFTLDH DGSRTLSTPG SGITEALLGV YDRETASRST
EIAASVEIER EGGGVGGDGA ASTAVEVAGA LAYPSVTRSS RDHVRVSVNG RPVRNDRLAA
AVRAGYGRLL PDGREPVATI DVSLQPARVD PNVHPAKREV GLRDADAVAD AVESVVADAL
TGADLRRKAE VETGLDGALE PVGGEEGERL ATFADADPIG TFCDLYLLVE AGDELLVVDG
HAAHERVNYE WLARAFDGEA VPTADLDPPV AVSLSADEAA AAEAHADALA ALGFETEPFG
GGTVRLRTVP APFGHAVDAT AFRDALATLS GGASPRDARE TLLADLACHP SLKRGEIGDL
DAEELRSLLD RLGECDRPYA CPHGRPTVLS VDEATFAAGF GRDR
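
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, assuming that layout, of how one might strip the block formatting before recomputing a length or GC percentage. The helper names are hypothetical, and only the first printed line of the gene sequence is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence in this record (six blocks of ten).
first_line = "ATGACGGGTG AGGAGCGCAC TGCCGGCGGG GGAGCGGGAA CCCCGGACCG CGTCCGCCGG "
block = clean_sequence(first_line)
print(len(block), round(gc_percent(block), 1))
```

Applied to the full gene sequence, `clean_sequence` should yield 1755 characters, matching the Gene Length field, and `gc_percent` can then be compared with the listed GC content.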