Gene Hhal_0668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0668 
Symbol 
ID4710062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp750168 
End bp752030 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content74% 
IMG OID639855130 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_001002252 
Protein GI121997465 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.422126 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACGC GCGGTATTCG TCCCCTGCCG GATAACCTGA TCGACCAGAT CGCCGCTGGC 
GAGGTGGTCG AGCGCCCCGG TTCGGTGGTC AAGGAACTGG TGGAGAATAG CCTCGACGCT
GGCGCCGGGC GTGTCGAGGT GCAGATTGAG CGCGGCGGCA AGCAGCGTAT CCGCATCGCC
GATGACGGCG ACGGCATCCC GCCCGAGGAG CTGGAGTTGG CGCTGCGCCG GCACGCCACC
AGCAAGCTCA CCGGCCTCGA GGAGCTGGAG CGCATCGCCA GTCTGGGGTT CCGGGGGGAG
GCCCTGCCGA GCATCGCTGC CGTTTCGCGG CTGACGCTGG CCTCGCGCAC CGCCGAGGCG
GAGCTGGGCC ATCAGCTGCG CTGTGACGGC GGCGCACTGG GTGCGCCGGA ACCGGTGGCT
CACCCCCCGG GGACCACGGT CACCGTCGAC GACCTGTTCT ACAACACGCC AGGGCGTCGC
AAGTTCCTGC GCACCGAGCG CACCGAGCTC TACCACGTCC AGGAGGCGTT GCGCCGGCTG
GCGCTGAGCC GCTTCGACGT CGGTTTCTCG CTCGTCCATC AGGGGCGGCG GCTCTGGTCG
GTGCCCCGGG CGGAGAGCGA GACGGAGCGG CACGAGCGTC TGGCGGAGCT GCTCGGTCGT
GCCTTTGCCG ATCACGCCCT GGCGGTGGAG TTGGAGGGGG CCGGGCTGCA GCTGCGGGGC
TGGCTGGGGC TGCCTACGGC GGCGCGGCGC CAGGGGGATC TTCAGTACTT GTTCGTCAAC
GGCCGGCTGG TGCGCGACCG GGGGGCGGCC CACGGCATCC GCCAGGCTTA CAGCGACTGC
CTCTACCGCG ATCACTACCC GGCCTACGTC CTCTTTCTGG AGATGGATCC GGCTCGGGTG
GATGTCAACG TCCACCCCAT GAAGCATGAG GTGCGTTTTC GCGACGGGCG GACGGTGCAC
GACTTCCTCG CCCGGCGCAT CGCCGACGCC CTGGCCACCG CCGAGCCGGC CGGTGCCGCG
GCACCGCCTG CGGCGGAGCG GCCGCCGTCG GGCGCGCCGA CCGGCCCCGG GCAACCGGCG
GCGGCCGAGC GGACGGGGCC GGCCTCCGAA GGGCTCGGCG GCACGGCGGA GCTGGGGCTG
CCGCTGGCCG AGGCGCGGCA GCTCTATGGG GGCGCCGACG CCGCTGCCGA GGGCCCTGGC
GGCGCGGCTG AAGGGGCGTC GTCGGCCATT GCCGGATCCC CCGGGCCGTC GCAGACCCGG
GACCGGGAGA GTGAGGCGAC CCCGGAGCTC GGCTACGCCG TCGGGCAGAT CCGCGACGCC
TATATTCTGG CCGAGTCGCA GCGGGGGCTG GTGGTGGTGG ACATGCACGC CGCCCACGAG
CGGGTGGTCT ACGAGCGGAT GAAGGCGCAG CTGGTGGCGT CGGGGATCGC CACGCAGTCG
TTGCTGGTGC CGGTGAGTGT GCCGGTGACC CCGGCGGAGG CCGAGCGGGT GGAGCTGCAC
GCGGCTACGC TGGCCCGGGC GGGTCTGGAG GTGGACCGCG CCGGCCCCGA GTCGGTGCGT
GTTCACCGAG TACCGGCGCT GCTCGCCGAG GCCGACGCCG CGGCCCTGGT CCGCGACGCG
GTGGCGGCGC TGGAGTCCGA GGGGACCGGT GGGCGGGTCG AGGACCGGGT CCACGCGCTG
CTGGCGCAGA TGGCCTGCCA CGGGTCGGTC CGCGCCGGAC GGCGTCTGGA GCGCGCCGAG
ATGGACGCCC TGCTGCGGGA TATCGAGCGC ACCCCGCGGG CGGCGCAGTG CAACCACGGG
CGGCCGACCT ACACCGTGCT CGACGACGAG GCGCTGGCCC GGCTGTTCAT GCGGGGGCGG
TGA
 
Protein sequence
MTTRGIRPLP DNLIDQIAAG EVVERPGSVV KELVENSLDA GAGRVEVQIE RGGKQRIRIA 
DDGDGIPPEE LELALRRHAT SKLTGLEELE RIASLGFRGE ALPSIAAVSR LTLASRTAEA
ELGHQLRCDG GALGAPEPVA HPPGTTVTVD DLFYNTPGRR KFLRTERTEL YHVQEALRRL
ALSRFDVGFS LVHQGRRLWS VPRAESETER HERLAELLGR AFADHALAVE LEGAGLQLRG
WLGLPTAARR QGDLQYLFVN GRLVRDRGAA HGIRQAYSDC LYRDHYPAYV LFLEMDPARV
DVNVHPMKHE VRFRDGRTVH DFLARRIADA LATAEPAGAA APPAAERPPS GAPTGPGQPA
AAERTGPASE GLGGTAELGL PLAEARQLYG GADAAAEGPG GAAEGASSAI AGSPGPSQTR
DRESEATPEL GYAVGQIRDA YILAESQRGL VVVDMHAAHE RVVYERMKAQ LVASGIATQS
LLVPVSVPVT PAEAERVELH AATLARAGLE VDRAGPESVR VHRVPALLAE ADAAALVRDA
VAALESEGTG GRVEDRVHAL LAQMACHGSV RAGRRLERAE MDALLRDIER TPRAAQCNHG
RPTYTVLDDE ALARLFMRGR