Gene Dret_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1087 
Symbol 
ID8418912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1277232 
End bp1278980 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content62% 
IMG OID645037659 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_003197953 
Protein GI258405211 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0917186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0241297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCGC CTATTCGGGT CCTTCCCCCG GTATTGCAGA ACCAGATCGC CGCTGGAGAG 
GTCGTCGAAC GCCCGGCCAA TGTCGCCAAG GAATTGATCG AAAACAGTCT GGATGCCGGT
GCGACGCGGA TCGAAATCAA TCTGGAAAAC GGCGGTCAGG GATTGATCCA GGTCCAGGAC
AACGGGCACG GCTTGGCGGC CGAGGACATT CCCCTGGCCC TAACCCGGCA CGCCACGAGC
AAAATCCAGG GCATCGAGGA TCTCGTGCGT ATCGCTAGCC TCGGTTTCCG GGGCGAAGCC
CTGCCGAGTA TCGCCTCCGT CTCCCGGCTC TCTTTCGCTT CTGCCCCTGC CGGCGTCGAA
GCCGGACACG AGGTCGAAGT CGCTTTTGGC GAAATGGCCT CGGAAGGGCC GGTGGCCTTG
CAGACTGGGA CCCGCATCCA GGTCCGCGAC CTGTTCTCCA ACACCCCGGC CCGGCTCAAA
TTTCTCAAAA CCCAGGCCAC AGAAACCCGG CGCTGCCAGG AAACAGTGTT TAAAATCGCC
CTGGCCCACC TGGATAAGGA GTTCACGCTC AGCATCCAGG GACGCAACCA GCTCCATCTG
GTTAAACATC AGGCATTGCC GCAACGGCTG GCTGCAGTGT GGCCGCAACA GGTTATCGAC
GGGCTGCTCC CGCTCCAGGG AGAACGTCAC GGGCTCCAGG TGGAAGGGGT TATCGGATCC
CCACGGACCG CCCAGGGCCG GGCGCAACGG ATCCATTTTT TCATCAACGG CCGCCCGGTC
CAGGACAAGA TCCTGCTCAA AGCCCTGCGC GAGGGATACA AGGGGGCGCT GCTGAGCAAG
GAGTACCCCC AGGCCGTGCT GTTTTTGACC CTGCCGCTGG AAGAGGTGGA CATCAATGTC
CATCCTGCCA AATCCGAGGT CCGCTTCCGC GATGAGCGCA GTGTGGCTGG TCTGGTGTGC
ACAACGGTTC GCAATGCACT TGAAAAATGT GACCCCAGTC AGGCTGTACT CGGGGAAGCC
CCTGCCACAC AGTCTTCGGT CCAGGCCCCA CAAGCGGCGG CTCCGAAATT CGGAACCTTC
CATGAGTACC TGGAAAGCGC GGACGCACCA TCGCCGCAGA CACGAGCGCG GTACACCTCT
TCGTCCCCAA ATACACCACA GACCGAGACA GCACCGCAGA TTCGCGAATC GGGCCCGGCC
TCCCGGGAAC CGGCCTACCT TGGCCAGGTC GCCTCGACAT ATCTCTTGGT CGACGACGGG
CAGGGATTGC TCCTGGTCGA TCAGCACGCG GCCCACGAGC GCATCCTGTT CAACGCCCTG
CGCCAGCAGC ATGACGGTGT CGAGCAACAG CTCCTGGCCC TGCCGCTGGA GCTGGCCCTG
CAGCCTGGTG AAAGGGAACG GTTTCAGGAC GTGGCGTTGC AATTGCGCCA GATGGGATTC
TCCTGTTCCC TGGTCGAATC CTCCCTGCTT TCGGTACAGG CCATCCCCGC CAGTCTGGCC
ACGGATCAGG CCCGGGCGTT GCTTCGCGAC ATCTTGGGAG GCAAAGAACG CTCCGTGGAT
GACATGCGGG TCGTCCTTGC CTGCCGCATC GCCATCAAGG CAGGGGACCC ACTGAGCCGG
GACGAGGCAC TGCACCTCTT ACAGGCCTGG GAACAGAGTC AGGAGCGATT TTATTGCCCC
CACGGTCGCC CGGTGGCCGT CCGCTTCGGG CCGGGTGACC TGGAGCGACT TTTCAAACGG
GGACAATAG
 
Protein sequence
MSAPIRVLPP VLQNQIAAGE VVERPANVAK ELIENSLDAG ATRIEINLEN GGQGLIQVQD 
NGHGLAAEDI PLALTRHATS KIQGIEDLVR IASLGFRGEA LPSIASVSRL SFASAPAGVE
AGHEVEVAFG EMASEGPVAL QTGTRIQVRD LFSNTPARLK FLKTQATETR RCQETVFKIA
LAHLDKEFTL SIQGRNQLHL VKHQALPQRL AAVWPQQVID GLLPLQGERH GLQVEGVIGS
PRTAQGRAQR IHFFINGRPV QDKILLKALR EGYKGALLSK EYPQAVLFLT LPLEEVDINV
HPAKSEVRFR DERSVAGLVC TTVRNALEKC DPSQAVLGEA PATQSSVQAP QAAAPKFGTF
HEYLESADAP SPQTRARYTS SSPNTPQTET APQIRESGPA SREPAYLGQV ASTYLLVDDG
QGLLLVDQHA AHERILFNAL RQQHDGVEQQ LLALPLELAL QPGERERFQD VALQLRQMGF
SCSLVESSLL SVQAIPASLA TDQARALLRD ILGGKERSVD DMRVVLACRI AIKAGDPLSR
DEALHLLQAW EQSQERFYCP HGRPVAVRFG PGDLERLFKR GQ