Gene Dret_1897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1897 
Symbol 
ID8419740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2178954 
End bp2181257 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content64% 
IMG OID645038483 
ProductMutS2 family protein 
Protein accessionYP_003198759 
Protein GI258406017 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.557227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0324553 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCCC GTACGCTACG TTTATTGGAA TTTCCCCAAC TTTTGGAACA CCTTGCTTCG 
CGGGCCCAGT CCGAAGCCGG GCAAGCGGCT TGCCGCGATC TGGCCCCGAT GGCGGATCGC
AATGCCTTGC ACCACCGGCA CGCTCTGGTC GCCGAGGGGC TGGAATGGGC CGGGCAATGG
CTGAATGCCG TCCTGCCGTT TCCCGGACTG CAAGGTGTTT TCGAGTATGT ACAGCAGGAA
GACCGCTTTC TCGATGCCGA TGGATTGTGG GGCGTGGCCC AGGTTCTTCA GCGTGCTGTC
GCCGTCACCG AAGCCATCGA GACCATCAGT GACGAGAGGG CTGTGGCGCT GCACGCCTGG
CGCGCCCGGT GCCCCTGGCC GGAGAAGCTT ACTTCCGCGC TCAAGCGGTG TCTGGGACCC
GACGGACGGC TGACCGACGA GAGTTCCCCC GGTCTGTTGG CCGTGCGGGT CGAATTGCGG
CGCATCCAGC AGCAGTGCAC AAAAAAAGTC CAGGATTCCC TGCACGATGA GGCCCTGTCG
GCCTATCTCC AGGACGAGTA TTTTACCGTC TCCTCGGACC GCTATGTGCT GGCCCTGAAA
GCCAATTTCA AGGGCCGTAT CCCTGGTATT GTCCACGACT ACTCCCAAAG CGGGGACACC
TGCTATCTGG AACCCTTTTT CCTGGTCGAT CTCAATAACG CCCTCCAGGA ACTCAAGCAG
CAGGAACGTG AGGAAGAACG CGAAGTCCTG CGGTATCTGA CCGGGCTATT GCATCAGGAA
CGCGACGAGG TGGAGGCGCT GTACGACTGG CTGGTGCAGA CCGATGTCCT GGCCGCCAAG
GTCCGCTTGG CCCAGGCGAT GGACGGGGCC CCACTGGTAC CCGACCCCGG CGCTGCCCTG
CGTTTGAGCG AGGCGCGTCA CCCTTTGCTC GCCCTGGGCG CTGAAACGGT GCAGCCGGTG
GACATCGCCC TGCATGAAGG CCAGCGCGCC CTGGTCATCA GCGGGGGGAA CGCGGGCGGC
AAGACCGTCT GCTTGAAGAC CCTGGGCCTG GTGGCACTCA TGGCCCACGC CGGCCTGCCG
GTCCCTGTGG CGGCCGACAG CACCCTGCCG TTGTGGGAGA CCATATTTGC CGTTCTCGGT
GACGAGCAGA GTCTGGAAGA CCATTTGAGC ACCTTCACCG CACAGATCGG CCATTTTCAA
CGCGCCTGGC CCCAGATGGG TCCCTCGAGC CTGGTCATTC TCGACGAATT CGGCGCCGGG
ACCGACCCGA GTCAGGGCGC CGCCCTGGCC CAGGCAGTCC TCGACGGATT GCTGTCCGCC
GGGGCCTGGA TCGGTGCCGC GACCCACTTC CCGGCCTTGA AGGCCTATGC TATGGGCACA
GAGGGAGTCC GGGCCGCGTC GGTTCTCTTT GATCCCGATT CCCGTCGTCC GCTGTACCGT
CTGGCCTACG ACCAGGTCGG AGCGAGTCAG GCCCTGGATG TGGCCGAAGA CCAGGGGCTG
CCGCAACAGA TTTTGGATCG GGCCAAGGAA TACCTGCTTC TGGAAAGCGA CGAGACCGAC
CGCCTTTTGG AGCGGCTCAA CCGCTTGGCG GCCCGGCGCC AGGAGACCCT GGACGAACTC
GAGGGTCAAA AGGCGCAGCT CGAACGCGAG CGTCAACGGC TACAGGACCG GTTTGAACGG
GAAAGCCGTG CCCTGCTCAA AGATCTCAAG GCCGCCTCCC AGGAGATTGT CCGGGAGTGG
AAGGCCGGCA AGAAAAGCCG GAAAAAGGCC CAGGAAGAAT TGGCTCGGAT GCGGCACCAG
ATCGAAGCCC AGGCCGAAGC CGAGGCGCAG TCCCAGCCGC TGACTCTGGA GGAGATCGAG
CCGGGGCATC GGTTGATCTA TGTGTCCTGG GGGAAGAAGG GCGCCGTCGA GGAGGTCGAC
AGCCGCAAAA AGCGGATCAA GCTCGATCTC GGCGGCGTGT CGTTATGGGC TTCACTTGCT
GAGGTCCAGC GCGTTGCAGG CGAGGGCACG AAGTCGCCGT CTTCCTCGGT CCAATTCGTT
CAGACCGAAG AGTCCGGCCT GCCTTTGCGC CTCGATGTGC GGGGCATGCG GGCTGAAGAG
GCCCGCAACG AAGTCGAGCG GTTTCTGGAT CGCGCGCGCC TCGAAGGCCG GCGCCATCTG
GAAATCGTCC ACGGCAAGGG CGAGGGGGTG CTGCGCCGCG AGGTGCAGGA CATGCTCAGA
CGGGCGCCCG GTGTTGTCCA TTTCGAACTC GGCCGCCCCG AGCAGGGCGG CGACGGAGTG
ACGCAGGTGG AATTGGGTGA TTGA
 
Protein sequence
MESRTLRLLE FPQLLEHLAS RAQSEAGQAA CRDLAPMADR NALHHRHALV AEGLEWAGQW 
LNAVLPFPGL QGVFEYVQQE DRFLDADGLW GVAQVLQRAV AVTEAIETIS DERAVALHAW
RARCPWPEKL TSALKRCLGP DGRLTDESSP GLLAVRVELR RIQQQCTKKV QDSLHDEALS
AYLQDEYFTV SSDRYVLALK ANFKGRIPGI VHDYSQSGDT CYLEPFFLVD LNNALQELKQ
QEREEEREVL RYLTGLLHQE RDEVEALYDW LVQTDVLAAK VRLAQAMDGA PLVPDPGAAL
RLSEARHPLL ALGAETVQPV DIALHEGQRA LVISGGNAGG KTVCLKTLGL VALMAHAGLP
VPVAADSTLP LWETIFAVLG DEQSLEDHLS TFTAQIGHFQ RAWPQMGPSS LVILDEFGAG
TDPSQGAALA QAVLDGLLSA GAWIGAATHF PALKAYAMGT EGVRAASVLF DPDSRRPLYR
LAYDQVGASQ ALDVAEDQGL PQQILDRAKE YLLLESDETD RLLERLNRLA ARRQETLDEL
EGQKAQLERE RQRLQDRFER ESRALLKDLK AASQEIVREW KAGKKSRKKA QEELARMRHQ
IEAQAEAEAQ SQPLTLEEIE PGHRLIYVSW GKKGAVEEVD SRKKRIKLDL GGVSLWASLA
EVQRVAGEGT KSPSSSVQFV QTEESGLPLR LDVRGMRAEE ARNEVERFLD RARLEGRRHL
EIVHGKGEGV LRREVQDMLR RAPGVVHFEL GRPEQGGDGV TQVELGD