Gene Dhaf_0230 details


Gene Information

Locus tag: Dhaf_0230
Symbol:
ID: 7257191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 251554
End bp: 253923
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 51%
IMG OID: 643560144
Product: MutS2 family protein
Protein accession: YP_002456734
Protein GI: 219666299
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
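
The coordinates and lengths in this record are mutually consistent, which can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(251554, 253923)
print(length, protein_length(length))  # 2370 789
```

Both values agree with the Gene Length (2370 bp) and Protein Length (789 aa) fields of the record.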


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000337666
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTACTCA ATGAACGGGT GATCAGGAAA CTCGATTTTG ATAAAATCTT AGAACGTCTT 
GCCAATCAGT GCATTATGCC GCGGGCCCGG GAATTGGCAG AGCAATTGGA ACCCCACTCA
CATTTGGATT TGGTGCGGGA GGCTTTGGAG GAAACCGGTG AGGGCAAGGA TATACTAAGG
ATTAATCCGC TTTTTTCGGT GCGGGGAGCC CGGGAAATCC GGCCTTTAGT GGAGCGGTGC
TTAAAGGGAG GGACCTTAAC GACCGATGAG CTGCTGCAGA TTCGGGACAC TTTAAAAGCA
GCCCGCATCG TAAAGCAAGG CCTGCAAGAA GGCAAGGCGG AGGTTCCTCA TCTTAAGGGG
ATCATGGAAC AGGTTATTCT GCCCAAGGGT ATTGAGGAGG AGATTACCCG CTGTATTACG
GAGGATGGTC AGGTAGCGGA TCAAGCGTCC TCTGTTTTAG CGGATTTGCG TCGGAGCATC
TCCCGCCTGC AGACCAGAAT CAGGGAAACC TTAGATGGCA TCATTCGCAA CCCCGCCTAC
CAAAAGATTC TTCAGGATCC TATTGTTACT CAGCGCTCGG AACGCTATGT GGTCCCGGTC
AAACAGGAAT ACCGCCAATC ATTTCAAGGT ATCGTCCATG ATCAATCGGC CAGTGGAGCG
ACCCTTTATA TTGAGCCCAT GGCTGTGGTC AACTTGGGAA ATGAGCTGCG GGAAGTGGTC
CTTAAGGAAC AAAGGGAGGT TCAGCGAATC CTTTTACTCC TCTCAGCCCG TGTGGAAGGG
GAGGCAGAGG CCATAGCCGA TGCTCACGAG GCCCTGGCTC GGGTGGATTT CATCCTGGCT
AAAGCACGGC TCAGTGAAGA GATGAATGCA GGAGCGCCGA TTCTTACCGA AAAGCAGGAG
ATAAGCCTTG TCCAGGCGCG GCACCCTCTT CTTACAGGAA AGGTGGTGCC CTTAACCATC
CAATTGGGGA CTCGCTTCGA TACGGTGGTG GTCACTGGCC CCAATACCGG CGGGAAAACG
GTGGCTTTGA AAACCATTGG ACTTTTGGCG GCTATGGCTC AGTGCGGTTT GCATATCCCT
GCCGAGTCGG ACTCCCGGGT AGGGGTCTTT ACTCAGATCT TTGCTGATAT CGGGGATGAG
CAGAGTGTGG AACAGTCCTT AAGTACCTTT AGCGGTCATA TGAAAAACAT TGTGGAGATT
GTGGAAAAAG CCGATTGGCG TTCCCTGATC CTCCTTGATG AAGTGGGAGC GGGCACTGAC
CCTACGGAAG GTTCAGCTCT GGCTATGGCG ATCATTGCTG AGCTTCATGA ACGGGGAGCG
AGAATTGTGG CTACCACCCA TTATGGAGCG TTAAAGAATT TTGCCTACAA TACGACCCGG
GTAGAAAATG CTTCTGTAGA ATTTGATTCG GAGACCTTAA GACCTACCTA TCGCCTGCTG
ATCGGTATTC CCGGCAAAAG CAATGCCTTT TATATCGCCG GACGGCTGGG ACTGCCGGAA
GGGGTCTTGG ATCGGGCGCG AACCTTTGTT ACGGAGCGGG AAATGCAGGT GGCCGATTTA
ATCGAAAACC TTGAGGATAC TCAACGGGAG ATTGATTTGG AAAAACGCCG GGCCCGTGAG
GAACGGCAGG CCATTGAGAA GGAAAGTCTC GGACTGAAGG AGAAATCTCA AAAGCTGGAG
GATGACTATC AGGAACTTAT GGCCAAGGCT AGGGATCAGG CCACAGAGAT TGTCCGGGAG
GCAAGGAGAG AAGCGGAGAG GCTTATTGAT GAGTTGAAGC TTGCTCTTAA AGAGGAGCGA
AAAGATCAAC AAGCTATTGA AAAAACCCGC CAGGGGATTC GTAAATTGTC CAATAAAGTG
GGAGATCAGG ATACCCCGCT GAGAACTCCT CATGGAGTGG AGCCTCAAGA GATTAAACTG
GGGCAGATGG TATATATGAC TAAGCTCCGG CAAAAAGGGC AGGTCCTTAA GCTGCCCAAT
GATTCTGGAG AGGTTTTTGT CCAGGCAGGA GTGATTAAGT TGAATGTTCC TCTGAGTGAA
ATTCGGCTGA TCCAGGAAGA GAAAGCGGCC AAACCCACTC GATCCGTAGG CGGACAGGGG
AAAGTCGGGA TGAAAAAAGC CGAAACCATC CGTACGGAAA TCGATCTACG GGGGATGATG
GTGGAAGAGG CCGGCTATGA ATTGGATAAA TATCTGGATG ACGCCGTACT GACGGGAGTC
GGCCAGGTCT ATGTCATTCA TGGCAAAGGG ACAGGGGCTT TAAGACAGGG GATTCATGAA
TTCCTGCGCG GTCATCACCA TGTCAAATCT TTCCGCTTAG GACAGCATGG GGAAGGGGAT
CTTGGCGTAA CTGTCGTGGA ATTAAAATAA
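
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that, assuming GC content is defined as the fraction of G and C bases among all bases. It strips the spaces and line breaks used to group the sequence into blocks of ten. The helper name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of ten bases from the gene sequence: 2 G + 2 C out of 10.
print(gc_content("ATGGTACTCA"))  # 0.4
```

Passing the full gene sequence instead of a single block would give the value to compare against the rounded percentage shown in the record.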
 
Protein sequence
MVLNERVIRK LDFDKILERL ANQCIMPRAR ELAEQLEPHS HLDLVREALE ETGEGKDILR 
INPLFSVRGA REIRPLVERC LKGGTLTTDE LLQIRDTLKA ARIVKQGLQE GKAEVPHLKG
IMEQVILPKG IEEEITRCIT EDGQVADQAS SVLADLRRSI SRLQTRIRET LDGIIRNPAY
QKILQDPIVT QRSERYVVPV KQEYRQSFQG IVHDQSASGA TLYIEPMAVV NLGNELREVV
LKEQREVQRI LLLLSARVEG EAEAIADAHE ALARVDFILA KARLSEEMNA GAPILTEKQE
ISLVQARHPL LTGKVVPLTI QLGTRFDTVV VTGPNTGGKT VALKTIGLLA AMAQCGLHIP
AESDSRVGVF TQIFADIGDE QSVEQSLSTF SGHMKNIVEI VEKADWRSLI LLDEVGAGTD
PTEGSALAMA IIAELHERGA RIVATTHYGA LKNFAYNTTR VENASVEFDS ETLRPTYRLL
IGIPGKSNAF YIAGRLGLPE GVLDRARTFV TEREMQVADL IENLEDTQRE IDLEKRRARE
ERQAIEKESL GLKEKSQKLE DDYQELMAKA RDQATEIVRE ARREAERLID ELKLALKEER
KDQQAIEKTR QGIRKLSNKV GDQDTPLRTP HGVEPQEIKL GQMVYMTKLR QKGQVLKLPN
DSGEVFVQAG VIKLNVPLSE IRLIQEEKAA KPTRSVGGQG KVGMKKAETI RTEIDLRGMM
VEEAGYELDK YLDDAVLTGV GQVYVIHGKG TGALRQGIHE FLRGHHHVKS FRLGQHGEGD
LGVTVVELK
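
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation using only the Python standard library. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGGTACTCA ATGAACGGGT GATCAGGAAA CTCGATTTTG ATAAAATCTT AGAACGTCTT"
print(translate(first_line))  # MVLNERVIRKLDFDKILERL
```

The first 60 bases of the gene translate to the first 20 residues of the protein sequence, and the final 30 bases (ending in the stop codon TAA) translate to the closing residues LGVTVVELK.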