Gene Daud_1373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1373 
Symbol 
ID6026736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1452829 
End bp1455183 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content66% 
IMG OID641594193 
ProductMutS2 family protein 
Protein accessionYP_001717514 
Protein GI169831532 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACGGTGG CGAGTGAGAA AACACTGGGC CGGCTCGAAT TCGACAAGGT CCTGGAACGG 
TTGGCCGGAC AGACCCTCTC ACCCCTGGGC CGGGAGCGGG CACGGGCTTT GAGGCCCGCG
GCGGGCCTTG ATGAAGTGCG CCGTTTGCAG GCCGAAACAG ACGAAGGGTA CAACATCCTG
CGGCTGGAAC CTAATGCCGA TTTCGGCGGC TGGCACGACG TCCGTGAACC GGTGCGCCGG
GCGGCGCGGG GTCAGGTGCT GGACGGGGGG CCCTTATTTC AGATCGGGCA GACGCTGGCC
GCGATCCGAA CCCAGAAGAA GTTTCTAATG GACCGCCGGG ACCGCTACCC GCTGCTGGCC
GGCTTGGCCG GCACGATGCC TGTGTTCCCC GAGCTCGAAA AGCGGCTGGT GGAGAGCATC
CTGCCCGGGG GAGAAGTGGC CGACGGCGCG TCCGCTCGTC TGGCCGACCT GCGCCGCCGC
CTCCAGGCGG GCCGCCTCCA GGTGCGGGAG CAGCTGGAGC GCCTGGTCCG CTCACTTGCC
CAGCAGAAGT ACCTCCAGGA GCCGATTATC ACCATCCGGG AAGGCCGTTA CGTGGTCCCG
GTGAAGATTG AATACCGGAA CCAGGTGCCC GGTTTGGTGC ATGACCAGTC GGCCAGTGGA
GCCACCCTGT TCATCGAACC GATGGCCGTG GTCGACAAGA ACAATGAACT CCGCCGGCTC
GAGGCGGCCG AGAAGCAGGA GATACTGAAG ATTCTGACCG AACTGTCCAC GGCGGTGGCC
CAAGCCGCCG ACGAGATCCT GCCCGCGGTG GATCAACTCG GACACTTTGA TTTTGTACTG
GCGAAGGCCC GCTTGAGCCG GCAGATGGCG GCCGTGCCGC CCCTGCTGGA AGACGGGGCT
TTCCTGGAGT TCAGCCGGGC CCGGCACCCC CTGATCCGCG GAAACGTGGT GCCGATCGAC
GGGCGCGTGG GCCGGGATTT TGACCTGCTG GTGCTCACCG GGCCGAACAC GGGAGGCAAA
ACGGTGGCTT TGAAGACCAT CGGTCTTTTA GTGTTGATGG CGCAGGCGGG CCTGCACGTG
CCCGCTTCAT CCTGTGCGGT GGGGCTCTTT GACAGGGTAT TCGCCGATAT CGGCGATGAG
CAGAGTATCG AGAACTCCTT GAGCACCTTT TCGTCCCACA TGGCGAACCT GGTGGACATC
ATCGGGCAGG TTGGGGCGAA GAGCTTGGTG CTGATCGACG AGTTAGGCAC CGGTACCGAT
CCGACCGAGG GCGCAGCGCT CGCCCAGGCG ATCTTGAATG AACTCCACCG CCGGGGGACG
CGGGGGGTGG TCACCACCCA CTACGGGGAG TTGAAGGAGT TCGCCACGGG GCGGGACCGG
GTGGAAAACG CCAGCGTCGA GTTCGATCTG GACACCCTCG AACCCACTTT CCGCCTAGTC
ACCGGCCGGC CAGGACGGAG CTACGCCTTC GAGATCGCCC TGCGCCTGGG GATGCCGGAA
TCGATCGTCT CCCGGGCGCG CGAGTTCCTG GCTCCGGAGC AGCGGCAGAC GGCCGAACTG
CTGCGGCAGT TGGAAGAGAG CCGGCAGGAG GCCGAACGCC AGCGGGAGGA GGCCCGGAAG
GAGGCCCGCG AAGCGTCCAT ACTGAAGCAA CGTTACGAGG CGGAACTGGC TAGCCTGTTA
GATAAGAAAA CGGCGCTCCG GGAGCGGGCG GCCCGGGAGG CGCAGGAATT GATCCGGCAG
GTGCGCCGGG AGGGCGAGGA AATCGTCCGG GAGCTGCGTC GCCAGATCAA CGCCGGAACG
AACCGGGAGA AGGAGCAGGC TATCCAGCAG GCCCGCGCGA GGATCGATGA GCTCGGCGCC
GGCCTTCCTG ATCCGGCAGT GCCGGAGACG GTGGAAGGGG AGCCGGAGCG GCTGGATGGC
GGGGAGGCGG TTTTCATCCC GCGTTTCAGC CAGCAGGGCG TAACCCTGGG TCCTTCCCGG
GATGGTGAGG TGCAGGTCCA GGTGGGCTCC GTCAAGGTGA ACCTGCCGCT GGCCGAGGTG
CGCCGCATGA TTCCTGCTCC CCACAGCACG GCTCCCAACG CCGGGACGGT CGTGGTGCAA
AAAACTCGGG ACGATGTGCG CACCGAATTG GACCTGCGCG GCCTGCACGC CGAGGAGGCG
CTGTCCGAAC TGGAGAAGTA CCTGGACGCA GCGATCCTGG CCGGCCTCCA GCGCGCCTAC
ATTATCCACG GGCTGGGCAC CGGAGTGCTC CGGGCTGCGG TTCAGAACCA CCTGAAGGGG
GACGGCCGCA TTCGGTCGTT CCGCCTGGGT GACAGGGGTG AGGGCGGTCT CGGAGTGACC
GTGGTCGAGT TTTAA
 
Protein sequence
MTVASEKTLG RLEFDKVLER LAGQTLSPLG RERARALRPA AGLDEVRRLQ AETDEGYNIL 
RLEPNADFGG WHDVREPVRR AARGQVLDGG PLFQIGQTLA AIRTQKKFLM DRRDRYPLLA
GLAGTMPVFP ELEKRLVESI LPGGEVADGA SARLADLRRR LQAGRLQVRE QLERLVRSLA
QQKYLQEPII TIREGRYVVP VKIEYRNQVP GLVHDQSASG ATLFIEPMAV VDKNNELRRL
EAAEKQEILK ILTELSTAVA QAADEILPAV DQLGHFDFVL AKARLSRQMA AVPPLLEDGA
FLEFSRARHP LIRGNVVPID GRVGRDFDLL VLTGPNTGGK TVALKTIGLL VLMAQAGLHV
PASSCAVGLF DRVFADIGDE QSIENSLSTF SSHMANLVDI IGQVGAKSLV LIDELGTGTD
PTEGAALAQA ILNELHRRGT RGVVTTHYGE LKEFATGRDR VENASVEFDL DTLEPTFRLV
TGRPGRSYAF EIALRLGMPE SIVSRAREFL APEQRQTAEL LRQLEESRQE AERQREEARK
EAREASILKQ RYEAELASLL DKKTALRERA AREAQELIRQ VRREGEEIVR ELRRQINAGT
NREKEQAIQQ ARARIDELGA GLPDPAVPET VEGEPERLDG GEAVFIPRFS QQGVTLGPSR
DGEVQVQVGS VKVNLPLAEV RRMIPAPHST APNAGTVVVQ KTRDDVRTEL DLRGLHAEEA
LSELEKYLDA AILAGLQRAY IIHGLGTGVL RAAVQNHLKG DGRIRSFRLG DRGEGGLGVT
VVEF