Gene Daro_3164 details


Gene Information

Locus tag: Daro_3164
Symbol:
ID: 3568405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3400423
End bp: 3402270
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 66%
IMG OID: 637681635
Product: DNA mismatch repair protein MutL
Protein accession: YP_286364
Protein GI: 71908777
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
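
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is an illustration only; it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3400423
end_bp = 3402270

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1848 bp) and Protein Length (615 aa).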


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCCGC ATTATAATGC GCCGATGCCG ACCATTGCCC GCCTCCCCGA CCTGCTGATC 
AGCCAGATTG CCGCTGGCGA AGTGGTCGAA AGACCCGCCT CCGTCCTCAA GGAACTGCTC
GAAAACAGCC TCGATGCAGG CAGCAAGGCC ATCCAGGTCC ATCTGGAGGA AGGCGGCGTC
AAGTTGATCC GCATCACCGA CGACGGTTGC GGCATTGCCA GGGATGAACT GGCGCTGGCC
CTGACCCGCC ATGCCACCTC GAAGATTTCC AGCCTCGACG ATCTTGAGCG CGTCGGCACC
CTCGGTTTTC GCGGAGAGGC ACTGGCTTCG GTCGCTTCCG TCGCCCGCCT CAGCATCACC
AGCCGCGAAC GCGGTGCGGC GCATGCCTGG AAACTGCGCG GCGAACCTGG TGCCGAACCG
GAACCGGCCG CCCTGATGGC CGGAACCGTC GTCGAAATGC GCGACCTCTA TTTCAACACC
CCGGCCCGCC GCAAGTTTTT AAAATCGGAA AGTACCGAGT TCGCCCACTG CGCCGACGCC
GTCAAGCGCC TGGCGCTGAC CCGCCCGGAT GTCGCGATCA GTCTGACCCA CAACGGCCGC
AACCTGTTCC AGCTCGCCCC GGCCGATGCC CCCCGGCGCA TCGCCGACAT CCTCGGCGAC
GAATTCCTCG GCGCAGCCCG CGCCGTCGAT GCCGGTACCG GCGCGCTGTC GATTGGCGGC
TTCGCCATCG ACCCGACCCG TGCTACCGAT GCCAAGGACG GCCAGTATGT CTTCGTCAAT
GGCCGTTTCG TCCGTGACAA GATCATCAGC CACGCCCTGC GTGAAGCCTA TCGCGATGTA
CTGCATGGAA GCCGCCAGCC GGCCGTCTGC CTGTTCGTCA ATATCGACCC GGCACTGGTC
GACGTCAATG TGCACCCGGC CAAGACCGAA GTCCGCTTCC GCGATTCGCG TGCCATGCAC
CAGTTCGTCT TCCACGCCAT CCAGCGCACG CTGGCCGCGC CGGTGCAGGC AGAAGCTGCT
CCCGCACTGA TCGAGCGCGC TACGCCAAGC AGCGAATACC AAGCGCCTGC ACTAAGCCCG
GAAAATCGGC CAAATGTCGA ATACGCCCCG CCGCTGCGCC ATCAAGGCAG CCTCGGCGTC
GCCGAACCGG CCGCTGCCGC CTATCTCGCC TTCGCCCGTG CCGCGCAGGA CATCGGCCAG
TCACCGGCCC CGCGCAGTGA AGTCGCCCCG CAATTCCAGC CCATTGAAAG CACCGGCAGC
GATGGCCCGC CGCTCGGCTA TGCGCTTGCC CAGCTGCATG GCATCTACAT CCTCGCCCAG
AATGCCCGCG GCCTGATCCT GATCGACATG CATGCCGCGC ACGAGCGCAT CCTCTACGAA
AAACTGAAGA CTGCCTTCGA CAACCGCCAG ATCGCCACCC AGGCGCTGCT GATCCCGGCC
GTTTTCTCGG CCGACCCGCT CGACATCGCC GCCGTCGAAG AACACGCCGA CGCGCTGGCC
GACCTCGGTT TCCAGATTAC TCCGCTCGGC CCCAACCAAT TGGGCGTCCG CGCCGTTCCC
GCCCTGCTCC AGTCCGGCGA CCCCGCTGCA CTGGCCAAAT CCCTGATTGC CGAACTGCGC
GAACACGGCA TCACCCAGCT CGCCACCGCC CGCCGCAACG AACTGCTGGC CACCATGGCC
TGCCACGGCG CCGTCCGCGC CCGCCGCCAG CTGACCGTGC CGGAAATGAA CGCCCTGCTC
CGCCAGATGG AAGAAACCGA ACGCGCCGGC CAATGCAACC ACGGTCGGCC GACATGGACG
GAACTGACGA TGGATCAGCT CGACAAGCTT TTCCTACGCG GGCAGTAA
 
Protein sequence
MWPHYNAPMP TIARLPDLLI SQIAAGEVVE RPASVLKELL ENSLDAGSKA IQVHLEEGGV 
KLIRITDDGC GIARDELALA LTRHATSKIS SLDDLERVGT LGFRGEALAS VASVARLSIT
SRERGAAHAW KLRGEPGAEP EPAALMAGTV VEMRDLYFNT PARRKFLKSE STEFAHCADA
VKRLALTRPD VAISLTHNGR NLFQLAPADA PRRIADILGD EFLGAARAVD AGTGALSIGG
FAIDPTRATD AKDGQYVFVN GRFVRDKIIS HALREAYRDV LHGSRQPAVC LFVNIDPALV
DVNVHPAKTE VRFRDSRAMH QFVFHAIQRT LAAPVQAEAA PALIERATPS SEYQAPALSP
ENRPNVEYAP PLRHQGSLGV AEPAAAAYLA FARAAQDIGQ SPAPRSEVAP QFQPIESTGS
DGPPLGYALA QLHGIYILAQ NARGLILIDM HAAHERILYE KLKTAFDNRQ IATQALLIPA
VFSADPLDIA AVEEHADALA DLGFQITPLG PNQLGVRAVP ALLQSGDPAA LAKSLIAELR
EHGITQLATA RRNELLATMA CHGAVRARRQ LTVPEMNALL RQMEETERAG QCNHGRPTWT
ELTMDQLDKL FLRGQ
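
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted), and it uses only the first 60-base line of the gene sequence as input. The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative, not part of the record.

```python
# Codon assignments in NCBI "TCAG" order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is
    removed first. Codons containing letters other than A, C, G, T raise
    KeyError in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGTGGCCGC ATTATAATGC GCCGATGCCG ACCATTGCCC GCCTCCCCGA CCTGCTGATC"
print(translate(first_line))
```

The printed residues can be compared against the first two blocks of the protein sequence; each full sequence line holds 60 bases, so the reading frame carries over unchanged from one line to the next.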