Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DET1195 |
Symbol | |
ID | 3229520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | - |
Start bp | 1089265 |
End bp | 1090965 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637120758 |
Product | MutL/HexB family DNA mismatch repair protein |
Protein accession | YP_181908 |
Protein GI | 57234054 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0321912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTATAA AACTGCTGGA CAAGGCAACT ATTGCCCGCA TAGCCGCCGG AGAGGTCATT GAAAGGCCGT CTTCCGTGGT CAAAGAGCTG CTGGAAAACT CTCTTGATGC CGGGGCTAAA CGGGTGGATG TTGTTATCCG CGAAGGCGGC ATTGGCTATA TAGAAGTAAG TGATGACGGG TGCGGCATAG TTTTTTCGGA AGTTTTGCTG GCCTTTGAAC GCCATGCCAC CAGCAAGCTT TCCAGCTTTG AAGATATTTA TGCTATCTCC AGTCTGGGTT TCAGGGGTGA AGCCCTGCCC AGTATTGCGG CGGTGGCAGA TTTGGAAATG CTGACGGCTG CCCGAACTGA AGAAAGCGGC ACTTATTTGT CTTTGTCCGG CGGCGAAATG GTTAAGCATA CCCGTATGGC ACGTTCACCA GGTACTACTA TAAAGCTCAC CCGCCTTTTC AGCCGGGTGC CTGCCCGCCT GAAATTCCTT AAAACCCCCC AGCATGAAGC CTCCAAAGTG TCGGAAGTGG TGCTGAGCTA TGCTTTGGCT TACCCTGAGG TCAAGTTTAC TCTGAGCATT GACGGGCGGA ATACCTTAAA TACCCCCGGC AACGGCAAAC TGCGGGATGC CGTGCTGGAA ATATACGGAA ACGACGTTGC AAGTAAAATG CTGGATTTGG AAACAGACTC TTACCGTTCA TCTGCCATAA ATATCAGCGG TCTGGTAAGC CCGCCTGAAA TCAGCCGTTC CAACCGTAAT TCCCTCCATT TCTTTGTTAA CCGCCGCCTT ATCCAGAGCA GGGCTTTGCA AAAAGCGGCA GAACAGGCCT ACAGCGGCTT GCTTATGGTG GGGCGTTACC CTCTGGGGGT TATAAATATA TGGCTGGATG GGGCGCTGGT AGATGTAAAT ATTCACCCCA CCAAGGCAGA AGTTAAATTT TCAGATGAAA GTGCCGTTTT TACCGCTGTC CAGCGGGCAG TCCGTTCGGT ACTGGTGGAG AAGCCACCCA CTCCTCATAT AGCCGAAGAA GCGTCTGTTT ACCGGCAGGA ATCTGCCAGA CAAGAGCCAA TCTGGGGTGA GACTCCAAAA CCCGCCGGTA CTGTCCAGCA GTATTTTTCG CCTGTTATCC AGAGTGCTAA AACATCGGTT TTGCCGCTGC TGCGGCTGGT GGGGCAGATA GGCGGCCTTT ACCTGCTGGC CGAAGGGCCG GATGGGCTTT ACATAATAGA CCAGCACGCC GCCCATGAGC GTATCCGTTA TGAAGAAATT GCCTCACAAA CCCCCTCTGA AAATGCGCGC CAGAGCCTTC TTGATCCGTT TATACTGGAA CTAAACCCGG TGCAGGAAGC CATGATTGAA AAATGCAAAT CAGAGCTGGA TTTAATGGGT TTTGAAATAG AAGAATTCGG CCGCAGAGTC TACCGTGTGC AATCAATTCC GGCCGGTTTT ACCGCACCCC AGGCCAAAGC CCTTCTTTCA GAGCTTGTTG ATAATCCCAA AGATGCCCCG GCAGAGATAA AGGAACGTTT ACAGCGGCTG ATGGCTTGCC ATACCGCAGT TCGGGCAGGA CAGGTGCTTA ACGAGGCGGA GATGCGTGAA CTGCTGCTGA AACTGGAGAA AACCGCTGTA CCCGGCCACT GCCCTCACGG GCGTCCCACT ATTGTAAAAA TAGACTTTTG CCAGCTTGAA AAAGACTTCA AGCGTACTTA G
|
Protein sequence | MPIKLLDKAT IARIAAGEVI ERPSSVVKEL LENSLDAGAK RVDVVIREGG IGYIEVSDDG CGIVFSEVLL AFERHATSKL SSFEDIYAIS SLGFRGEALP SIAAVADLEM LTAARTEESG TYLSLSGGEM VKHTRMARSP GTTIKLTRLF SRVPARLKFL KTPQHEASKV SEVVLSYALA YPEVKFTLSI DGRNTLNTPG NGKLRDAVLE IYGNDVASKM LDLETDSYRS SAINISGLVS PPEISRSNRN SLHFFVNRRL IQSRALQKAA EQAYSGLLMV GRYPLGVINI WLDGALVDVN IHPTKAEVKF SDESAVFTAV QRAVRSVLVE KPPTPHIAEE ASVYRQESAR QEPIWGETPK PAGTVQQYFS PVIQSAKTSV LPLLRLVGQI GGLYLLAEGP DGLYIIDQHA AHERIRYEEI ASQTPSENAR QSLLDPFILE LNPVQEAMIE KCKSELDLMG FEIEEFGRRV YRVQSIPAGF TAPQAKALLS ELVDNPKDAP AEIKERLQRL MACHTAVRAG QVLNEAEMRE LLLKLEKTAV PGHCPHGRPT IVKIDFCQLE KDFKRT
|
| |