Gene Information |
Locus tag | Caul_4062 |
Symbol | mutL |
ID | 5901524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4396087 |
End bp | 4398018 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641564583 |
Product | DNA mismatch repair protein |
Protein accession | YP_001685685 |
Protein GI | 167648022 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
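The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions are consistent with the values in this record, but the record does not state them); the function names are illustrative only:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Coordinates taken from the record above.
START_BP, END_BP = 4396087, 4398018

length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # compare with "Gene Length"
print(protein_length(length_bp), "aa")  # compare with "Protein Length"
```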
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATCC GCCGCCTGCC GCCCGAGACC GTCAACCGCA TCGCCGCCGG CGAGGTGGTG GAGCGGCCGG CCAGCGCCAT CAAGGAGCTG GTCGACAACG CCATCGACGC CGGCGCCACG CGGATCGAGG TCGAGGCCCA TGGCGGCGGC CTGACCCGCA TCCTGGTGGC CGACGACGGC TGCGGCCTCT CGGCCGAGGA GTTGCCCGTC GCCCTCGAGC GCCACGCCAC CAGCAAGCTG GCGCCCGACG CCCAAGGCCT GTGGGACCTG CTGGCCATCC ACACCATGGG CTTCCGGGGC GAGGCCCTGC CGTCGATCGG CTCGGTGGCG CGGCTGTCGA TCTCTTCCCG GGCCAAGGGC TCGAGCGACG CCTGGTCGAT CCTGGTCGAG GGCGGCAGCG TCGGCGACGT GGCCCCCGCC GCCTTCGCCG GAGACCACGG GGCGCGGATC GAGGTGCGCG ACCTGTTCTA CGCCACCCCC GCGCGGCTGA AGTTCATGAA GTCCGAGCGC GCCGAGGCCC TGGCGATCAC CGAGGAGATC AAGCGCCAGG CCATGGCCAA CGAGAGCGTC GGCTTCAGCC TGGACATCGA CGGCCGCCGC ATCATCCGCC TGCCGCCCGA GCATCCCGGA CCGCAAGGCC GCCTGGCCCG ACTGTCGGCC GTGCTGGGCC GCGACTTCCA GGACAACGCC ATCGAGATCG ACCAGACCCG CGACGGCGTG CGGCTGACGG GCTTCGCGGG CCTGCCGACC TACAACCGCG GCAACGCCGC CCACCAGTAC CTGTTCGTCA ACGGCCGGCC GGTACGCGAC CGGCTGCTGC AAGGCGCCCT GCGCGCCGCC TATGCCGATT TCCTGGCCCG CGACCGCCAT CCGACGGCGG CCCTCTATAT CTCGCTCGAC ACCTCGGAAG TGGACGTCAA CGTCCACCCG GCCAAGGCCG AGGTGCGGTT CCGCGACCCG GCCCTGGTGC GCGGCCTGAT CGTCGGCGCC CTGCGCCACG CCCTGGCCGG GGCCGGCCAC CGCGCCTCGA CCACCACGGC GGCGAGCGCG CTGGACGCCA TCCGGGCGCA GAGCATGGTC CCACCCGGCG CGTACAGCGG CTACCAGCCC AATGCTTACC AAGGGGGCCC CTCCCCCGCC GGCTTCTCGG CCTGGCAGGC GGGCGGCTGG ACGCGGCCTT CGCCCCAGGT GCTGCCGGGC CTGTCGGACG TCTCGGCGCG GGTCGAACCC GGCGGCTACG GCGTGGCCGA GGCGGTGCGC GAGGCGGCGT TCGGCGAGTA CGACGCGCCC CCCAATGTGG CCTATCCCGG CGGCTTCAGC GAAGACCCCG CCCCGGTCTT CGACCCCGTC GACTTCCCGC TCGGCGCGGC CCGCGCCCAG GTGCACGAGA CCTATATCGT CGCCCAGACC CGCGACGGCG TGGTCATCGT CGACCAGCAC GCCGCCCACG AGCGCCTGGT CTATGAGCGG ATGAAGGGCG AGATGGCGGC CGGCCGCGTG GCCCGCCAGG CCCTGCTGCT GCCCGAGGTG GTCGAGCTGG ACCCCGCCGA GGCCGAGCGC GTCGTCGCCC GCGCCGAGGA ACTGGCGGCC CTGGGCCTGG TCATCGAAAG CTTCGGCCCC GGCGCGGTGC TGGTGCGCGA GACCCCGGCC CTGCTGGGCA AGACCGACGC CGCCGGCCTG GTCCGCGACA TCGCCGACGA CCTGGCCGAG AACGGCCAGG CCCTGGCCCT GAAGGAGCGG CTGGAAGAGG TCTGCTCCAC CATGGCCTGC CACGGGAGTG TGAGGGCGGG GCGGCGGCTG 
ACCGGGGCGG AGATGAACGC CCTGCTGCGC GAGATGGAGG CGACGCCGCA CTCCGGCCAG TGCAACCACG GGCGGCCGAC TTATGTGGAG CTGAAGTTGG CGGATATTGA GAGGTTGTTT GGGAGGCGGT AG
|
Protein sequence | MPIRRLPPET VNRIAAGEVV ERPASAIKEL VDNAIDAGAT RIEVEAHGGG LTRILVADDG CGLSAEELPV ALERHATSKL APDAQGLWDL LAIHTMGFRG EALPSIGSVA RLSISSRAKG SSDAWSILVE GGSVGDVAPA AFAGDHGARI EVRDLFYATP ARLKFMKSER AEALAITEEI KRQAMANESV GFSLDIDGRR IIRLPPEHPG PQGRLARLSA VLGRDFQDNA IEIDQTRDGV RLTGFAGLPT YNRGNAAHQY LFVNGRPVRD RLLQGALRAA YADFLARDRH PTAALYISLD TSEVDVNVHP AKAEVRFRDP ALVRGLIVGA LRHALAGAGH RASTTTAASA LDAIRAQSMV PPGAYSGYQP NAYQGGPSPA GFSAWQAGGW TRPSPQVLPG LSDVSARVEP GGYGVAEAVR EAAFGEYDAP PNVAYPGGFS EDPAPVFDPV DFPLGAARAQ VHETYIVAQT RDGVVIVDQH AAHERLVYER MKGEMAAGRV ARQALLLPEV VELDPAEAER VVARAEELAA LGLVIESFGP GAVLVRETPA LLGKTDAAGL VRDIADDLAE NGQALALKER LEEVCSTMAC HGSVRAGRRL TGAEMNALLR EMEATPHSGQ CNHGRPTYVE LKLADIERLF GRR
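The protein sequence above can be cross-checked against the gene sequence. The following is a rough sketch, not the database's own method: it strips the 10-base spacing, computes the GC fraction, and translates with the codon assignments of translation table 11, which for elongation codons are the same as the standard code. Special handling of alternative start codons is omitted because this gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines between sequence blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate("ATGCCGATCC GCCGCCTGCC GCCCGAGACC"))
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence once its spacing is removed, and `gc_fraction` can be compared with the reported GC content.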