Gene Caul_4062 details

Gene Information

Locus tag: Caul_4062
Symbol: mutL
ID: 5901524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4396087
End bp: 4398018
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 73%
IMG OID: 641564583
Product: DNA mismatch repair protein
Protein accession: YP_001685685
Protein GI: 167648022
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
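
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based, inclusive coordinates (the usual GenBank convention, not stated on this page) and that the stop codon counts toward the gene length but not the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 4396087, 4398018

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1932
print(protein_length(length_bp))  # 643
```

Both results match the listed Gene Length (1932 bp) and Protein Length (643 aa).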


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGATCC GCCGCCTGCC GCCCGAGACC GTCAACCGCA TCGCCGCCGG CGAGGTGGTG 
GAGCGGCCGG CCAGCGCCAT CAAGGAGCTG GTCGACAACG CCATCGACGC CGGCGCCACG
CGGATCGAGG TCGAGGCCCA TGGCGGCGGC CTGACCCGCA TCCTGGTGGC CGACGACGGC
TGCGGCCTCT CGGCCGAGGA GTTGCCCGTC GCCCTCGAGC GCCACGCCAC CAGCAAGCTG
GCGCCCGACG CCCAAGGCCT GTGGGACCTG CTGGCCATCC ACACCATGGG CTTCCGGGGC
GAGGCCCTGC CGTCGATCGG CTCGGTGGCG CGGCTGTCGA TCTCTTCCCG GGCCAAGGGC
TCGAGCGACG CCTGGTCGAT CCTGGTCGAG GGCGGCAGCG TCGGCGACGT GGCCCCCGCC
GCCTTCGCCG GAGACCACGG GGCGCGGATC GAGGTGCGCG ACCTGTTCTA CGCCACCCCC
GCGCGGCTGA AGTTCATGAA GTCCGAGCGC GCCGAGGCCC TGGCGATCAC CGAGGAGATC
AAGCGCCAGG CCATGGCCAA CGAGAGCGTC GGCTTCAGCC TGGACATCGA CGGCCGCCGC
ATCATCCGCC TGCCGCCCGA GCATCCCGGA CCGCAAGGCC GCCTGGCCCG ACTGTCGGCC
GTGCTGGGCC GCGACTTCCA GGACAACGCC ATCGAGATCG ACCAGACCCG CGACGGCGTG
CGGCTGACGG GCTTCGCGGG CCTGCCGACC TACAACCGCG GCAACGCCGC CCACCAGTAC
CTGTTCGTCA ACGGCCGGCC GGTACGCGAC CGGCTGCTGC AAGGCGCCCT GCGCGCCGCC
TATGCCGATT TCCTGGCCCG CGACCGCCAT CCGACGGCGG CCCTCTATAT CTCGCTCGAC
ACCTCGGAAG TGGACGTCAA CGTCCACCCG GCCAAGGCCG AGGTGCGGTT CCGCGACCCG
GCCCTGGTGC GCGGCCTGAT CGTCGGCGCC CTGCGCCACG CCCTGGCCGG GGCCGGCCAC
CGCGCCTCGA CCACCACGGC GGCGAGCGCG CTGGACGCCA TCCGGGCGCA GAGCATGGTC
CCACCCGGCG CGTACAGCGG CTACCAGCCC AATGCTTACC AAGGGGGCCC CTCCCCCGCC
GGCTTCTCGG CCTGGCAGGC GGGCGGCTGG ACGCGGCCTT CGCCCCAGGT GCTGCCGGGC
CTGTCGGACG TCTCGGCGCG GGTCGAACCC GGCGGCTACG GCGTGGCCGA GGCGGTGCGC
GAGGCGGCGT TCGGCGAGTA CGACGCGCCC CCCAATGTGG CCTATCCCGG CGGCTTCAGC
GAAGACCCCG CCCCGGTCTT CGACCCCGTC GACTTCCCGC TCGGCGCGGC CCGCGCCCAG
GTGCACGAGA CCTATATCGT CGCCCAGACC CGCGACGGCG TGGTCATCGT CGACCAGCAC
GCCGCCCACG AGCGCCTGGT CTATGAGCGG ATGAAGGGCG AGATGGCGGC CGGCCGCGTG
GCCCGCCAGG CCCTGCTGCT GCCCGAGGTG GTCGAGCTGG ACCCCGCCGA GGCCGAGCGC
GTCGTCGCCC GCGCCGAGGA ACTGGCGGCC CTGGGCCTGG TCATCGAAAG CTTCGGCCCC
GGCGCGGTGC TGGTGCGCGA GACCCCGGCC CTGCTGGGCA AGACCGACGC CGCCGGCCTG
GTCCGCGACA TCGCCGACGA CCTGGCCGAG AACGGCCAGG CCCTGGCCCT GAAGGAGCGG
CTGGAAGAGG TCTGCTCCAC CATGGCCTGC CACGGGAGTG TGAGGGCGGG GCGGCGGCTG
ACCGGGGCGG AGATGAACGC CCTGCTGCGC GAGATGGAGG CGACGCCGCA CTCCGGCCAG
TGCAACCACG GGCGGCCGAC TTATGTGGAG CTGAAGTTGG CGGATATTGA GAGGTTGTTT
GGGAGGCGGT AG
 
Protein sequence
MPIRRLPPET VNRIAAGEVV ERPASAIKEL VDNAIDAGAT RIEVEAHGGG LTRILVADDG 
CGLSAEELPV ALERHATSKL APDAQGLWDL LAIHTMGFRG EALPSIGSVA RLSISSRAKG
SSDAWSILVE GGSVGDVAPA AFAGDHGARI EVRDLFYATP ARLKFMKSER AEALAITEEI
KRQAMANESV GFSLDIDGRR IIRLPPEHPG PQGRLARLSA VLGRDFQDNA IEIDQTRDGV
RLTGFAGLPT YNRGNAAHQY LFVNGRPVRD RLLQGALRAA YADFLARDRH PTAALYISLD
TSEVDVNVHP AKAEVRFRDP ALVRGLIVGA LRHALAGAGH RASTTTAASA LDAIRAQSMV
PPGAYSGYQP NAYQGGPSPA GFSAWQAGGW TRPSPQVLPG LSDVSARVEP GGYGVAEAVR
EAAFGEYDAP PNVAYPGGFS EDPAPVFDPV DFPLGAARAQ VHETYIVAQT RDGVVIVDQH
AAHERLVYER MKGEMAAGRV ARQALLLPEV VELDPAEAER VVARAEELAA LGLVIESFGP
GAVLVRETPA LLGKTDAAGL VRDIADDLAE NGQALALKER LEEVCSTMAC HGSVRAGRRL
TGAEMNALLR EMEATPHSGQ CNHGRPTYVE LKLADIERLF GRR
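
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how one might strip the layout and compute a length and GC percentage for comparison with the Gene Length and GC content fields. The helper names are hypothetical, and the sample string joins only the first and last lines of the gene sequence for brevity, so it is not a real contiguous sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last lines of the gene sequence, used only as a short sample.
sample = """ATGCCGATCC GCCGCCTGCC GCCCGAGACC GTCAACCGCA TCGCCGCCGG CGAGGTGGTG
GGGAGGCGGT AG"""

dna = clean_sequence(sample)
print(len(dna))               # 72
print(dna.startswith("ATG"))  # True (start codon)
print(dna.endswith("TAG"))    # True (stop codon)
print(round(gc_percent(dna), 1))
```

Pasting the full gene sequence into `sample` would let the same two functions be checked against the 1932 bp and 73% values listed under Gene Information.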