Gene Caul_0008 details

Gene Information

Locus tag: Caul_0008
Symbol:
ID: 5897720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 10642
End bp: 13356
Gene Length: 2715 bp
Protein Length: 904 aa
Translation table: 11
GC content: 72%
IMG OID: 641560491
Product: DNA mismatch repair protein MutS
Protein accession: YP_001681644
Protein GI: 167643981
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
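
The coordinate and length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and a gene length that includes one terminal stop codon; neither convention is stated in the record, although both match the reported values. The function and constant names are illustrative.

```python
# Values copied from the Gene Information fields above.
START_BP = 10642
END_BP = 13356
GENE_LENGTH_BP = 2715
PROTEIN_LENGTH_AA = 904


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2715
print(expected_protein_length(GENE_LENGTH_BP))  # 904
```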


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCC CCGCCTCCCC CGCCGTCCAG ACCTATGACG CCACCGGCGC GACGCCGGTG 
ATGCAGCAGT TCTTCGAGAT GAAGGCGCGG CACCCCGACG CCCTGATCTT CTTCCGGATG
GGCGATTTCT ACGAGCTGTT CTTCGACGAC GCCTACAAGG CCGCCGCGGC CCTGGGCATC
TCGCAAACCT TCCGGGGAAC CCACAACGGC CAGCCGATCC CGATGGCCGG CGTGCCGCAG
CACGCGGCGG AAGCCTATCT CTCCAAGCTG ATCCGCCTGG GCTTCAAGGT CGCCGTCTGC
GAGCAGATGG AAGACCCGGC CGAGGCCAGG AAGCGCGGCT CCAAGTCGGT GGTCCGCCGC
GACATCGTCC GGGTGGTCAC CCCCGGCACC CTGACCGAGG ACGGCCTGCT GGACGCCCGG
GGCGCCAACC GCCTGGCCGC CGTGGCCATC CGGGCCGGCC AGGCGGCCGT GGCGGCCGTC
GAGCTGTCCA CCGGCGAGGT CGAGTGCTTC CTGGTCGCCA AGGACGCCGT GGGGGCCATA
CTGGCCGCCC TGGCCCCGTC CGAGACCCTG GTCGCCGACC GCCTGCTCTC GGATGACTTG
CTGGCCCAGA CCTTGAAGAT CTGCGGCGGC CTGATCCAGC CCATGCCCTC GGCCCTGTCC
GAGCCCCAGG CGTCGGAGAC CCGGGTCAAA CGGCTCTACG GCGTCGACAC CCTGGACGGG
TTCGGCGGGC TTTCGGCGGC CGAGATCGGA GCTCTTGGCC TGATCGCCGC CCATCTGGAG
ATGACCCAGG CCGGCAAGCT GCCCGCCCTG CGCGCGCCGC GCCGCGCGGC CGAGGCCGAC
GTGATGTCGA TCGATCCGGC GACGCGGTCG AGCCTGGAGA TCGATCGCGC CCAAAATGGC
GACCGAAGCG GCTCGCTCCT GGCCGCCATC GACCGCACCG TCACGGCCGG CGGCGCGCGG
ATGCTGGCCT CGCGGCTGGC CCGTCCGCTG CTCGACCCCC ACGCCATCGA CGCCCGCCTG
GACGCCGTCG AGTGGTTCGT CGATCATCGC GGCTTGCGCG AACGGCTGCG CGAGGTGCTC
AAGGGAGCCG GCGACATGGC CCGCGCCCTG TCCCGCCTGG CCCTGGGCCG AGGCGGTCCG
CGCGACCTGG GCTGCCTCAA GGACGGCCTG AAGACGGGCG AGAAGCTGGC CGGCATGGTC
GGCGGCTCGG GCGACCCGCT GTCGCCGCCG CCCGCCCAGC TGGAGGGCGC GCTGAAGGCC
CTGACCCCGT CGCTGCAGGA AGGCCTGTCG CGCCTGCTGG CCCAGCTGGA GACGGGCCTT
GGTCCCGACC TGCCGGCCCT GGCCCGCGAT GGCGGCTATG TGGCGGCCGG GGTGCGCCCC
GAGCTCGACC AGGCCCGCGC CCTGCGCGAC GACAGCCGCC GGGTGGTCGC CGCCCTGGAA
AGCCGCCTGA TCCAGGAAAG CGGCGTGCCG CTGAAGATCC GCCACAACGG CGTGCTCGGC
TATTTCGTCG AGGCCACGGC CGGCAAGGCC GACCCGCTGT TTCAGCCGCC GCTCAACGCG
ACCTTCATCC ACCGCCAGAC CCTGGCCAAC CAGGTGCGGT TCACCACCGT CGAGCTGGCC
GACCTGGACG CCCGCATCGC CCAGGCCGCC GAGCGGGCCC TGGCCATGGA AGTCGCGGCC
TTCGAAGACT GGCGCGCCGA GGCCGTGGCC TTGGCCGAGC CGATTCAGCT GGCCGCCGAG
GCCCTGGCCA AGCTCGATGT CGCCGCCGCC CTGGCGGAAT GGGCCGAAGA CGCCGGCGCG
GTGCGCCCGA GCGTCGACAA GTCCCTGGCC TTCGAGGCCC GCGCCGCCCG CCATCCGGTG
GTCGAGGCCG CCGTCAAGCG GGCCGGCGAT CCCTACACCC CCAACGACTG CTGTCTCGAC
GCCGCCGGCG AGCGCGGCGC GCGGCTGTCG ATCGTCACCG GCCCGAACAT GGCGGGCAAG
TCGACCTTCC TGCGCCAGAA CGCGATCCTG GCGATCCTGG CCCAGTCGGG CTGCTACGTG
CCGGCCAAGA GCTTGCGCCT GGGTGTCATC GACCGGCTGT TCAGCCGGGT CGGGGCCGGC
GACGACCTAG CCCGGGGGCG CTCGACCTTC ATGATGGAGA TGGTCGAGAC CGCCGCCATC
CTGACCCAGG CCAGCCCGCG CAGTCTGGTG ATCCTGGACG AGATCGGCCG GGGCACGGCC
ACCTATGACG GCCTGGCCAT CGCCTGGGCC TGCGCCGAGG CCCTGCACGA CACCAACCGC
TGTCGCGCCC TGTTCGCCAC CCACTATCAC GAGCTGGCCA CGCTGGAGAC GCGCCTAGCC
CACGTCTCCA ACCTGTCCCT GCGGGCCAAG GAGTGGAACG GCGACCTGGT CTTCCTGCAC
GAGGCCGCCG CCGGACCCGC CGACCGCTCC TATGGCGTGC AGGTGGCCAA GCTGGCCGGG
GTGCCCCCGG CGGTGGTCGC CCGCGCCAAG GAAGTGCTCG ACCGCCTGGA GAGCAAGACC
GAATCACCGG CCCGCCTCGA CGATCTGCCG CTGTTCGCCA GCCACGCCCC GGGTCCCCTC
AATCAGTTTG GGGCGCCTGT TCAAGCGGCG CCCAGCCGCA CCGACGCGGC GCTGGGGGAC
CTGGATGTCG ACGGCATGAG CCCGCGCGAG GCGCTGGACG CGCTTTATCG CCTTAAGGCT
CTTCTCAAGA CCTGA
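
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the space-separated 10-base layout used above. `gc_fraction` is an illustrative helper, and the sample string is only the first line of the gene sequence, so its value should not be expected to equal the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above; paste all lines to check the full gene.
sample = "ATGAACGCCC CCGCCTCCCC CGCCGTCCAG ACCTATGACG CCACCGGCGC GACGCCGGTG"
print(f"{gc_fraction(sample):.0%}")
```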
 
Protein sequence
MNAPASPAVQ TYDATGATPV MQQFFEMKAR HPDALIFFRM GDFYELFFDD AYKAAAALGI 
SQTFRGTHNG QPIPMAGVPQ HAAEAYLSKL IRLGFKVAVC EQMEDPAEAR KRGSKSVVRR
DIVRVVTPGT LTEDGLLDAR GANRLAAVAI RAGQAAVAAV ELSTGEVECF LVAKDAVGAI
LAALAPSETL VADRLLSDDL LAQTLKICGG LIQPMPSALS EPQASETRVK RLYGVDTLDG
FGGLSAAEIG ALGLIAAHLE MTQAGKLPAL RAPRRAAEAD VMSIDPATRS SLEIDRAQNG
DRSGSLLAAI DRTVTAGGAR MLASRLARPL LDPHAIDARL DAVEWFVDHR GLRERLREVL
KGAGDMARAL SRLALGRGGP RDLGCLKDGL KTGEKLAGMV GGSGDPLSPP PAQLEGALKA
LTPSLQEGLS RLLAQLETGL GPDLPALARD GGYVAAGVRP ELDQARALRD DSRRVVAALE
SRLIQESGVP LKIRHNGVLG YFVEATAGKA DPLFQPPLNA TFIHRQTLAN QVRFTTVELA
DLDARIAQAA ERALAMEVAA FEDWRAEAVA LAEPIQLAAE ALAKLDVAAA LAEWAEDAGA
VRPSVDKSLA FEARAARHPV VEAAVKRAGD PYTPNDCCLD AAGERGARLS IVTGPNMAGK
STFLRQNAIL AILAQSGCYV PAKSLRLGVI DRLFSRVGAG DDLARGRSTF MMEMVETAAI
LTQASPRSLV ILDEIGRGTA TYDGLAIAWA CAEALHDTNR CRALFATHYH ELATLETRLA
HVSNLSLRAK EWNGDLVFLH EAAAGPADRS YGVQVAKLAG VPPAVVARAK EVLDRLESKT
ESPARLDDLP LFASHAPGPL NQFGAPVQAA PSRTDAALGD LDVDGMSPRE ALDALYRLKA
LLKT
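
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted, something this sketch does not model). `translate` is an illustrative helper, and only the first 30 and final 15 bases of the gene sequence are used as input.

```python
from itertools import product

# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First 30 bases and final 15 bases of the gene sequence above.
print(translate("ATGAACGCCC CCGCCTCCCC CGCCGTCCAG"))  # MNAPASPAVQ
print(translate("CTTCTCAAGA CCTGA"))                  # LLKT*
```

Both results match the start (MNAPASPAVQ) and end (LLKT) of the protein sequence listed above, with the trailing `*` corresponding to the TGA stop codon.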