Gene Coch_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCoch_2031 
Symbol 
ID8368492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCapnocytophaga ochracea DSM 7271 
KingdomBacteria 
Replicon accessionNC_013162 
Strand
Start bp2431741 
End bp2433912 
Gene Length2172 bp 
Protein Length723 aa 
Translation table11 
GC content43% 
IMG OID644984485 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003142136 
Protein GI256820857 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA AAACACTTGA AGACTTAGAG TTCCCCATTG TACTCTCTCA TTTGTCTGAC 
CTTTGCCTTA CTGAACTCGG CAAGAAATAC GCCTTACGTA TCAAGCCTTT CGACAATCAG
GAGACGCTGT TATTGGCACT GAATCAAACC AATGAATACC TATCGTCTTT TGATAATAAC
AACACAATCC CTTCACACTA TTGCGAATCT ATCACTTCTG AAATTAAACT TCTATCTATT
GAAAACGCCT TGTTAGAAGT GTCGAGCATT AGGAAGATTC ATCGTATCAG CGAGATTGTA
AACACTCAAA TTCTCTTCTT TAAGAAGTTC AAAACGCTAT ACCCTACCCT TTTCGAAACG
GCTGATAGTA TCGAGTATAC TACCGAACTG CTGAACGCTA TCGATAAAGT GTTGGACAAG
TACGGCGAGA TAAAAAACGA AGCCTCTCCT ACGCTTGGCG ATATACGTCG CGAGCTGAGC
GCCTTGAAAG GAAAACTCAA CGAAAGTTTC AACCGTGCTC TCGCCGAGTA TAACACTGTC
GACTATCTCG ATGATATCCG CGAGACCGTA GTAGAAAACC GCCGCGTATT GGCAGTGAAA
GCAATGTACC GCCGAAAAGT ACAAGGCACC GTATGGGGGA GCTCAAAAAC GGGTAGCATA
GTGTATATAG AACCGCGACA AACCGAAATC TACTCTCGCG AACTCTCCAA TCTGCTTTAT
GACGAAAAAG AGGAAATCCA GCGTATATTA AGAGAACTTA CCGCCTTCAT CAGTCAGTTT
GCCGACCTGC TGAAAGACTA TCAGCGATAC ATCACAGCCG TTGATATTAT TTGCGCCAAA
GCCAAGTACG CTCACCAAAT GAACGCACTT TTGCCAGAAA TCACTCAGGA GCGCGAACTA
TTCCTCCGTG AAGCATATCA CCCTTTGCTG TATCTAAACA ATGCCAAAAA AGGTGTTACT
ACCTTTCCTC AAACCATAGA ACTGAATGAT GAAAATCGTA TTATCGTCAT CTCGGGACCT
AATGCGGGCG GAAAGAGTAT TACGCTTAAA ACCATTGGCT TATTGCAATT AATGTTACAA
AGTGGAATGC TCGTTCCGGT GCATCATCGT TCTAAAATGT GTTTGTTTGA ACGCATTCTT
ACCGATATCG GCGATAACCA ATCTATTGAA AACCACCTTA GCACATACAG CTACCGACTC
AAAAATATGA ATTACTTCCT CAAAAAATGC AACCACCGCA CCTTGTTTTT GATAGATGAG
TTCGGGACAG GTAGCGACCC CGAATTAGGA GGTGCTTTGG CTGAAATATT CTTAGAAGAG
TTCTACCACC GAAAGGCTTT TGGGGTGATT ACCACCCATT ACACCAATTT AAAAATGCTG
GCAGATGAAT TGCCCCACGC GAGCAATGCC AATATGCTTT TCAACGATAA AACCTTAGAG
CCCATCTATA AGCTAATAAT AGGCGAAGCG GGAAGTTCTT TCACTTTTGA AGTGGCGCAG
AAAAACGGCA TTCCGTTTAG CCTTATCAAT CGGGCTAAGA AAAAAATAGA GAAAGGAAAA
GTGCGTTTTG ATGCCACTAT CGCCAAATTG CAGAAAGAGC GCTCCAAAAT GGAAAAAACT
GCCGAAACCC TGAAAGATGA GGAAACCAAA GCCCGCGAAG AAGCCAAACG TTTGGAGGAA
CTTAATGATA AAGTAAAAAG CAAGCTCATA AATTATCAAG AACTCTATGA CCAAAGCCAA
CGTATGATTA CCTTAGGTAG TAAGGTAGAC CAGATAGCCG AACGTTATTT CTACGATGGT
AAGCGTCGCC CTTTGGTATC CGAATTTCTC AAACTCATTG AAATGGAAAA TGCTAAGCGC
AAACAAATAA GCAAAGAGGA ACGCACCAAG CAAAAAGAGG AGAAAAAAAC TACTACCGAA
GAAGTTCATA AGCAAATGGA AGCTATCCGC CAGCAACGCA AAGAAGAGAA GAAAGAGCGT
ATTGCCCAAG AACGCGCCGA AAAGGAAAAG CTACAACGCG TCCTTTCAGT AGGCGACCGT
GTGCGCATTA AAGACAGCCG AAGCGTGGGA AGTATCGATA AGATTGAAAA AGGCAAAGCC
ATAGTAAATT ACGGGGCGTT TACTACCTCT GTTTCTTTGG ACGAATTGGA GCTCGTACAG
AAAATCCGAT AA
 
Protein sequence
MNTKTLEDLE FPIVLSHLSD LCLTELGKKY ALRIKPFDNQ ETLLLALNQT NEYLSSFDNN 
NTIPSHYCES ITSEIKLLSI ENALLEVSSI RKIHRISEIV NTQILFFKKF KTLYPTLFET
ADSIEYTTEL LNAIDKVLDK YGEIKNEASP TLGDIRRELS ALKGKLNESF NRALAEYNTV
DYLDDIRETV VENRRVLAVK AMYRRKVQGT VWGSSKTGSI VYIEPRQTEI YSRELSNLLY
DEKEEIQRIL RELTAFISQF ADLLKDYQRY ITAVDIICAK AKYAHQMNAL LPEITQEREL
FLREAYHPLL YLNNAKKGVT TFPQTIELND ENRIIVISGP NAGGKSITLK TIGLLQLMLQ
SGMLVPVHHR SKMCLFERIL TDIGDNQSIE NHLSTYSYRL KNMNYFLKKC NHRTLFLIDE
FGTGSDPELG GALAEIFLEE FYHRKAFGVI TTHYTNLKML ADELPHASNA NMLFNDKTLE
PIYKLIIGEA GSSFTFEVAQ KNGIPFSLIN RAKKKIEKGK VRFDATIAKL QKERSKMEKT
AETLKDEETK AREEAKRLEE LNDKVKSKLI NYQELYDQSQ RMITLGSKVD QIAERYFYDG
KRRPLVSEFL KLIEMENAKR KQISKEERTK QKEEKKTTTE EVHKQMEAIR QQRKEEKKER
IAQERAEKEK LQRVLSVGDR VRIKDSRSVG SIDKIEKGKA IVNYGAFTTS VSLDELELVQ
KIR