Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_2780 |
Symbol | |
ID | 8358941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 3421331 |
End bp | 3423448 |
Gene Length | 2118 bp |
Protein Length | 705 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644964960 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003122460 |
Protein GI | 256421807 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00383732 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATATT TTCCCGATTC AGCACTGGTA CAGCTGGAAT TTGACAAGGT ACAGGCTTTG TTAAAAGAGC ATTGTAAAAC GGAGCTGGGC AAACAAATGG CAGATGAACT GCGTTTGCAC ACCCACATTG ATTATGTAAA AACGGCCCTG CAACAGGCCC ATGAATACAA GCAGCTGACT TTGCTGCAGG AACACTTTCC CAACGATTAT GTACTGAACC TGAAAAATGA ACTGCGTTTG CTGAGCATCC AGGGTGCTGT ACTGAGCGAA GAACAGATTA TGCAGTTGCG TAAGCTGGCG GAAAGTATGC ACAGCATTGT ACGTTTCTTT GATCACGACA GGCGTGTCAC TTATGCAGGA CTGTTCAGCG TGATTACTCA TACCCACTAT GAGAAGAAAA TCACCGCGCT CATCGACGAA GTACTTGATG AGCATGGAGA AGTACGCGAC AACGCCACCC CTGAACTGGC TAAAATCCGT ATGAACCTGT TCCGTAAAAG GAACGAACTA CGCCGCGCAT TTGAGCGCTG TCTTAGTAAA CTTAGTAAAC TTGGCTACCT CGCCGATACG GAAGAAGCCT TCCTGAACGG AAGAAGGGTA GTGGCGATCT ACGCCGAAAA CAAACGTATG GTGAAAGGTA TCCTCCACGG GGAGTCCGAT ACCCGCCGTA CTGCTTTCGT CGAACCGGAA GAAACGATAG AACTGAACAA TGACGTGATG TCCCTCGAAA GGGAAGAAAG TAAAGAGGTG TACAAAATAC TGCGTACACT GACCCACCAG CTCAGTCAGC ACAGTCACCT GCTCAATAAC TATCACGAGA TACTCGGACA ATACGACTTT ATCAGGGCAA AAGCCAAACT GGCGCTGGAC ATGGACGGTA ACTACCCCAT GCTGACGCCG CACGCGGAAG CGCATCTCGT AGACGCGCAC CACCCGCTGC TGCTCCTGTA TAACCGCCGT AACAAGAAAC CGACCATCCC GGTGAACCTG ACCCTGAACA AGGATAACCA TATCCTGGTG ATCAGCGGTC CGAATGCCGG CGGTAAAACC GTGACCATGA AAACGGTCGG CCTGATCCAG CTCATGCTGC AGGCAGGCCT GCTGGTGCCT GTACACCCCA GCTCACAACT CGGTATTTTC CGTCAGCTGA TGATCCATAT CGGTGATACT CAGTCCCTGG AATTTGAACT GAGTACTTAT AGTTCTCACC TGAAGAACAT GAAGTACTTC ATGGAGAACG CCAATGGTAA GACCATGTTC TTCATCGATG AATTAGGTAG CGGTTCTGAC CCGAATCTCG GGGGCGCCTT CGCAGAGGTG ATCATGGAAG AAATGGCCAA GAAACACGCT TTTGGTATCG TTACCACCCA CTACCTCAAC CTGAAGGTAA TGGCCAATAA GGTACGTGGT ATCATCAACG GCGCTATGGG CTTCGACGAG GAAAACCTGT CGCCTATGTA CAAACTGATC GTAGGTAAAC CAGGTAGCTC GTATACGTTC TCTATCGCGC AGCGTATCGG TCTGCAGCCT GCCCTCATTG CCCGGGCCAA AAACCTGGTG GACGAAGGGC ACTTCCAGCT GGATAAACTG CTCAACAAAG CAGAACAGGA TCTGCAGAAA GTAGAAGCCA AGGAGAAAGA ACTGCAGAAA CTCCTGAAGG AGAATGAAAA GCTGAAACGT GAATACGAGA TCCTGTCCGA TAAGGAAAGG AAAACCCAGC AATTCACTAC CCTGAAACTG CAAAACCAGA TCAAGGAAGA AGAACTCCAG TACCTCAAGG ATATGGAGCG CAAGCTGAAA CAGATTGTTT TTGAGTGGAA ACGTACAGAT GACAAACAGA AAGTCATGAA ACAGGCAGAA GCCCTGTTAT TCCATAAACG CCAGAAACAG ATCAACGAAC GCTTCGAAAA GAAAATTCAG GATAAATTCC TGGAGGTAGT AGGAGCCGAC CTGAAACTGG GCGACCAGGT AAAGATCAAG AGCAATGGCC AGGTAGGAAA GCTGATGGAA GTCCGCGACA AACGCGGTAT CGTAAAACTC GGCAATATTC CGATGAACGT GGCACTGTCT GACCTGGTAC TGGTGAAAGA GCGGCCGAAA GAAGAAACGG CTAAGTAG
|
Protein sequence | MKYFPDSALV QLEFDKVQAL LKEHCKTELG KQMADELRLH THIDYVKTAL QQAHEYKQLT LLQEHFPNDY VLNLKNELRL LSIQGAVLSE EQIMQLRKLA ESMHSIVRFF DHDRRVTYAG LFSVITHTHY EKKITALIDE VLDEHGEVRD NATPELAKIR MNLFRKRNEL RRAFERCLSK LSKLGYLADT EEAFLNGRRV VAIYAENKRM VKGILHGESD TRRTAFVEPE ETIELNNDVM SLEREESKEV YKILRTLTHQ LSQHSHLLNN YHEILGQYDF IRAKAKLALD MDGNYPMLTP HAEAHLVDAH HPLLLLYNRR NKKPTIPVNL TLNKDNHILV ISGPNAGGKT VTMKTVGLIQ LMLQAGLLVP VHPSSQLGIF RQLMIHIGDT QSLEFELSTY SSHLKNMKYF MENANGKTMF FIDELGSGSD PNLGGAFAEV IMEEMAKKHA FGIVTTHYLN LKVMANKVRG IINGAMGFDE ENLSPMYKLI VGKPGSSYTF SIAQRIGLQP ALIARAKNLV DEGHFQLDKL LNKAEQDLQK VEAKEKELQK LLKENEKLKR EYEILSDKER KTQQFTTLKL QNQIKEEELQ YLKDMERKLK QIVFEWKRTD DKQKVMKQAE ALLFHKRQKQ INERFEKKIQ DKFLEVVGAD LKLGDQVKIK SNGQVGKLME VRDKRGIVKL GNIPMNVALS DLVLVKERPK EETAK
|
| |