Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_3599 |
Symbol | |
ID | 5453736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 3848991 |
End bp | 3851717 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640879183 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001414854 |
Protein GI | 154254030 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAGC CCGCCACCTC CTTCACTCCC GAAATCCCGT CTGCCCTGCC GTCCATCCCG GCAGATGCCA CACCCATGAT GGCGCAATAT CTGGAGATCA AGGCGCGCTG GCCCGAAGCC CTCCTCTTCT ACCGGATGGG CGATTTCTAC GAGCTCTTCT TCGAGGATGC CGTAGCGGCA TCCGCCGCCC TCGACATCGC GCTGACAAAG CGCGGCAAGC ATCTGGGCGA GGACATCCCC ATGTGCGGCG TCCCCGTCCA CAGCCATGAT GCCTATCTCC AGCGTCTCAT CCGCAAGGGC TTCAAGGTCG CCGTCTGCGA GCAGGTCGAG GACCCGGCGG AGGCGAAGAA GCGCGGCGCC AAATCCGTCG TCGCCCGCGC CGTCGCCCGC CTCGTCACGC CCGGCACCCT GACCGAGGAC ACGCTGCTCG ATGCGCGTGC CCATAATTAT CTGGCCGCCT TGTCCCGCAC CGGCGCGGAG GCGGGCTTCG GCCTCGCCTG GGTCGATGTC TCGACCGGCG ATTTCGCCGT CACCTCGCTT GCCCCCGTGG CGCTCGGCGC CGAGCTTGCG CGTCTCTCCC CCGGCGAGCT GCTGCTGCCG GAAACGCTGG ACGAGGACGA AGACCTCGCC GCCCTCCTCG CGCAATCGGG CGCGGCGCTG ACGCGGCTTC CCGCCATCCG CTTCGAGAGC GGCCAGGCCG AGCGCCGCCT GAAGTCCCAT CTCGGCGTCT CCGCGCTTGA TGGGTTCGGT GCCTTCGCCC GCGCCGAACT CGGCGCCATG GGCGCGCTTC TCGATTATGT CGAGCTGACG CAGGTCGGCC GTATGCCGGC CCTCATGCCG CCGCGCCGCG TCGCCGCCAC CGACACCATG GCGATCGACG CCGCCACCCG CGCCAATCTC GAACTGGTGA GGACTCTGCA GGGCGAGACC GCCGGCTCCC TCCTCGCCAC CATGGACAGG ACGGTGACGG GTGCCGGCGC GCGCGAGCTT GCCTCCCGTC TCGCCGCCCC TTTGACCGAT CCCGCCGCGG TCAACCGCCG CCTCGATGCG GTTGAGTGGT TCCATGATGC GCGCGACATG CGCGCCCGCC TCCGCGCCGG TCTCAAATCC GCGCCCGACA TCGCGCGCGC CCTCTCCCGC CTCTCGCTCG GCCGTGGCGG CCCGCGCGAT CTCGCCGCCA TCGCCAATGG TCTCGCTGCC GCGCATGGCC TCTGCGCCGC GCTTGATGGC GCTTCGCCCT CGCTCCTGCC GTTGCCGGAA GAAATCGCGC GCGAATGCGA GGCGATGCGC GGCACCGCCG CGTCCCTCCA GTCCCGCCTC GCCGCCATGC TGGCCGAAGA ACTGCCGCTC CTGGCCCGCG ACGGCGGCTT CATCGCAAGG GGCGCGAGCC CCGAACTCGA CGAGACCCGC GCGCTGCGCG ACGACGCGCG CAAATTGATC GCCGGGCTGC AGGCGAAATA TGCGGGCGAA AGCGGCATCG CCGCCCTGAA GATCCGCCAC AACAATGTGC TCGGCTATTA CATCGAGGTG CCGCCCCGCC ACGGCGAAAA GCTCCTCGCG CCGCCTTTCT CCGACAGCTA CATCCACCGT CAGACCATGG CGAATGCGAT GCGCTTCACC ACGGCGGAGC TTGCGGGCCT CGCCAGCCGA ATCGCGGAAG CCGCCGGCCG CGCCCTCGAA ATCGAACTCG CCCTCTTCGA CGAACTCGCC GCCGCGACGC TCCTTGAGGC GCCGTCTCTC TCCCGCGCCG CCGAGGCGCT TGCCCGTCTC GATGCGACGG CGGCGCTGGC CGAGCTTGCC GCCGAGCGGC GCTATGTCCG CCCCCGCCTC GATGCGAGCT TCGCCTTCGA TATTCGCGGC GGCCGCCATC CGGTGGTCGA GGCCGCACTC GCCCGCACAG GCCAGGCCTT CGTGCCGAAC GATACCAGCC TCTCGGCGGA GAGCGGCGGC GGCAAGCATA TCTGGCTGCT CACCGGCCCC AACATGGCGG GTAAATCGAC CTTCCTCCGC CAGAACGCGC TCATCGCCAT CATGGCGCAG ATGGGCTCCT TCGTGCCGGC GGACGAGGCG CATATCGGCG TTGTCGACCG CCTCTTCTCC CGCGTCGGCG CGGCGGACGA TCTCGCGCGC GGCCGCTCCA CCTTCATGGT CGAAATGGTC GAGACGGCGG CGATCCTCAA TCAGGCGGGC GAGCGTTCGC TGGTAATCCT CGATGAGATC GGCCGCGGCA CCGCGACTTT CGATGGTCTC TCCATCGCCT GGGCAACCGT CGAACATCTC CACGGCGTCA ACAAGTCGCG CGCCCTCTTC GCCACGCACT ACCACGAATT GACCGCGCTA TCGGAAAAGC TCGCCCATCT CGCCAACGCC ACCATGCGCG TCAAGGAATG GCAGGGCGAT GTCGTCTTCC TGCATGAAGT CGCGCCCGGC GCGGCGGATC GTTCTTATGG CATCCAGGTC GCAAAACTCG CGGGCCTCCC CGCGCCCGTC ATCGCCCGCG CCCAATCCGT CCTCGCCGCA TTGGAAGAAG GCGGCAACCA CGAAGCCCGC ACGAAACTGA TCGACGACCT GCCCCTCTTT TCCGCCACCG CAAAACCCGC CCCGGCGGTA AAGGTGAGCG CCGCCGAAGA GGAGTTGAAA AACCTCAACC CCGACGAACT CTCGCCGAAG CAAGCACTGG AACTTCTCTA CAAGCTGAAG GCGCTGGCGG CGAAGGACGG CGAATAA
|
Protein sequence | MSEPATSFTP EIPSALPSIP ADATPMMAQY LEIKARWPEA LLFYRMGDFY ELFFEDAVAA SAALDIALTK RGKHLGEDIP MCGVPVHSHD AYLQRLIRKG FKVAVCEQVE DPAEAKKRGA KSVVARAVAR LVTPGTLTED TLLDARAHNY LAALSRTGAE AGFGLAWVDV STGDFAVTSL APVALGAELA RLSPGELLLP ETLDEDEDLA ALLAQSGAAL TRLPAIRFES GQAERRLKSH LGVSALDGFG AFARAELGAM GALLDYVELT QVGRMPALMP PRRVAATDTM AIDAATRANL ELVRTLQGET AGSLLATMDR TVTGAGAREL ASRLAAPLTD PAAVNRRLDA VEWFHDARDM RARLRAGLKS APDIARALSR LSLGRGGPRD LAAIANGLAA AHGLCAALDG ASPSLLPLPE EIARECEAMR GTAASLQSRL AAMLAEELPL LARDGGFIAR GASPELDETR ALRDDARKLI AGLQAKYAGE SGIAALKIRH NNVLGYYIEV PPRHGEKLLA PPFSDSYIHR QTMANAMRFT TAELAGLASR IAEAAGRALE IELALFDELA AATLLEAPSL SRAAEALARL DATAALAELA AERRYVRPRL DASFAFDIRG GRHPVVEAAL ARTGQAFVPN DTSLSAESGG GKHIWLLTGP NMAGKSTFLR QNALIAIMAQ MGSFVPADEA HIGVVDRLFS RVGAADDLAR GRSTFMVEMV ETAAILNQAG ERSLVILDEI GRGTATFDGL SIAWATVEHL HGVNKSRALF ATHYHELTAL SEKLAHLANA TMRVKEWQGD VVFLHEVAPG AADRSYGIQV AKLAGLPAPV IARAQSVLAA LEEGGNHEAR TKLIDDLPLF SATAKPAPAV KVSAAEEELK NLNPDELSPK QALELLYKLK ALAAKDGE
|
| |