Gene Information |
Locus tag | Acid345_4299 |
Symbol | |
ID | 4071872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5109048 |
End bp | 5111486 |
Gene Length | 2439 bp |
Protein Length | 812 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986332 |
Product | MutS2 family protein |
Protein accession | YP_593373 |
Protein GI | 94971325 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
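
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that matches the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for locus tag Acid345_4299.
length = gene_length_bp(5109048, 5111486)
print(length)                     # 2439
print(protein_length_aa(length))  # 812
```

Both results agree with the Gene Length (2439 bp) and Protein Length (812 aa) rows of the record.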
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCTC GGCCACTCAC CCACTCCAGT GCCCCGGTCC TGGAATTCGA AGCGTTTCGC GAGTTGTTGC GCGGCTATGC GCAATCGGAA CTCGGGTCGG CACGCGTCCG CGAACTGGCG CCGTCGGCCG ATCGGGAGTG GATCGAGCGA GAACAGCAAC TCGCATCCGA GATTCGAGGG TACATTCGTG CTGCCGGGCG CTTCGACTTC GTCGGACTGA CCGATGCGAC CAAGTTGATT CAGAAAGCCC GCATCCGTGG TGCCGCACTG GAGATGGACG AAATTCGCAC CATCCTGCTA CTCGCGGAAC GCGCAGCAGA GTGGCGCGAG ATCTTGATTT CGCCTCCCGT GATGCGCGAG CCTTGGAAGG CCGTGGAAGA TCTCTCGAGT TCGCTCGCGG ATTTTCGAGA ATTCCTGCGC TACTTCAGCA ACAAATTACT TCCGGACGGA TCGCTCGACG ATCGCGCCTC AAGCGAGTTG CATCGTATTC GGCGCGAGAT CGAGCGGCAG AAGCGACACA TCCAGAGCTC GTTGCAATCG TTCCTGCGGA AGTTGTCCGA TGAAGGCACG GCGCAGGAAG AGTTGATCAC GATTCGCGGC GACCGCTTCG TTATCCCGGT AAAGGCGGAG CAGAAGCGAC GTGTGAATGG CGTGGTACAC GGCGCCAGTT CAAGCGGCCA GACGGTGTTC GTGGAGCCCA TGGAGACGAT CGAGCAAAAC AACGACCTAG TGCGGTTGCT GGAAGAAGAG CAGGAAGAGA TTCGGCGCAT TCTTGCCGAG ATGTCGCGAC GAATCGGGGA GCAGTCGGAA AACCTGCTCT TTGCGCTCTA CGTACTGGCT GAGTTGGAGC TTCAATTTGC GAAGGCGAAG TTCGCACAAG AGTATGAGTG CGTCGCGGTA AAGTTCCTGG CGGATGGCGG CGAGGATGTT CTGGTACTCG AGAAAGCGCG GCATCCGCTA CTCGAGCGTA ATTTGCGACC CAAGGGGATC GCGGTTGTGC CGATGGCTAT GCACATGGAT GCTCGGCACC GCCAGATCAT CATTAGCGGC CCAAATACAG GTGGCAAGAC GGTGTCGCTC AAGACTTTGG GCTTGCTCGC ATTGATCGCG CAAGCAGGCG TACCAGTTCC TGCGGACAGA GCGGAACTGC CGATTTTCAG CAGTGTTTTC GCTGACATCG GCGATTACCA GTCCATCGAG CAAAACCTTT CGACATTCTC GGCACACGTC ACGAACATCG ATCTCATCTC GCACACCGCC GGAGCAGATT CCTTGGTGCT GCTGGATGAA CTTGGCTCTG CAACGGACCC GGAGGAAGGC GCGGCACTCG CGGTCGCCAT TGCCGACTAC TTCCGCCAGA TCGGGTGCTT AAGCGTGATC TCGACCCACC ATACGTCATT GAAGGTATAC GCGGCCAACA CGGAGGGTGT GTTAAATGCC GCCGTCGGCT TCGATGAGCA GACGCTGCAG CCGACGTATG AGTTGCGTGT GGGAGTGCCG GGCGCCTCAG CGGGTATCAA CATTGCGAAG CGACTCGGGC TCAATTCGAC AATCATCGAA GCGTCGAAAC GACAGCTTAG CAATCAGGCA CAGGATGTTG CGAAGTTCCT CGACCGCCTG CATGCCGAGC TTCGCGCAGC TTCTGATGAG CGTGCCTCCA TCAAGCGAAC CGAAGAGGAA CTGGTGCGCG AGCGCAAGCG GCTTGAGGCA GAAGGCCAAA AAGAGCAGCG CGAGAAAATC CGCGACCTCG AAAAGAAGCT CGACGGGCTG CTCCACGACT TTGAGTACCA AGCGCGCGAG 
ATGGTGCAGG CGGTGCAGGA CCGTGCCGCA CAACAAAAGC TGTCGAAAGA TGCGGAACGC CGGATTGCAA AAATGCGCCG TGAGTTCCGA GAGCAGTTCG ACAACAGTGT AGTGGCTCAC GCCACCGGCG CTGACCAAGG CGATCCGAAT GCCCGGCCGG AACTCGTGAA ACACGTCTCC GAGGGTGACC GCGTAAAGCT GCGCTCCATG GGCCGCGAAG GCAAGGTGAT TAAGCGGCTG GGGGCTGACC TGTTCGAAGT GGAAATCGGC GTGATGAAGA TGAAGGTGCC GCGCGAGGAC ATAGCGGAGG TTACTTCTCG GCCGAGCGCG AATCCAGTCG CAGCGGCGCG TGCAAAAGGT GTAAGTGTTT CGCTCGTGAG CGACGACCTA TCGTCGCCGA TCGAGTTGAA CGTCATCGGT CAGAACGTAG ATGATGCGAC GCGCGAGGTT GAGCGTTTCC TCGACAAAGC GTTTCTTGCA GGCATGGTGC AGGTGCGAAT TGTGCACGGC AGCGGCATGG GGATCCTCCG CCGGGCGCTA CGGACCTACC TCAAGCATCA TCCGCACGTG TCGAACGTTG TCGAGCCTCC GCAGCAAGAG GGCGGGAACG GCGCCACGGT CGTCGAACTC AAGGTTTAG
|
Protein sequence | MIPRPLTHSS APVLEFEAFR ELLRGYAQSE LGSARVRELA PSADREWIER EQQLASEIRG YIRAAGRFDF VGLTDATKLI QKARIRGAAL EMDEIRTILL LAERAAEWRE ILISPPVMRE PWKAVEDLSS SLADFREFLR YFSNKLLPDG SLDDRASSEL HRIRREIERQ KRHIQSSLQS FLRKLSDEGT AQEELITIRG DRFVIPVKAE QKRRVNGVVH GASSSGQTVF VEPMETIEQN NDLVRLLEEE QEEIRRILAE MSRRIGEQSE NLLFALYVLA ELELQFAKAK FAQEYECVAV KFLADGGEDV LVLEKARHPL LERNLRPKGI AVVPMAMHMD ARHRQIIISG PNTGGKTVSL KTLGLLALIA QAGVPVPADR AELPIFSSVF ADIGDYQSIE QNLSTFSAHV TNIDLISHTA GADSLVLLDE LGSATDPEEG AALAVAIADY FRQIGCLSVI STHHTSLKVY AANTEGVLNA AVGFDEQTLQ PTYELRVGVP GASAGINIAK RLGLNSTIIE ASKRQLSNQA QDVAKFLDRL HAELRAASDE RASIKRTEEE LVRERKRLEA EGQKEQREKI RDLEKKLDGL LHDFEYQARE MVQAVQDRAA QQKLSKDAER RIAKMRREFR EQFDNSVVAH ATGADQGDPN ARPELVKHVS EGDRVKLRSM GREGKVIKRL GADLFEVEIG VMKMKVPRED IAEVTSRPSA NPVAAARAKG VSVSLVSDDL SSPIELNVIG QNVDDATREV ERFLDKAFLA GMVQVRIVHG SGMGILRRAL RTYLKHHPHV SNVVEPPQQE GGNGATVVEL KV
|
| |
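
The sequence fields are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares with table 1 (the two tables differ in permitted start codons). All function names are hypothetical helpers written for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace between the 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGATCCCTC GGCCACTCAC CCACTCCAGT"
print(translate(first_blocks))  # MIPRPLTHSS
```

The translated prefix matches the first block of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content row.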