Gene Acid345_4299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4299 
Symbol 
ID4071872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5109048 
End bp5111486 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content59% 
IMG OID637986332 
ProductMutS2 family protein 
Protein accessionYP_593373 
Protein GI94971325 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCTC GGCCACTCAC CCACTCCAGT GCCCCGGTCC TGGAATTCGA AGCGTTTCGC 
GAGTTGTTGC GCGGCTATGC GCAATCGGAA CTCGGGTCGG CACGCGTCCG CGAACTGGCG
CCGTCGGCCG ATCGGGAGTG GATCGAGCGA GAACAGCAAC TCGCATCCGA GATTCGAGGG
TACATTCGTG CTGCCGGGCG CTTCGACTTC GTCGGACTGA CCGATGCGAC CAAGTTGATT
CAGAAAGCCC GCATCCGTGG TGCCGCACTG GAGATGGACG AAATTCGCAC CATCCTGCTA
CTCGCGGAAC GCGCAGCAGA GTGGCGCGAG ATCTTGATTT CGCCTCCCGT GATGCGCGAG
CCTTGGAAGG CCGTGGAAGA TCTCTCGAGT TCGCTCGCGG ATTTTCGAGA ATTCCTGCGC
TACTTCAGCA ACAAATTACT TCCGGACGGA TCGCTCGACG ATCGCGCCTC AAGCGAGTTG
CATCGTATTC GGCGCGAGAT CGAGCGGCAG AAGCGACACA TCCAGAGCTC GTTGCAATCG
TTCCTGCGGA AGTTGTCCGA TGAAGGCACG GCGCAGGAAG AGTTGATCAC GATTCGCGGC
GACCGCTTCG TTATCCCGGT AAAGGCGGAG CAGAAGCGAC GTGTGAATGG CGTGGTACAC
GGCGCCAGTT CAAGCGGCCA GACGGTGTTC GTGGAGCCCA TGGAGACGAT CGAGCAAAAC
AACGACCTAG TGCGGTTGCT GGAAGAAGAG CAGGAAGAGA TTCGGCGCAT TCTTGCCGAG
ATGTCGCGAC GAATCGGGGA GCAGTCGGAA AACCTGCTCT TTGCGCTCTA CGTACTGGCT
GAGTTGGAGC TTCAATTTGC GAAGGCGAAG TTCGCACAAG AGTATGAGTG CGTCGCGGTA
AAGTTCCTGG CGGATGGCGG CGAGGATGTT CTGGTACTCG AGAAAGCGCG GCATCCGCTA
CTCGAGCGTA ATTTGCGACC CAAGGGGATC GCGGTTGTGC CGATGGCTAT GCACATGGAT
GCTCGGCACC GCCAGATCAT CATTAGCGGC CCAAATACAG GTGGCAAGAC GGTGTCGCTC
AAGACTTTGG GCTTGCTCGC ATTGATCGCG CAAGCAGGCG TACCAGTTCC TGCGGACAGA
GCGGAACTGC CGATTTTCAG CAGTGTTTTC GCTGACATCG GCGATTACCA GTCCATCGAG
CAAAACCTTT CGACATTCTC GGCACACGTC ACGAACATCG ATCTCATCTC GCACACCGCC
GGAGCAGATT CCTTGGTGCT GCTGGATGAA CTTGGCTCTG CAACGGACCC GGAGGAAGGC
GCGGCACTCG CGGTCGCCAT TGCCGACTAC TTCCGCCAGA TCGGGTGCTT AAGCGTGATC
TCGACCCACC ATACGTCATT GAAGGTATAC GCGGCCAACA CGGAGGGTGT GTTAAATGCC
GCCGTCGGCT TCGATGAGCA GACGCTGCAG CCGACGTATG AGTTGCGTGT GGGAGTGCCG
GGCGCCTCAG CGGGTATCAA CATTGCGAAG CGACTCGGGC TCAATTCGAC AATCATCGAA
GCGTCGAAAC GACAGCTTAG CAATCAGGCA CAGGATGTTG CGAAGTTCCT CGACCGCCTG
CATGCCGAGC TTCGCGCAGC TTCTGATGAG CGTGCCTCCA TCAAGCGAAC CGAAGAGGAA
CTGGTGCGCG AGCGCAAGCG GCTTGAGGCA GAAGGCCAAA AAGAGCAGCG CGAGAAAATC
CGCGACCTCG AAAAGAAGCT CGACGGGCTG CTCCACGACT TTGAGTACCA AGCGCGCGAG
ATGGTGCAGG CGGTGCAGGA CCGTGCCGCA CAACAAAAGC TGTCGAAAGA TGCGGAACGC
CGGATTGCAA AAATGCGCCG TGAGTTCCGA GAGCAGTTCG ACAACAGTGT AGTGGCTCAC
GCCACCGGCG CTGACCAAGG CGATCCGAAT GCCCGGCCGG AACTCGTGAA ACACGTCTCC
GAGGGTGACC GCGTAAAGCT GCGCTCCATG GGCCGCGAAG GCAAGGTGAT TAAGCGGCTG
GGGGCTGACC TGTTCGAAGT GGAAATCGGC GTGATGAAGA TGAAGGTGCC GCGCGAGGAC
ATAGCGGAGG TTACTTCTCG GCCGAGCGCG AATCCAGTCG CAGCGGCGCG TGCAAAAGGT
GTAAGTGTTT CGCTCGTGAG CGACGACCTA TCGTCGCCGA TCGAGTTGAA CGTCATCGGT
CAGAACGTAG ATGATGCGAC GCGCGAGGTT GAGCGTTTCC TCGACAAAGC GTTTCTTGCA
GGCATGGTGC AGGTGCGAAT TGTGCACGGC AGCGGCATGG GGATCCTCCG CCGGGCGCTA
CGGACCTACC TCAAGCATCA TCCGCACGTG TCGAACGTTG TCGAGCCTCC GCAGCAAGAG
GGCGGGAACG GCGCCACGGT CGTCGAACTC AAGGTTTAG
 
Protein sequence
MIPRPLTHSS APVLEFEAFR ELLRGYAQSE LGSARVRELA PSADREWIER EQQLASEIRG 
YIRAAGRFDF VGLTDATKLI QKARIRGAAL EMDEIRTILL LAERAAEWRE ILISPPVMRE
PWKAVEDLSS SLADFREFLR YFSNKLLPDG SLDDRASSEL HRIRREIERQ KRHIQSSLQS
FLRKLSDEGT AQEELITIRG DRFVIPVKAE QKRRVNGVVH GASSSGQTVF VEPMETIEQN
NDLVRLLEEE QEEIRRILAE MSRRIGEQSE NLLFALYVLA ELELQFAKAK FAQEYECVAV
KFLADGGEDV LVLEKARHPL LERNLRPKGI AVVPMAMHMD ARHRQIIISG PNTGGKTVSL
KTLGLLALIA QAGVPVPADR AELPIFSSVF ADIGDYQSIE QNLSTFSAHV TNIDLISHTA
GADSLVLLDE LGSATDPEEG AALAVAIADY FRQIGCLSVI STHHTSLKVY AANTEGVLNA
AVGFDEQTLQ PTYELRVGVP GASAGINIAK RLGLNSTIIE ASKRQLSNQA QDVAKFLDRL
HAELRAASDE RASIKRTEEE LVRERKRLEA EGQKEQREKI RDLEKKLDGL LHDFEYQARE
MVQAVQDRAA QQKLSKDAER RIAKMRREFR EQFDNSVVAH ATGADQGDPN ARPELVKHVS
EGDRVKLRSM GREGKVIKRL GADLFEVEIG VMKMKVPRED IAEVTSRPSA NPVAAARAKG
VSVSLVSDDL SSPIELNVIG QNVDDATREV ERFLDKAFLA GMVQVRIVHG SGMGILRRAL
RTYLKHHPHV SNVVEPPQQE GGNGATVVEL KV