Gene Xaut_0035 details


Gene Information

Locus tag: Xaut_0035
Symbol:
ID: 5424135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xanthobacter autotrophicus Py2
Kingdom: Bacteria
Replicon accession: NC_009720
Strand:
Start bp: 34202
End bp: 36967
Gene Length: 2766 bp
Protein Length: 921 aa
Translation table: 11
GC content: 72%
IMG OID: 640879280
Product: DNA mismatch repair protein MutS
Protein accession: YP_001414951
Protein GI: 154243993
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
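
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(34202, 36967)
print(length, protein_length_aa(length))  # prints: 2766 921
```

Both values agree with the Gene Length and Protein Length fields of the record.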


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.892203
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTCAG ACCGCGCCCG GACCGCCACC CCAGACCTTC CGCAGGACTC CCCCGAGCCC 
GCCAGCGTCG CTCCCGCCCC CGCGCGGGCG GAGGAGGCGC GTGTCACGCC CATGATGGCG
CAGTATCTGG AGATCAAGGC GGCCAATCCG GACAGCCTGC TATTTTACCG CATGGGCGAT
TTCTACGAGC TGTTCTTCGC CGACGCGGAA GCCGCCTCGC AGGCGCTCGG CATCGTGCTG
ACCAAGCGCG GCAAGCACCT GGGCGAGGAC ATTCCCATGT GCGGCGTGCC CATCGACCGC
GCCGAGGAAT ATCTGCACAA GCTCATCGCT CTGGGCTTCC GCGTGTCGGT GTGCGAGCAG
CTGGAGGACC CGGCGGAGGC GAAGAAGCGC GGACCGAAAT CCGTGGTGCG GCGGGACGTG
ACCCGCCTCG TCACCCCCGG CACGATCACC GAGGACGCGC TGCTGGATGC CCGGCGCGAG
AACGTGCTGG CGGCGCTCGC CCGCGTGCGG GCCGGATCGG GGCCGGAGGA TTTCGCCTAC
GCGCTGGCCT ATACCGATAT GTCCACCGGC AGCTTCCGGG TCACCGCCAC CGCGCGTGAC
GACCTCTCCG GCGACCTCGC CCGCCTCGAT CCGGCGGAGA TCCTGGTCTC CGACGCGGTG
CTGGACGACG GCGAACTGCG CGCGCTGCTG CGGGCCTTTC CGGCGGTGAC GCCCCTGCCG
CGCCAGTCCT TCGACGGAGC GGGGGCGGAA AAGCGCCTCG CCGATTTCTT CGGCGTGGCG
GCGCTCGACG CCTTCGGCAC CTTCGCCCGC GCCGAGCTGA TCGCGGCGGC GGCCATCGTC
GCCTATGTGG ATCGCACCCA GCTGGGCGCC AAGCCGCTGC TCTCCCGTCC GGTGAAGGAA
GCGGAAGGCG GCATCATGGC CATCGATGCC GGCACGCGGG CCAATCTGGA GCTGGTGCGC
ACCACCTCCG GCGAGCGGCG CGGCTCCCTG CTCGCCGCCG TGGATCGCAC GGTGACGGCG
GCCGGCGCGC GCCTCATCGC CCGCCGCATC GCCGAGCCGC TTACGGACCT TGCCGCCATC
CGTGCCCGGC ACGACGGCGT CGCCCATCTG GTGGAGGAGG GGGAATTGCG GCGCGAGCTG
CGCGCCCGGC TCTCCCGCGC CCCCGACATG GCCCGAGCAG TCACCCGCCT CGCCCTCCAG
CGCGGCGGCC CGCGCGACCT TGCTGCCGTG CGCGATGCGC TGGACGGTGC CCTCGCCATC
GCCGGCCTGT TCGCCGCCGC GCCACCGGCG GACCTTGCCC GCTCAGCGGC GGCGCTGGCG
CGGGTGCCCC ATGCCCTGGT GCTCGACCTC GCCTCGGCGC TCGCCGAGAG CCTGCCGCCC
CTCCGTCGCG ACGGCGGCTT CATCCGCGAA GGCTGCGACG CCGAGCTGGA TGCGACGCGT
GCCCTGCGCG ACGAGAGCCG GCGCGTGGTG GCGGCGCTGG AGCGGCGCTA CGTGGACGAG
ACCGGCGTGC GCGCCCTGAA GATCCGGCAC AATGCGGTGC TCGGCTATTT CGTCGAGGTC
TCCGCCCAGA ACGCCGACCG CCTGCGCGAG GCGCCACACG ACGCCGTTTT CGTCCACCGC
CAGACCATGG CGGGGGCGGT GCGCTTCTCG TCCGTCGAGC TGGGCGATCT TGAGAGCCGT
ATCGCCAGCG CCGGCGAGCG GGCGCTGGGG CTGGAGCAGG CCATCTTCGA CCGGCTGGCT
GCCGCCGTGG TGGCCGAGAC CGAGACCATC CGCGCCGCCG CGGAGGCGCT GGCGGAACTC
GATGTGGCTG CGGGCTTTGC AGAGCTTGCG GCGGTGGAGA ACCATGTGCG CCCGCACATG
GAGCCGGGGG TCGCCTTCGC CATCGCCGGC GGGCGCCATC CGGTGGTGGA GCAGGCCCTC
GCCAAAGAGG GCGGGCCGTT CGTGCCCAAC GATTGCGACC TCTCCCCGCC GGAGGGGTTC
GAGGACGGGC GCATCGTGCT GGTCACCGGG CCGAACATGG CCGGCAAGTC CACCTTCCTG
CGCCAGAACG CGCTCATCTG CGTGCTGGCG CAGGCCGGCG CCTTCGTGCC CGCCCGGTCC
GCGCGCATTG GCGTGGTGGA CCGCCTGTTC TCCCGCGTGG GCGCGGCGGA TGACCTCGCC
CGCGGGCGTT CCACCTTCAT GGTGGAGATG GTGGAGACCG CTGCCATCCT GAACCAGGCC
ACCGCCCGCT CCCTGGTCAT CCTCGACGAG ATCGGGCGCG GCACCGCCAC CTTCGACGGC
ATGTCCATCG CCTGGGCGAG CCTTGAGCAC CTGCACGAGG TGAACCGCTG CCGGGCGCTG
TTCGCCACAC ATTTCCACGA ACTGACCGCC CTGTCCCAGC GCTGCAAGCG GCTCTCCAAC
GCCACGGTGA AGGTCACCGA ATGGCATGGC GACGTGATCT TCCTGCATGA GGTGGTGCCG
GGGGCGGCGG ATCGTTCCTA CGGCATCCAG GTGGCCAAGC TCGCCGGCCT GCCGGAGGCG
GTGATCACCC GCGCCAAGGC GGTGCTGGCG GAGCTGGAAG CGGCCGAGCG CGCCTCTCCG
GCCCAGAAGC TCATCGATGA TCTGCCCCTG TTCGCGGTGC GCCCGAAGCC TGCGGCCGCC
GCATCAGCGG ACCCGAAGGC CGAGGCGACG CTCTCCGCCC TCGACGGCAT CGACCCCGAC
AGCCTGAGCC CGCGCGAGGC GCTGGATGCG CTCTATCGGC TGAAGGGCCT CCGGCGCGAG
GGGTGA
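
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted in the layout shown above (groups of ten bases separated by spaces and line breaks) and contains only A, C, G and T. The helper name `gc_content` is illustrative.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between groups."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Usage: paste the full gene sequence between the triple quotes.
# Only the first line of the record is shown here.
gene_sequence = """
ATGAGCTCAG ACCGCGCCCG GACCGCCACC CCAGACCTTC CGCAGGACTC CCCCGAGCCC
"""
print(f"GC content: {gc_content(gene_sequence):.0%}")
```

Run on the complete sequence, the result should be comparable to the 72% listed in the record, subject to how the source rounds the value.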
 
Protein sequence
MSSDRARTAT PDLPQDSPEP ASVAPAPARA EEARVTPMMA QYLEIKAANP DSLLFYRMGD 
FYELFFADAE AASQALGIVL TKRGKHLGED IPMCGVPIDR AEEYLHKLIA LGFRVSVCEQ
LEDPAEAKKR GPKSVVRRDV TRLVTPGTIT EDALLDARRE NVLAALARVR AGSGPEDFAY
ALAYTDMSTG SFRVTATARD DLSGDLARLD PAEILVSDAV LDDGELRALL RAFPAVTPLP
RQSFDGAGAE KRLADFFGVA ALDAFGTFAR AELIAAAAIV AYVDRTQLGA KPLLSRPVKE
AEGGIMAIDA GTRANLELVR TTSGERRGSL LAAVDRTVTA AGARLIARRI AEPLTDLAAI
RARHDGVAHL VEEGELRREL RARLSRAPDM ARAVTRLALQ RGGPRDLAAV RDALDGALAI
AGLFAAAPPA DLARSAAALA RVPHALVLDL ASALAESLPP LRRDGGFIRE GCDAELDATR
ALRDESRRVV AALERRYVDE TGVRALKIRH NAVLGYFVEV SAQNADRLRE APHDAVFVHR
QTMAGAVRFS SVELGDLESR IASAGERALG LEQAIFDRLA AAVVAETETI RAAAEALAEL
DVAAGFAELA AVENHVRPHM EPGVAFAIAG GRHPVVEQAL AKEGGPFVPN DCDLSPPEGF
EDGRIVLVTG PNMAGKSTFL RQNALICVLA QAGAFVPARS ARIGVVDRLF SRVGAADDLA
RGRSTFMVEM VETAAILNQA TARSLVILDE IGRGTATFDG MSIAWASLEH LHEVNRCRAL
FATHFHELTA LSQRCKRLSN ATVKVTEWHG DVIFLHEVVP GAADRSYGIQ VAKLAGLPEA
VITRAKAVLA ELEAAERASP AQKLIDDLPL FAVRPKPAAA ASADPKAEAT LSALDGIDPD
SLSPREALDA LYRLKGLRRE G
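
The protein sequence follows from the gene sequence by codon translation. This is a minimal sketch, assuming translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The codon table is built from the conventional TCAG ordering, and the function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAGCTCAG ACCGCGCCCG GACCGCCACC"))  # prints: MSSDRARTAT
```

The output matches the first ten residues of the protein sequence above. Likewise, the final six bases of the gene, GGGTGA, translate to a single G followed by a stop codon, which agrees with the last residue of the protein.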