Gene Gdia_0299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0299 
Symbol 
ID6973691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp328534 
End bp331155 
Gene Length2622 bp 
Protein Length873 aa 
Translation table11 
GC content72% 
IMG OID643389830 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002274711 
Protein GI209542482 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.391446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.153174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCC CTTCACCGGA CGGCGCCACT CCGGCCATGG CGCAATGGTT TTCCCTCAAG 
GCCGAACATC CCGAGGCCCT GCTGTTCTTC CGCATGGGTG ATTTCTACGA GATGTTCTTC
TCCGACGCCG AGGCGGCGGC CGGCGCGCTG GATATCGCCC TGACCGCGCG CGGCACGCAT
GGCGATATCC CGATTCCGAT GTGCGGCGTC CCGGTCGGGG CGGCATCGTC CTACCTGGCG
CGGCTGATCC GTCGCGGGTT CCGCGTGGCG GTGGCCGAAC AGACCGAACC GCCGCGCAAG
CCCGGCAAGG GCGGCCCCAA GGGCCCGCTG GCCCGCGCCG TGGTGCGGCT TGTCACCCCC
GGCACCCTGA CCGAGGACGA ATTGCTGGAG GCCGGACGGC AGAACCTGCT GCTGGCCGTC
GCCTCGCGCG GCGGACGCGG AACGCGCGGC TGGGGCGTGG CGTGGATCGA TATTTCCACC
GGCACATTCG AGACGGCCTC GGTTCCCGCC GAGGGGCTGA TGGACCTGCT GGGCCGGCTG
GACCCGGCGG AAATCCTGGC GGCGGCCGAG ATCGACCTGG GTGACTTCGC GGCCCGGCGC
GCGCCGGCAA CCATTCCCCC CGGCCCGGAA GCCGCCCGGC GGCGCGTGGC CGAGACCTTC
CATGTCGCCA GCCTGGACGC GTTCGGGACC TTTTCCGACG AGGAAGCGGT CGCCGGCGCC
ATGGCCCTGG ATTACGTCCG GCGCAGCCAG GCCGGCCAGA TGCCGCGCCT GTCGAGGCCC
CAGGCGCATG AAAATGGCGG AGTGCTGGGC CTGGACCCCG CCACGCGCGC CAGCCTTGAC
CTGCTGCGCA CGCGCGACGG CGGCACCGAG CATACGTTGC TGGCCAGCGT CGGCCGCACC
GTCACCGCGC CCGGCAGCCG ATTGCTGGCG GAATGGATCG CCGCCCCCCT GACCCGGGAC
GCCGCGATCG CCGCCCGCCA GGACGGCTGG TCGTGGCTGA TCGAGGAAGG CGGCGTGCGC
GCGGCCCTGC GGCAGGGATT GCGCGGTACG CCGGACATCG CCCGCGCGCT GGCCCGCCTG
TCGCTGGGCC GTGGCCTGCC GCGCGACGCG GCGGCGATCC GCGACGGGCT GGCCATCGCC
CGGCGCATCG CCCAGGCCCT GACGGACGGC CGCACCGCAC CGCCCGCGCT GCTGGCCGAC
GCCATGCGCC ACCTGGGCGA GGCAACGGAC CTGGAAGCCC GGCTGAAGCA GGCCCTGGCC
GAAGACCTGC CGGCCCGGGT GGAAGACGGC GGCGTGATCG CCGCCGGCTT CGATGCCGAA
CTGGACGCCG AACGGGCCCT GCGCGACGAC AGCCGCCGGG TCATTGCCGG GTTGCAGAAC
GATTATGCCC AGAAATTCGG GCTGGCCAGC CTGAAGATCC GCCATCACGC GCAACTGGGC
TACGTCATCG AGGTCCCGGC GGCGACCGCC CCCCGCCTGC GCGAGCGCAC CGACCTGATC
CTGCGCCAGG GCACGGCCAG CGCGGCCCGG TTCTCGACCG AGGAACTGGT CAGCCTGGAC
CGCCGCATCG CCGAGGCCGC GGAGCGTGCC GCAACGCGCG AGCGGCGGAT CTTCGCCGCC
CTGGTGCGCG AGATCCTGGA CGAACCCGCC CCGCCGGTCA TCGCGGGGGC GCTGGCGGTC
CTGGACGTGC TGCAATCCTG CGCCGACCTG GCCGCCGGCG GCATGTGGTG CCGGCCCGAG
GTCACGGACG ACGACGCCTT CACGCTGACG GCCTGCCGCC ACCCGGTGGT CGAGGCGGCG
CTGCCGCGCA GCGAGCGCTT CACCCCGAAC GATTGCGTGC TGGAACCCGC CCAGCGCGTC
ATGCTGCTGA CCGGGCCGAA CATGGCGGGC AAATCGACCT TCCTGCGCCA GACCGCGCTG
GCGGTCATCC TGGCGCAGGC CGGGCTGCCC GTCCCGGCCA AGGCCGCGCG GATCGGCGTC
GTCGACCGGC TGTTTTCCCG CGTGGGGGCA TCGGACGACC TGGCGCGCGG GCGGTCGACC
TTCATGGTGG AAATGACCGA GACCGCCGCC ATCCTGAACC AGGCCGGCCC CCGGTCCCTG
GTGGTGGTCG ATGAAATCGG GCGCGGCACC GCGACGCTGG ACGGGCTGGC CATCGCCTGG
TCGGTGCTGG AAGCCATGCA TTCGACATTG CGCTGCCGTT CCATTTTCGC AACGCATTTC
CACGAACTGG CCGAACTGGC GGAAAGCCTG CCGCGCCTGT CGCCCCACAC GATGAGCGTG
CGGGAATGGA AGGGCCAGGT GATCTTCCAG CACGAGGTCA TACCCGGATC GGCGCGCCGA
AGCTGGGGCG TGCACGTGGC CCGGCTGGCC GGCGTGCCGG AACCGGTTGT CCGCCGCGCC
GCGCGGCTGC TGGCGGGACT GGAGAAGGAA CGCGCGGTGG GCGCCAGGCC CCTGCCGCTG
TTCGCCCCGG CGGATAGCAC CCCGCCTGCC GAAGAACCCG ATCCGGGCGT GCCCGAACCG
GTGCGGCGGA TGCTGGAACA GCTCGACCCC GACGAGCTGA CGCCGCGCAC GGCGCTGGAC
ATGGTCTACG CGATCAAGAA ACTGATGCTT GAAGAATCCT GA
 
Protein sequence
MTLPSPDGAT PAMAQWFSLK AEHPEALLFF RMGDFYEMFF SDAEAAAGAL DIALTARGTH 
GDIPIPMCGV PVGAASSYLA RLIRRGFRVA VAEQTEPPRK PGKGGPKGPL ARAVVRLVTP
GTLTEDELLE AGRQNLLLAV ASRGGRGTRG WGVAWIDIST GTFETASVPA EGLMDLLGRL
DPAEILAAAE IDLGDFAARR APATIPPGPE AARRRVAETF HVASLDAFGT FSDEEAVAGA
MALDYVRRSQ AGQMPRLSRP QAHENGGVLG LDPATRASLD LLRTRDGGTE HTLLASVGRT
VTAPGSRLLA EWIAAPLTRD AAIAARQDGW SWLIEEGGVR AALRQGLRGT PDIARALARL
SLGRGLPRDA AAIRDGLAIA RRIAQALTDG RTAPPALLAD AMRHLGEATD LEARLKQALA
EDLPARVEDG GVIAAGFDAE LDAERALRDD SRRVIAGLQN DYAQKFGLAS LKIRHHAQLG
YVIEVPAATA PRLRERTDLI LRQGTASAAR FSTEELVSLD RRIAEAAERA ATRERRIFAA
LVREILDEPA PPVIAGALAV LDVLQSCADL AAGGMWCRPE VTDDDAFTLT ACRHPVVEAA
LPRSERFTPN DCVLEPAQRV MLLTGPNMAG KSTFLRQTAL AVILAQAGLP VPAKAARIGV
VDRLFSRVGA SDDLARGRST FMVEMTETAA ILNQAGPRSL VVVDEIGRGT ATLDGLAIAW
SVLEAMHSTL RCRSIFATHF HELAELAESL PRLSPHTMSV REWKGQVIFQ HEVIPGSARR
SWGVHVARLA GVPEPVVRRA ARLLAGLEKE RAVGARPLPL FAPADSTPPA EEPDPGVPEP
VRRMLEQLDP DELTPRTALD MVYAIKKLML EES