Gene Gdia_2646 details


Gene Information

Locus tag: Gdia_2646
Symbol:
ID: 6976076
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2914291
End bp: 2915865
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 64%
IMG OID: 643392161
Product: DNA mismatch repair protein MutS domain protein
Protein accession: YP_002277002
Protein GI: 209544773
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
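
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions (an assumption; the page does not state the convention). Under that reading, the gene length is End bp minus Start bp plus one, and the protein length is one third of that minus the stop codon. A minimal Python sketch of the check, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (not stated on the page)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2914291, 2915865)
print(length_bp, protein_length(length_bp))  # 1575 524
```

Both values match the Gene Length and Protein Length fields listed in the record.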


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.264695
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCGC ACGATGCCAT CGGTTTCGAA AGCATTCTGA ATCCCGACCC CGCGATGCAG 
GCCCGGACCG AGCCGGTGCC CGATCGTCCG GCACCCGAAC CCCGCTGCTT CCACGACCTC
TGCCTGGATC AGATCCTCGA CCGGATGACC GCCGGGCGGG AGATGTTCGG GCTCGACCGG
GTATTCTGCA CGCCGGCGCC GGATCTCGGG ACGATCCGCT ATCGCCAGGC GTTCTGGCAG
GATCTCGAGC AGCCGGATAT CGGCGCGGCG TGCCGCGCTT TCACGCGGAA GATCCAGAGC
AGCCGCGCTC AACAGCAGAT GGTCGGGAAA AGCCAGGACG AATGGTCGGC GCGGCGCTGG
TTCCTCGATG CCGCAAGGAT GTACGGATCC GCCGTCCGGG ACCTCGACCG GGCCCTGAAC
GGGCTGAAAC CGACCTCCGA CGCCGTCGGC ATGCTGGCGC GGTGGCTCGA CACGCATCTC
TCATCGGACC ATTTCCGCCG CCTCGCCGAC GAGGGGGAGG CGATCGCGCG AGACCTGGAG
GACATTGTGT ATGCGGTATC CGTGGGCGAA GGGGATTTCC GGGTCCAGCA TCCCGGACGC
GAAAGCGATT ACAGCGCGGA AATCGAGGAC ACCTTTGCGC GGTTCCGGCA GGGCGACGTC
CGCAGCCACC TCGTGGACCT GCAGGACACC CTGGGGCTCG ATCATATCGA GGCGACGATC
CTCGCATTCG TCGCACGGCT GAATCCGAAC GTGTTCGACC GCCTGCGGAC ATTCTGCGAG
GATTTCGCCA CCTACGAAAA TCCGGTCCTG ATCAGGCTGG ACAGGGAACT GCATTTCTAC
CTCGCCTATG CCGATTTCAT CGCGCCGATG CGTGCCGCGG GCCTGCCATT CTGTTGCCCG
GACATGTCGG ACACCGACAA GGCGGAGCGG GTAGCCGACA TGTTCGATCC CGCCCTGGCC
GTGCGGCTGG TCGATGACGG CAAGGCGGTC GTCACGAACG ATTTCGAACT CTCCGGCCCC
GAACGGATCA TCATGGTCAC CGGCCCCAAT CAGGGCGGAA AGACGACCTT CGCCCGGGCA
TTCGGTCAAT TGCACTATCT CGCCCGTCTC GGACTGCCCG TTCCCGGACG CGAAGCGCAT
CTGTTCCTGG TCGACGAGAT CCATACGCAT TTCGAGCGGG AGGAGAACGC GGCCGACCTG
CGCGGCAAGC TGGAAGACGA ACTGGTGCGG ATTCACGACA TCGTCACGCA TGTCTCGCCG
CGCAGCCTCG TGATCATGAA CGAAAGCTTC AACGCGACGA CGGCCGACGA CGCGGCCAGC
CTGTCCGCTG CCGTTCTGGA GGACTTCATC GGACAGGACC TGATCTGCGT CTGCGTGACC
TTCATCGATG AGATCGCGAC ATTGTCCCAC ACGATCGTCA GCATGGTCAG TACGGTCGAT
CCGGACCGCG ACGACGCACG GACATTCAGG ATCGTCCGCC GACCCTCCGA CGGACGGGTC
TATGCTGCCT CCCTGGCCCA TAAATATCAC CTCACGGGCG CCGATATCCG GCGCCGCCTG
ACGGAGGCAC CATGA
 
Protein sequence
MAAHDAIGFE SILNPDPAMQ ARTEPVPDRP APEPRCFHDL CLDQILDRMT AGREMFGLDR 
VFCTPAPDLG TIRYRQAFWQ DLEQPDIGAA CRAFTRKIQS SRAQQQMVGK SQDEWSARRW
FLDAARMYGS AVRDLDRALN GLKPTSDAVG MLARWLDTHL SSDHFRRLAD EGEAIARDLE
DIVYAVSVGE GDFRVQHPGR ESDYSAEIED TFARFRQGDV RSHLVDLQDT LGLDHIEATI
LAFVARLNPN VFDRLRTFCE DFATYENPVL IRLDRELHFY LAYADFIAPM RAAGLPFCCP
DMSDTDKAER VADMFDPALA VRLVDDGKAV VTNDFELSGP ERIIMVTGPN QGGKTTFARA
FGQLHYLARL GLPVPGREAH LFLVDEIHTH FEREENAADL RGKLEDELVR IHDIVTHVSP
RSLVIMNESF NATTADDAAS LSAAVLEDFI GQDLICVCVT FIDEIATLSH TIVSMVSTVD
PDRDDARTFR IVRRPSDGRV YAASLAHKYH LTGADIRRRL TEAP
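
Both sequences above are printed in space-separated blocks of ten characters, so the spaces and line breaks need to be removed before any computation. The following Python sketch shows one way to do that and to compute a GC fraction. The function names are hypothetical, and the example runs only on the final line of the gene sequence, not on the full gene:

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence as printed above (15 bases).
last_line = "ACGGAGGCAC CATGA"
seq = clean_sequence(last_line)
print(len(seq), f"{gc_fraction(seq):.2f}")  # 15 0.60
```

Passing the full gene sequence through `clean_sequence` first would let the same function be compared against the GC content field of the record.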