Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2646 |
Symbol | |
ID | 6976076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2914291 |
End bp | 2915865 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643392161 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_002277002 |
Protein GI | 209544773 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.264695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGC ACGATGCCAT CGGTTTCGAA AGCATTCTGA ATCCCGACCC CGCGATGCAG GCCCGGACCG AGCCGGTGCC CGATCGTCCG GCACCCGAAC CCCGCTGCTT CCACGACCTC TGCCTGGATC AGATCCTCGA CCGGATGACC GCCGGGCGGG AGATGTTCGG GCTCGACCGG GTATTCTGCA CGCCGGCGCC GGATCTCGGG ACGATCCGCT ATCGCCAGGC GTTCTGGCAG GATCTCGAGC AGCCGGATAT CGGCGCGGCG TGCCGCGCTT TCACGCGGAA GATCCAGAGC AGCCGCGCTC AACAGCAGAT GGTCGGGAAA AGCCAGGACG AATGGTCGGC GCGGCGCTGG TTCCTCGATG CCGCAAGGAT GTACGGATCC GCCGTCCGGG ACCTCGACCG GGCCCTGAAC GGGCTGAAAC CGACCTCCGA CGCCGTCGGC ATGCTGGCGC GGTGGCTCGA CACGCATCTC TCATCGGACC ATTTCCGCCG CCTCGCCGAC GAGGGGGAGG CGATCGCGCG AGACCTGGAG GACATTGTGT ATGCGGTATC CGTGGGCGAA GGGGATTTCC GGGTCCAGCA TCCCGGACGC GAAAGCGATT ACAGCGCGGA AATCGAGGAC ACCTTTGCGC GGTTCCGGCA GGGCGACGTC CGCAGCCACC TCGTGGACCT GCAGGACACC CTGGGGCTCG ATCATATCGA GGCGACGATC CTCGCATTCG TCGCACGGCT GAATCCGAAC GTGTTCGACC GCCTGCGGAC ATTCTGCGAG GATTTCGCCA CCTACGAAAA TCCGGTCCTG ATCAGGCTGG ACAGGGAACT GCATTTCTAC CTCGCCTATG CCGATTTCAT CGCGCCGATG CGTGCCGCGG GCCTGCCATT CTGTTGCCCG GACATGTCGG ACACCGACAA GGCGGAGCGG GTAGCCGACA TGTTCGATCC CGCCCTGGCC GTGCGGCTGG TCGATGACGG CAAGGCGGTC GTCACGAACG ATTTCGAACT CTCCGGCCCC GAACGGATCA TCATGGTCAC CGGCCCCAAT CAGGGCGGAA AGACGACCTT CGCCCGGGCA TTCGGTCAAT TGCACTATCT CGCCCGTCTC GGACTGCCCG TTCCCGGACG CGAAGCGCAT CTGTTCCTGG TCGACGAGAT CCATACGCAT TTCGAGCGGG AGGAGAACGC GGCCGACCTG CGCGGCAAGC TGGAAGACGA ACTGGTGCGG ATTCACGACA TCGTCACGCA TGTCTCGCCG CGCAGCCTCG TGATCATGAA CGAAAGCTTC AACGCGACGA CGGCCGACGA CGCGGCCAGC CTGTCCGCTG CCGTTCTGGA GGACTTCATC GGACAGGACC TGATCTGCGT CTGCGTGACC TTCATCGATG AGATCGCGAC ATTGTCCCAC ACGATCGTCA GCATGGTCAG TACGGTCGAT CCGGACCGCG ACGACGCACG GACATTCAGG ATCGTCCGCC GACCCTCCGA CGGACGGGTC TATGCTGCCT CCCTGGCCCA TAAATATCAC CTCACGGGCG CCGATATCCG GCGCCGCCTG ACGGAGGCAC CATGA
|
Protein sequence | MAAHDAIGFE SILNPDPAMQ ARTEPVPDRP APEPRCFHDL CLDQILDRMT AGREMFGLDR VFCTPAPDLG TIRYRQAFWQ DLEQPDIGAA CRAFTRKIQS SRAQQQMVGK SQDEWSARRW FLDAARMYGS AVRDLDRALN GLKPTSDAVG MLARWLDTHL SSDHFRRLAD EGEAIARDLE DIVYAVSVGE GDFRVQHPGR ESDYSAEIED TFARFRQGDV RSHLVDLQDT LGLDHIEATI LAFVARLNPN VFDRLRTFCE DFATYENPVL IRLDRELHFY LAYADFIAPM RAAGLPFCCP DMSDTDKAER VADMFDPALA VRLVDDGKAV VTNDFELSGP ERIIMVTGPN QGGKTTFARA FGQLHYLARL GLPVPGREAH LFLVDEIHTH FEREENAADL RGKLEDELVR IHDIVTHVSP RSLVIMNESF NATTADDAAS LSAAVLEDFI GQDLICVCVT FIDEIATLSH TIVSMVSTVD PDRDDARTFR IVRRPSDGRV YAASLAHKYH LTGADIRRRL TEAP
|
| |