Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2645 |
Symbol | |
ID | 6976075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2912768 |
End bp | 2914294 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643392160 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_002277001 |
Protein GI | 209544772 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.260435 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCCC ACCTGCTCTT TCGCGATCGG GACCTCGATC CGAAAGCCCC CCTGCCGCCG CAGGCGGACG CCCTGATCGA CGATTTGCGC CTGAACGTCA TATTCGACGC CATGGCGGCA GGCGACGAGC TGATTGCCCG GGTCAGCCCA CGCGTGATGC TGAACACCCT GGCCGACCCC GAAGACATCC GCTACCGGCA GGAAATCACA TCGGAAAGCC TCGACCGGGA GGACCTGGTC CGGGAACTGT ACAGCCTGGC CTCGGACGCG ATCGAGGCGG AACGCAAAAG CTATATTCAT GCCGGTTTTC GCAGTCCCGG CAGCATCGTA TTCGAATCCG TGTCGGTCCT GAAGCTGCTT CTGGGAACCC TGACGCGCCT GCGGGCCATC AGCGACGCGG AACGAGGCCG TTCAACGTCG CGCGGCCTGC GGACGCTGTT CGAAACGCTT TCCCGCGAAC TGGATGACCC TTTCTTCGAA CGCATTCGCG GCTATATCAG TGACCTGAGC CGGCCGCGCA TGCTGTTTAC CGCCCGGCTG GGCGTCGGGA ACAAGGCAAC CGACCATGTG CTGCGCAAGC CGCTGCCGCC GGAAGGCAAC TGGCTGGCAC GCGCCTTCGC GGGGAAGCCG GAAGGGTACA GCTTCCGGCT GAATGAACGC GACGAAAGCG GCGCACGCGC CCTGAGCGAC ATTCGCGATC GCGCGCTCAA TCGCGTGGCG GATGCGCTCG GACAAGGGAA GGACCACGTG CTTGCCTTCC TGAACGCGCT TCGGAACGAG CTGGCTTTCT ATGTCGGCGC GATCAATCTT CACGCGCGGC TGGTGGAACT GGCGCTGCCG ACCTGCCTTC CGGACTTCCA GTCTTCCGAA GACCATGATT TCGCGGCGAC CGGACTTTAT GACGTCGCCC TTGCACTGAC ATCGGACAGG CAGGTCGTGG GCAACGATAT CGACGCCACG GGCCCGCACC GTACCGTCAT CATCACCGGG GCCAACCAGG GGGGGAAAAC GACCTTCCTG CGCAGCGTCG GCCTGTCCTT CGTGATGGGA CAATGCGGGC TGTTCGCCGG AGCCGGGTCA CCGCGTACCG GCGCGGCCGG AAACGTGTTC ACGCATTTCA AGCGCGAGGA AGACCGCGCG CTCGAAAGCG GAAAATTCGA TGAGGAGCTG CATCGTATGA GCGTGCTGGT GGATCAGCTT CGGCCGCATT CGGTCATGTT GCTCAACGAA TCGTTTGCCT CGACCAATGA CCGGGAAGGT TCCGATATTT CCTACGAAAT CGTCAGCAGC CTTCAGGATG TCGGCGTCCG GGTGTTCTTC GTGACGCATC AGTACAGCTT CGCCCACCGG TTCTTCGCGA ACCATCGCGC CGATACGCTG TTCCTGCGGC CCGAAAGACT GGAGAACGGC ACCCGTACCT TCCGGCTCCG CCCCGGAGAG CCGGAGACGA CAAGCTACGG GCAGGATCTT TACGCGCGTA TTTTCGGGCA TGCCCTGCCC CGCCATGATA CGCGCCAGAC GCCCTGA
|
Protein sequence | MKAHLLFRDR DLDPKAPLPP QADALIDDLR LNVIFDAMAA GDELIARVSP RVMLNTLADP EDIRYRQEIT SESLDREDLV RELYSLASDA IEAERKSYIH AGFRSPGSIV FESVSVLKLL LGTLTRLRAI SDAERGRSTS RGLRTLFETL SRELDDPFFE RIRGYISDLS RPRMLFTARL GVGNKATDHV LRKPLPPEGN WLARAFAGKP EGYSFRLNER DESGARALSD IRDRALNRVA DALGQGKDHV LAFLNALRNE LAFYVGAINL HARLVELALP TCLPDFQSSE DHDFAATGLY DVALALTSDR QVVGNDIDAT GPHRTVIITG ANQGGKTTFL RSVGLSFVMG QCGLFAGAGS PRTGAAGNVF THFKREEDRA LESGKFDEEL HRMSVLVDQL RPHSVMLLNE SFASTNDREG SDISYEIVSS LQDVGVRVFF VTHQYSFAHR FFANHRADTL FLRPERLENG TRTFRLRPGE PETTSYGQDL YARIFGHALP RHDTRQTP
|
| |