Gene Gdia_2645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2645 
Symbol 
ID6976075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2912768 
End bp2914294 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content63% 
IMG OID643392160 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_002277001 
Protein GI209544772 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.260435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCCC ACCTGCTCTT TCGCGATCGG GACCTCGATC CGAAAGCCCC CCTGCCGCCG 
CAGGCGGACG CCCTGATCGA CGATTTGCGC CTGAACGTCA TATTCGACGC CATGGCGGCA
GGCGACGAGC TGATTGCCCG GGTCAGCCCA CGCGTGATGC TGAACACCCT GGCCGACCCC
GAAGACATCC GCTACCGGCA GGAAATCACA TCGGAAAGCC TCGACCGGGA GGACCTGGTC
CGGGAACTGT ACAGCCTGGC CTCGGACGCG ATCGAGGCGG AACGCAAAAG CTATATTCAT
GCCGGTTTTC GCAGTCCCGG CAGCATCGTA TTCGAATCCG TGTCGGTCCT GAAGCTGCTT
CTGGGAACCC TGACGCGCCT GCGGGCCATC AGCGACGCGG AACGAGGCCG TTCAACGTCG
CGCGGCCTGC GGACGCTGTT CGAAACGCTT TCCCGCGAAC TGGATGACCC TTTCTTCGAA
CGCATTCGCG GCTATATCAG TGACCTGAGC CGGCCGCGCA TGCTGTTTAC CGCCCGGCTG
GGCGTCGGGA ACAAGGCAAC CGACCATGTG CTGCGCAAGC CGCTGCCGCC GGAAGGCAAC
TGGCTGGCAC GCGCCTTCGC GGGGAAGCCG GAAGGGTACA GCTTCCGGCT GAATGAACGC
GACGAAAGCG GCGCACGCGC CCTGAGCGAC ATTCGCGATC GCGCGCTCAA TCGCGTGGCG
GATGCGCTCG GACAAGGGAA GGACCACGTG CTTGCCTTCC TGAACGCGCT TCGGAACGAG
CTGGCTTTCT ATGTCGGCGC GATCAATCTT CACGCGCGGC TGGTGGAACT GGCGCTGCCG
ACCTGCCTTC CGGACTTCCA GTCTTCCGAA GACCATGATT TCGCGGCGAC CGGACTTTAT
GACGTCGCCC TTGCACTGAC ATCGGACAGG CAGGTCGTGG GCAACGATAT CGACGCCACG
GGCCCGCACC GTACCGTCAT CATCACCGGG GCCAACCAGG GGGGGAAAAC GACCTTCCTG
CGCAGCGTCG GCCTGTCCTT CGTGATGGGA CAATGCGGGC TGTTCGCCGG AGCCGGGTCA
CCGCGTACCG GCGCGGCCGG AAACGTGTTC ACGCATTTCA AGCGCGAGGA AGACCGCGCG
CTCGAAAGCG GAAAATTCGA TGAGGAGCTG CATCGTATGA GCGTGCTGGT GGATCAGCTT
CGGCCGCATT CGGTCATGTT GCTCAACGAA TCGTTTGCCT CGACCAATGA CCGGGAAGGT
TCCGATATTT CCTACGAAAT CGTCAGCAGC CTTCAGGATG TCGGCGTCCG GGTGTTCTTC
GTGACGCATC AGTACAGCTT CGCCCACCGG TTCTTCGCGA ACCATCGCGC CGATACGCTG
TTCCTGCGGC CCGAAAGACT GGAGAACGGC ACCCGTACCT TCCGGCTCCG CCCCGGAGAG
CCGGAGACGA CAAGCTACGG GCAGGATCTT TACGCGCGTA TTTTCGGGCA TGCCCTGCCC
CGCCATGATA CGCGCCAGAC GCCCTGA
 
Protein sequence
MKAHLLFRDR DLDPKAPLPP QADALIDDLR LNVIFDAMAA GDELIARVSP RVMLNTLADP 
EDIRYRQEIT SESLDREDLV RELYSLASDA IEAERKSYIH AGFRSPGSIV FESVSVLKLL
LGTLTRLRAI SDAERGRSTS RGLRTLFETL SRELDDPFFE RIRGYISDLS RPRMLFTARL
GVGNKATDHV LRKPLPPEGN WLARAFAGKP EGYSFRLNER DESGARALSD IRDRALNRVA
DALGQGKDHV LAFLNALRNE LAFYVGAINL HARLVELALP TCLPDFQSSE DHDFAATGLY
DVALALTSDR QVVGNDIDAT GPHRTVIITG ANQGGKTTFL RSVGLSFVMG QCGLFAGAGS
PRTGAAGNVF THFKREEDRA LESGKFDEEL HRMSVLVDQL RPHSVMLLNE SFASTNDREG
SDISYEIVSS LQDVGVRVFF VTHQYSFAHR FFANHRADTL FLRPERLENG TRTFRLRPGE
PETTSYGQDL YARIFGHALP RHDTRQTP