Gene Gdia_1408 details


Gene Information

Locus tag: Gdia_1408
Symbol:
ID: 6974816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1571745
End bp: 1573664
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 75%
IMG OID: 643390938
Product: DNA mismatch repair protein MutL
Protein accession: YP_002275803
Protein GI: 209543574
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
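
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the gene record above.
START_BP = 1571745
END_BP = 1573664


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values match the record's Gene Length (1920 bp) and Protein Length (639 aa).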


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGCG ACTCGTCCAG CCCGCCCGCC CGTCCGGTCA TCCGGCGCCT GTCGGAACAG 
GTCGTCAACC GCATCGCCGC CGGCGAGGTC ATCGAACGGC CGGCCGCGGC GTTGAAGGAA
CTGGTCGAAA ACGCCATCGA TTCCGGGGCC CGGCGCATCG CCGTCCTGCT GGAGCGCGGC
GGTATCGAGC GGATCGAGGT CGTCGATGAC GGCTGCGGCA TGACGCCGGA CGACCTGCTG
CTGGCGGTCG AGCGGCATTG CACCTCCAAA TTGCGCGACG AGACGCTGGT GCGGATCGAA
ACGCTGGGCT TCCGGGGCGA GGCCCTGCCA TCGATCGGCG CCGCCGCGCG GCTGACCGTC
ACGTCCCGCC CCCATGGGGC CGACAGCGCG TGGCGCGTGT CGGTGGAAGG CGGGCTGGTG
GGCGCGCCCG CCCCCTGCGC CGGCCCCCCG GGCACCCGCG TGACGGTCGA GGACCTGTTC
TTCGCCACCC CGGCCCGGCG CAAATTCCTC AAGAGCCCCC GGGTCGAGGC CGGCCATGCC
GACATTACGG TGCGCCGCCT GGCCCTGTCG GTGCCGGACG TGGCGTTCCG CCTGCAACTG
GACGACCGGG TGGTGTTCGA CCTGCCGGCC CAGGACCTGG AATCCCGCGT CGCCGCAATT
CTCGAATCCG AGGGCGCGGA CGGCATGCTG CCGGTCGAAG GCCGGCGCGG CGACCTGGTG
CTGGACGGCT TCGCCTGCGG CCCCTCGGTA CACCGGGCCA CGGCGTCGGG GCAGATCCTG
CTGGTCAACG GCCGGCCGGT GGTCGACCCG GTGCTGCGCA CGGCGGTGCG CGTGGCCTAC
CGGCATGTGA TCGAACACGG CCGGCACGCG GTGGTCGCCC TGTCGCTGAC CATCCCGCCC
GACCTGGTTG ACGTGAACGT CCATCCGGCG AAGACCGAAC TGCGCTTCGC CGACCCCGCC
GCCGTGCGTG GGCTGGTCAT CGGCGCGCTG GGCCGTGCGC TGGGCAGCGG GGCCGGTGTC
GCGGGGGTGC GGCCCGGCCT GCTGCAATCG CGGCCCGCCA CGGCGTCGCG GATCTGGTAT
CCGTCCGCCG ACGCGCCATC GGCCGGCCTG GCGCCGGCCG CGGCGCCCGC GCCCGCGATC
TTCTCGGCCC CGGGCCGCGA CTCGCTGGCC GGCACGCGCC TGGATTTGGG CGCCCCGGCC
GCGCGGGTGC TGGACGACCC GCCCGCCGCC CCGCCGCCCG ATGCCGGTGC GGCGGGCGGG
GCAGGGGCCG GCGGCCCGGA CGCCGCGCCC CCGGACGCTG CCATTCTGGC CGATGCGGCC
GATTATCCGC TGGGGGCGTC GGTGGCGCAG GTCATGGGGA CCTATATCGT CGCGGTGTCC
GGCGACGGAT CGCTGGTGCT GGTGGACCAG CACGCGGCGC ACGAACGGCT GACCCATGAA
CGGCTGCGGG CGCGCTATCT GGACGGCACC CTGCGCGCCC AGCGCCTGCT GCTGCCGGAA
GTCGTCACCC TGCCGCGCGG CCAGGCCGAC CTGCTGCTGT CCTTCGCCGG CACGCTGGCG
GCGCTGGGGG TGGAAATCGA ACCGTTCGGC GGCGGCGCGG TACTGGTGCG CGCCCTGCCG
GCCCTGCTGG GCACGGACGA CCCCGCCGGC CTGCTGCGCG ACATGGCCGA CGAACTGGCC
GAGGACGACC TGGCCGACCC GGGCGACACC GGCGCCCTGG ACGGCAGGCT GGATGCCGTC
ATCGCCCGCA TGGCCTGCCA CGGCAGCGTC CGCGCCGGCC GCAGCCTGAC CCGCGCGGAA
ATGGACGCGC TGCTGCGCGA CATGGAACGG ACCCCCCGCG CCGGCACCTG CTCGCACGGG
CGGCCCACCT GGCTGAAGCT GAGCCGCACG GACCTGGAAA AACTCTTCGG CCGCAAATAG
 
Protein sequence
MMRDSSSPPA RPVIRRLSEQ VVNRIAAGEV IERPAAALKE LVENAIDSGA RRIAVLLERG 
GIERIEVVDD GCGMTPDDLL LAVERHCTSK LRDETLVRIE TLGFRGEALP SIGAAARLTV
TSRPHGADSA WRVSVEGGLV GAPAPCAGPP GTRVTVEDLF FATPARRKFL KSPRVEAGHA
DITVRRLALS VPDVAFRLQL DDRVVFDLPA QDLESRVAAI LESEGADGML PVEGRRGDLV
LDGFACGPSV HRATASGQIL LVNGRPVVDP VLRTAVRVAY RHVIEHGRHA VVALSLTIPP
DLVDVNVHPA KTELRFADPA AVRGLVIGAL GRALGSGAGV AGVRPGLLQS RPATASRIWY
PSADAPSAGL APAAAPAPAI FSAPGRDSLA GTRLDLGAPA ARVLDDPPAA PPPDAGAAGG
AGAGGPDAAP PDAAILADAA DYPLGASVAQ VMGTYIVAVS GDGSLVLVDQ HAAHERLTHE
RLRARYLDGT LRAQRLLLPE VVTLPRGQAD LLLSFAGTLA ALGVEIEPFG GGAVLVRALP
ALLGTDDPAG LLRDMADELA EDDLADPGDT GALDGRLDAV IARMACHGSV RAGRSLTRAE
MDALLRDMER TPRAGTCSHG RPTWLKLSRT DLEKLFGRK
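
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in which codons may act as start codons). The helper name `translate` is hypothetical, and only the first and last few codons of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Amino acid assignments for translation table 11 (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGATGCGCG ACTCGTCCAG CCCGCCCGCC"))
print(translate("CGCAAATAG"))
```

The first call yields the first ten residues of the protein sequence (MMRDSSSPPA), and the second shows the final two residues (RK) followed by the TAG stop codon, which is not represented in the protein.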