Gene GSU2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2001 
SymbolmutL 
ID2688110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2192669 
End bp2194489 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content66% 
IMG OID637126692 
ProductDNA mismatch repair protein 
Protein accessionNP_953050 
Protein GI39997099 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0369313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACACC GGATCAGGAT ACTGCCCGAG ATACTCACCA ACAAGATCGC TGCCGGCGAG 
GTGGTGGAAC GCCCCGCATC GGTGGTGAAG GAACTGGTGG AGAACGCCCT GGATGCCGGC
TGCGGCGAGA TAATCGTAGA GATCGAGGGG GGAGGGCGGC GCCTCATCCG GATCACCGAC
GACGGCTGCG GCATGTCGCG GGAAGACGCC CTCATGGCCC TTGAGCGTCA CGCCACCAGC
AAGATCGCCA CGGACGACGA CCTGTTCTCC CTGGCCACGC TCGGCTTCCG CGGAGAGGCG
CTCCCCTCCG TGGCCTCGGT CTCCCGCTTC ACCCTCGCCA CCCGCGAGCG GGGGAGCATC
GAAGGAACCG AGATTTACGC CGAAGGGGGA AAGATCCGCG AGGTGAAGGC CTGCGGCATG
GCCGAGGGGA CCGTGGTCTC GGTCCGCAAT CTCTTCTTCA ATACTCCGGC CCGTCTCAAG
TTCATGAAGA GTGTTGAGAC CGAGGGGGGC CACGTGGCAG ACCTGGTCAC CCGGCTGGCC
CTTTCCCGCC CGGAGGTTCG CTTCACCTGC GTGAGCGACG GCAAGACCCT GTTCCGGGCC
CTGGACGGCA CCCTGCTGGA CCGGGTGGCC GCACTTCTGG GCAAGACCGT TGCCTCAGCC
CTCCACCCCG TGGATCTGGC CACCGAGGGG GTGCGGGTGA CCGGACTCGT GGCTCGCCCC
GATGTGAGCC GTTCCGCGGC ATCGCACCTC TACACCTACA TCAATGGTCG TTTCATCCGC
GACCGCGTCG TGCAGCACGC CATACTCCAG GCCTATCGCA ACTTCCTGGA GCGGGGGCGC
TACCCGGTGG TGGTTCTCTT TATCGAGGTT TCTCCCGGCG AAGTAGACGT GAACGTCCAT
CCTACCAAGC ACGAGGTCCG CTTCAGGCAG CAGGGGATCG TGCACGATGT GATCCAGGGG
GCCGTGGAGG AGACGCTCCG GCTCACGCCC TGGATTCGCC GCTCCGAGTC ACCCGTTGCT
GCGCCGCTGG CGGTGCCCCG GCCGCAACAA TACGTTCAGG GCACGGGTGC TCGTCAGGTG
GAAGAAGTGC GGGAGCTTCT GGAAAACTAC CGTCCCGCCG TGGCTCCCCA TCGGCCGCTC
TTCGCGCCAC AACCCGCACC GCAGCCGGAT CGGGAGCCGC CTTTGCCCGA TTCTGGCAGC
CGGATGCTGG ACGACACGGC CGTGCGTCGT CACGGGGGCT ATTTCTCTTC CCTTGCCGTC
ATCGGCCAGT ACAACGCATC CTACATCCTG TGCCAGGATG GCTCTGATCT GGTCATCATC
GACCAGCATG CGGCCCACGA GCGGGTCGCC TTCGAGCGCC TCAAGACGCA GTTCGCTGCC
GGCCGGGTCG AGGGGCAGGG GCTTCTTTTC CCCGAGACCG TGGAGCTGTC TCACCGGGAA
TCGGCGGTGG TGCGGGAACA CGGCGGCGAA CTGGGCCGGC TCGGTTTCGA TCTAGAGGAT
TTCGGGGGTA CTACCTGGAT CGTAAAGGGG ATTCCGCGCC TGCTGGCAGG CACCGACTAC
CTGCGCCTGC TACGCGACAC CCTTGAGGAG CTCCAGTCCC TCGGAGCCAG CCGCAGCATC
GCCGATGCCG TGGAGGATAT CCTGACCCGC GTCGCCTGCC ACAGCGTTGT GCGGGGCGAG
CACCCCCTGA CCACCGGCGA GATTGAGGCC CTCTTCGCCC ATATGGACGC CACCGACTTT
TCCACCAATT GCCCCCATGG CAGGCCCGTC CTCCAGCGAC TTACGCTGGG GGAGGTGGAG
AAGATGTTCA AGAGAGTGTG A
 
Protein sequence
MPHRIRILPE ILTNKIAAGE VVERPASVVK ELVENALDAG CGEIIVEIEG GGRRLIRITD 
DGCGMSREDA LMALERHATS KIATDDDLFS LATLGFRGEA LPSVASVSRF TLATRERGSI
EGTEIYAEGG KIREVKACGM AEGTVVSVRN LFFNTPARLK FMKSVETEGG HVADLVTRLA
LSRPEVRFTC VSDGKTLFRA LDGTLLDRVA ALLGKTVASA LHPVDLATEG VRVTGLVARP
DVSRSAASHL YTYINGRFIR DRVVQHAILQ AYRNFLERGR YPVVVLFIEV SPGEVDVNVH
PTKHEVRFRQ QGIVHDVIQG AVEETLRLTP WIRRSESPVA APLAVPRPQQ YVQGTGARQV
EEVRELLENY RPAVAPHRPL FAPQPAPQPD REPPLPDSGS RMLDDTAVRR HGGYFSSLAV
IGQYNASYIL CQDGSDLVII DQHAAHERVA FERLKTQFAA GRVEGQGLLF PETVELSHRE
SAVVREHGGE LGRLGFDLED FGGTTWIVKG IPRLLAGTDY LRLLRDTLEE LQSLGASRSI
ADAVEDILTR VACHSVVRGE HPLTTGEIEA LFAHMDATDF STNCPHGRPV LQRLTLGEVE
KMFKRV