Gene GSU0547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0547 
Symbol 
ID2685958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp579087 
End bp581465 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content68% 
IMG OID637125213 
ProductMutS2 family protein 
Protein accessionNP_951605 
Protein GI39995654 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAGAA CCGAAACCCT CCGTACCCTC GAATTCGATA AAATACTTTC GGCCGTGTCG 
GGCTACGCCC ACAGCGGCGC CACGCGGGAC GAAACCGCCC TGATCCGGCC CCTGGACGAC
CGGGAGGCCA TTGTCCGGCG CTTCGGCCAG GTGGACGAGA TCCGGCGGCT GCGCCAGCTG
GGGATCGATC TTTCCCTCCG CTCCTTCGAG GACATCGCAC CGCTGCTGGC GGCAGTCCGC
CCCGACGGCG CGGTGCTGGA CCCCACGGAG CTGGTGGTCC TGTTCCCGAT CCTGCGGACC
ATGACCGCCA TTGCCAAGCA GTTCGCCTAC CGGACCGACA TCCCCCTGCT GCGGGAACTG
GCCGGCACCC TGACCGGCTT CCCCGACCTC CTGGACGAGC TGGAGGTATC CATCGACTCC
GAGGGGGAGA TCCTCGATTC CGCCTCGCCG CTCCTCTCGG ACCTGCGCCA GAAGAAGCGC
CACCTCACCG AGCGGATCAG GCGGCGGCTG GCCGAGATCG TCCGCGAGAC CGGGGTCACC
ACCTTCCTCC AGGACGATTT CATCACCCAG CGGGGCGGGC GGTGGGTGAT CCCGGTGCGG
ATGGATTCCA AGGGAATGGT CCCCGGCGTG GTGCACGACG TCTCCAATTC AGGCGAGACC
GCCTTCATGG AGCCCCTGGA GATCATCGGC CTGGCCAACG AGCTGGAAAA CCTGGTGGCT
GAGGAAAAGG CCGAGATGAT CCGCATCGTC CGGACCATCT GCCGGATGAT CCGCCAGGAA
GCCGACGGCC TGGATGAGCA GTTCCGCATT CTGGTCCGGC TTGACGTGCT GAACGGCATT
GCCTTGTTCG CCGATTCCCT CGGCGCCGAG ACCCCCGAGA TCACCGACGC CCGCTTCATT
CGGGTCCGGG AGGGGCGCCA CCCGCTCCTG GCGCTCATGG CGCGGGAACG GGGGGTCGGC
CGGGTGGTGC CGCTGGACCT GGGGCTGGGG GGAGCTGAAC GTCCTGATTC GCATGTGGCG
AATCAGGTCA TGGTCATCAC CGGGCCCAAC GCGGGAGGCA AAACCATCTC CCTCAAGACC
ACCGGCCTCC TTCACCTCAT GGCCCTGGCG GGGATTCCGG TGCCGGCGGC CTCCACTTCG
TCGTTCCCCC TGATCTCCGA CCTCCTGGTG GACATCGGCG ACGAGCAGTC CATCGAGCAG
AGCCTTTCCA CCTTCTCGGC CCACGTCTCC AACATCGCCG GCATCCTGGA GCGGGCCGAC
CGCCGCACCG TGGTGCTGCT GGACGAACTG GGCACCGGCA CCGAGCCGGT CCAGGGGGCG
GCCATCTCCT GCGCCGTCCT GGCCGACCTG CAGGACAAGG GGGCGCTGGT CATCGCCACC
ACCCACCTGA CCGACATCGT GGGCTTCGTC CACAAACGGG ACGGCATGGT CAACGCTTCC
ATGGAGTTCG ACCGCCAGAC CCTCACCCCC CTCTACCGTC TCACCGTGGG CGAGCCGGGC
CAGTCCCACG CCCTGGAGAT CGCTCGCCGG TATGGCCTTC CCGACCGGGT CGTGGCCGTG
GCCACCGGCA TGCTCTCTCG CATGGAGACT GAATTCCACG AACTGCTGGC CGAGCTCAAG
GACCAGCGCC GGCGCCACGA AGAGGCCCTG GCCGAGGCGG AACGGCTCCG GCGCGATGCC
GAGGAGAAGG CCCGCATCGC CCGCGAGCGG CTGGCCGAGG CCGAGACCCG GCGGCGCGAG
GCAACCGAGA AGGCCCTCCA GGAGGCAAAG GAGATCGTCC GGGCCGCCCG GCGCGACGTG
AACGCCATCA TCGAGGAAGC CCGCAGGGAG AAGAGCAGGG AGGCCCGGAA GAAGATCGAC
GAGGCCGAGG CCGCGGTGGA GGCGAAGCTC CAGGAGTTCC ACCCCGAGGA GACCCTTTCC
CTGGATGCCG TCCGCGAGGG CGACACTGTC TTCGTCAAGG CCATCGGCCA CGACGGCACC
GTCACCGCCG TGGACCGCCG GACCGGCCGG CTCCGGGTGC GGGCCGGCGC CATGGAACTG
GAAGTGGCGG CCACGGACGT TTCCCCGCGC CGGGGCAAGG CCACCGAGGC CAAAATCCGC
ACCGGCTCGG GCAGAAGACC GGCGCCGGAT GCGGAGACCC CCCGCGAAAT CAACCTGATC
GGCCTGCGAG TGGACGACGC CCTGGCGCGG TTGGAGCCCT TCCTGAACCA CGCCTCCCTG
GAAGGATACG GCGAGCTGCG GATTGTCCAC GGCAAGGGGA CCGGCGCCCT GATGCGGGCT
GTGCGCGAAT ACCTTGACGG TCACCCCCTG GTGCGCGAGT TTCGTCCCGG CGAGCCCTTC
GAAGGTGGCG AGGGCGCCAC GGTGGTGACG CTGCGGTAA
 
Protein sequence
MIRTETLRTL EFDKILSAVS GYAHSGATRD ETALIRPLDD REAIVRRFGQ VDEIRRLRQL 
GIDLSLRSFE DIAPLLAAVR PDGAVLDPTE LVVLFPILRT MTAIAKQFAY RTDIPLLREL
AGTLTGFPDL LDELEVSIDS EGEILDSASP LLSDLRQKKR HLTERIRRRL AEIVRETGVT
TFLQDDFITQ RGGRWVIPVR MDSKGMVPGV VHDVSNSGET AFMEPLEIIG LANELENLVA
EEKAEMIRIV RTICRMIRQE ADGLDEQFRI LVRLDVLNGI ALFADSLGAE TPEITDARFI
RVREGRHPLL ALMARERGVG RVVPLDLGLG GAERPDSHVA NQVMVITGPN AGGKTISLKT
TGLLHLMALA GIPVPAASTS SFPLISDLLV DIGDEQSIEQ SLSTFSAHVS NIAGILERAD
RRTVVLLDEL GTGTEPVQGA AISCAVLADL QDKGALVIAT THLTDIVGFV HKRDGMVNAS
MEFDRQTLTP LYRLTVGEPG QSHALEIARR YGLPDRVVAV ATGMLSRMET EFHELLAELK
DQRRRHEEAL AEAERLRRDA EEKARIARER LAEAETRRRE ATEKALQEAK EIVRAARRDV
NAIIEEARRE KSREARKKID EAEAAVEAKL QEFHPEETLS LDAVREGDTV FVKAIGHDGT
VTAVDRRTGR LRVRAGAMEL EVAATDVSPR RGKATEAKIR TGSGRRPAPD AETPREINLI
GLRVDDALAR LEPFLNHASL EGYGELRIVH GKGTGALMRA VREYLDGHPL VREFRPGEPF
EGGEGATVVT LR