Gene Glov_2782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGlov_2782 
Symbol 
ID6368181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter lovleyi SZ 
KingdomBacteria 
Replicon accessionNC_010814 
Strand
Start bp2989165 
End bp2991522 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content60% 
IMG OID642678198 
ProductMutS2 family protein 
Protein accessionYP_001953015 
Protein GI189425838 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.798995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCCAC ACACGACCCT TAGCCGCCTT GAATTCAACA AAGTCCTGCA GTCCATTGCG 
AGCCATGCAC GCAGTGATAT CACTGCACAC AGCATCATGG CCATGCAGCC CTCCCGGCAG
CCCCAGGAGA TCCGCACCAG CTGGCAGCGT ATCGAAGAGA TCAGGGACCT GCTACGCCAA
CGGATCTGTC TGCGGATCAG CCGTTTCAGC GACATCCGCC CGCTGCTTGA AGCGGTCCGT
CCCAGTGGTG CCATCCTCTC CCCCTTTGAA CTGCTGGAAT TCATCCCGGT ACTCGGTTCG
CTGGCCGAGC TTGCCAGACA GCTGGCCCCG CGTGAGGATA TCCCGGCCCT GAAGCTGCTG
TCCCCCTTTC CGGTGGCATT CAACGACATC CTTGAGCCGC TTGCCGCCAC CCTGGATGAC
GAAGGCAACA TCCTGGACAG CGCCTCGCAG GAACTGTCCC AGATTCGCAA GGCCAAGCGC
ACCCTGGCGG CACGAGTCCG CAAAAAGCTG GAAGAATTTG TCCGCAGGCA TGAGACCGCC
ATCTTTTTGC AGGATGACTT CATCACCATC CGCTCCGGCC GCTGGGTCAT TCCGGTCAGG
ATGGACTCAA AGGGGATGGT ACCGGGAGTG GTGCATGACG TCTCATCCTC CGGTGAGACC
GCCTTCATGG AGCCGCTGGA GATCATCCCC TTTGTCAACG AGCTTGAAAA CCTCTCTGCC
GAAGAAAAGG CCGAAGAGAT CCGCATCCTG CGGCGCCTCT CAGCCTGGAT TCGTGAGGAT
GCCGAACAGA TCGGCGCCTG CTTCAAGAGC CTTGTCGAGC TGGACCGTCT GGATAGTGTG
GCGGCGTTTG CCGAAAAATT CAGCATGTCG GTCCCGGAGC TGAACCAGAA CGGCTCTCTG
CGCCTGCTAT CAGCCCGTCA CCCGCTCTTG CTGGTGATGC GGGAGCAACA GCAGGATACT
ACGCCGATTG TGCCGCTGGA CCTGGAGCTG GGCAACGAAA CCAGAGTGTT GACAATCAGC
GGCCCCAACG CGGGCGGCAA AACGATCGCA CTGAAGACCG TCGGTTTAAT CACTGCCATG
GCCCTGTCCG GGATGCCGGT ACCTGCCTCA CCCTCCTCAT CTATCCCGTT GCTGGATGCC
CTGCTGGTGG ATATTGGCGA TGATCAGTCG ATTGAACAGA GCCTTTCAAC CTTTTCAGCC
CATGTTGCGG CCATAGCCGG CATTTTGGGC CAGACCGGCA GCCGCAGTCT GGTACTGCTG
GATGAGCTGG GGACCGGTAC CGAGCCGCTG CAAGGGGCCG CCATCGGCTG CGCGGTGCTA
CATGAGCTGC AAAGCCGCGG CGCGCTGGTG CTTGCCACAA CCCACCTGAC CGAGATTGTC
GGTTTTGTGC AGCGCAGCCA AGGGATGCAG AACGCCGGAA TGGAGTTTGA CAGTGCCACC
TGGACCCCGC TCTACCGCCT GGTGATGGGT GAACCGGGGC AGTCCCATGC CCTGGAAACC
GCCCGGCGCT ACGGACTGCC AGAATCTGTG CTGCAGTTTG CCCGTAACCT GCTGGGAGAC
GCCGGCACCG CCTTTGCCGG TATCATTGAC GAACTGCGTC AGAAACGGAA CGCCCTGGCA
GATGAACTGG ACAGACAGCA GCAGGAGCGC CAGCGGCTTG ATGGTCTGGC TATGGCATTG
AAGCAGCAGG AGGCTGATCT TGTCCGTCTG AGACAGGAAA CCATTGAAAA AGCCCGCCAG
GATGCACGGG ACACCATCAC CGCTGCACGG CGCGAGATGA ATCAGTTACT GGAGCAATTC
AAGCAGGATC GCCGCAAGGA AACAGAGGTA AAATTCAGGC AAAAGGCAGA AGAACTGGAG
GCCAGCTTTG CTCCGACCGG GCAGCAGGCT CCATCCGCTG ATGCGCTACA GCCAGGCAGC
ATCGTGCAGG TGCGCTCTCT TGGGCGGGAA GCCACCGTCA TCAGCATTGA CCAGGCCCGT
CACAAGGTGC GGGTCAGGGC AGGCAGCATT GAGATGGAGG TACCGCTGCA CGGCCTGATT
ATCGGCACTA CTACAGCTGC CCCCAAACCA CGCAAAAATG TGCCTGAGAT CAACATGAAG
CGACAGACTG AAGAGGCCGC CAACGATCTG AACCTGATCG GCAAACGGGT GGAAGAGGCG
TTGATTGAGC TGGAAGGCTT CATTGACCAG GCAATCTTGG CCGGGCAGCG GGAAATCAGG
ATTGTCCACG GTATCGGCAC CGGCACCCTG CAGCGGGCCG TGCGGGAATT TCTGGGTCGC
CATCCCCAGG TGGCAGCATT CCGCCCCGGA GAACCCCACG AAGGCCGGGA TGGCGTCACC
ATAGCGGAGT TGGCTTGA
 
Protein sequence
MIPHTTLSRL EFNKVLQSIA SHARSDITAH SIMAMQPSRQ PQEIRTSWQR IEEIRDLLRQ 
RICLRISRFS DIRPLLEAVR PSGAILSPFE LLEFIPVLGS LAELARQLAP REDIPALKLL
SPFPVAFNDI LEPLAATLDD EGNILDSASQ ELSQIRKAKR TLAARVRKKL EEFVRRHETA
IFLQDDFITI RSGRWVIPVR MDSKGMVPGV VHDVSSSGET AFMEPLEIIP FVNELENLSA
EEKAEEIRIL RRLSAWIRED AEQIGACFKS LVELDRLDSV AAFAEKFSMS VPELNQNGSL
RLLSARHPLL LVMREQQQDT TPIVPLDLEL GNETRVLTIS GPNAGGKTIA LKTVGLITAM
ALSGMPVPAS PSSSIPLLDA LLVDIGDDQS IEQSLSTFSA HVAAIAGILG QTGSRSLVLL
DELGTGTEPL QGAAIGCAVL HELQSRGALV LATTHLTEIV GFVQRSQGMQ NAGMEFDSAT
WTPLYRLVMG EPGQSHALET ARRYGLPESV LQFARNLLGD AGTAFAGIID ELRQKRNALA
DELDRQQQER QRLDGLAMAL KQQEADLVRL RQETIEKARQ DARDTITAAR REMNQLLEQF
KQDRRKETEV KFRQKAEELE ASFAPTGQQA PSADALQPGS IVQVRSLGRE ATVISIDQAR
HKVRVRAGSI EMEVPLHGLI IGTTTAAPKP RKNVPEINMK RQTEEAANDL NLIGKRVEEA
LIELEGFIDQ AILAGQREIR IVHGIGTGTL QRAVREFLGR HPQVAAFRPG EPHEGRDGVT
IAELA