Gene GM21_1922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1922 
Symbol 
ID8137256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2229217 
End bp2232180 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content63% 
IMG OID644869536 
ProductSMC domain protein 
Protein accessionYP_003021733 
Protein GI253700544 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.664102 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATAA TCTCGGTCCA CCTGAAGAAC ATCAAGTCGC ACCGCGACAA GGAGATCGCC 
TTCTCTCCAG GCATCAACGT CCTCTCGGGC GCCAACGGCT CGGGCAAGAG CACCATCTTC
GAGGCCATCG GTTACGCCCT CTTCGGCGTC TCCGCGCAGG ACTTCGTCTC CAAGGCGGAC
CGCTTCCTCA CCATCGGCGC CAAAAAAGGT GAGATCAGTG TCGTCTTCGA ACCGGCCCCT
GGTGAGTTGT ACCGGGTGAC CCGCACCGTG GGAGGCGCCG GCAAGTGGCT CCTGGCGAAA
GATAACGGCG GCGGCTTCGA GATAGAGGAG CACGCGAACA TCCAGGAGAC CGAGCAGCGG
ATCGCCTCGC TTTTGGGGCT TGCCACCGGC CGGCCGCTGG CGGAACAGTT CAAGCTGGTC
ATCGGCCCCT TCCAGAACGA CTTCCTCGGC CCGTTCGTCA TCAAGCAGCC GACCAAACGC
CAGGACGCCT TCGACGAGAT TCTCGGCATC GACACCTGGC GCAAGACCTT CGACGGCACC
AAGACTCTCG CGAGCGCCAT CGTCGCCAAG ACCGAAACCA TCCAGGCGGA GGTCGCCGCG
AAGATGGAAC AGGTTTCGGT GCTTCCGGCC AAGGAGCAGG AACTGCGCGA TCTGCGCGAG
CAGGCGCTGG CGAAAGAGGC CCAGTTACGG CAAAAACTTG CCGAGCAGGA GCAGGTTAAC
CAGCTTTTCG CCGGAATGGA GGCGCGGAAG GAGCGGATCG ATTCGGTCGG GCGCGAGGTG
CAAGCCCTGG AAGAGCGGTC GATCTCCGGG AAGGAGTACA TCTCCACCCA GTTGCTGTTG
GTCGAGCAAT CGAAAGCGGC GGCAGCGCTC GTCGCGGAAG CCGCCCAGGG GAAGCAGGCC
TACGACGCAG CCGAAACAAA GCTCAGGCGT CTGCGCGATC AAGAACAGCA AAAGCTCGCC
CTGGAGCGAA AGCTGGGAGA GTCGGATAAG GAACAGGCGG GCCTGCAAAG CGCGCTGGCG
ATCGAGAACC GCGAACTGGC CGAGACGCGG ACCGCGATGG ATGGCGAAAA AGAGCGGCTG
GCGCAGGAAA AGGACGCGCT CTCAAATGCC CTTCTTGCTC TGAAGAAGTC GGAAGCGGCA
GCCGGGGAGG CTTTGGCGCT GGTGAACCGT TCTGCCGCGC AGTTCCGTGA GCTCCCCGTG
CATCGCATAG AGCAGGCCTT GCCGTATCTC TTCGTCGCCC TGGACCGCAT GGCGGCCATA
GACGAGCAGT CCACGGAGAG AAAGAAACTT GTCGCCGCCG GCGCAACGCT CAAGGCTGAA
GCCTCAGAGT TAGCCGCCCG GCAGGCCCTT TTGGATAAGG TCCAGGCCGA GCGCTCCGAG
CTGGCCGGGC GCAGGCTTTC GCTGGTGGAA GGGGAGGAAA AGCTGGGAGC GGGGGATTGC
CCGTTCTTCC ACGAGCCGTG CCGGAACCTC GCCTCGGGTG ACGCTTCCGG CGTCTTCGAG
GCGCGCATCG AACGGATCGA TGCCGAGATC TCGCGTCTCG ACCGGCAGGC GGTGGAACTC
GCCGCCCAGG TTGCCGCGGC GCAGGCAGCG GCACGAGAAC TGGCGGGACT AGAGCAGGTG
GCGAGCGAGC TGGAGAAGGC AGGGCGCGAG CGCCGGAAGC TGGAGCAGGA TTTCATCAAT
GGTTTTTCCG AGATCGCGCC CGCCGTGCTG GTCCCAAGCC TCGGGACCTG GCTCGATTCT
GCGGAACTGG GAATCGATCT GGCTGCGGAA CTTCCGGCCT TGCAGGTAGA GCTGGCCGCG
GAGCCGCAGC AGCGAAAGGC TGCACTCTCC GAAGCGAGCG ACGCTTGGCG GCGTGTCGTT
GTTTTGATCG AGGAAGGTCT GGATGCCAGG CTCAAGGAAG CTCAGGAGCC GGTTCAGGAG
TGCGCGCTGA AACTGGCCCA GCTTGCCGAG AAAGGGGAGA GCCTCGCCGC AAAGGAGCGG
GAACTCGCCG CCTCTGCCGA GAAGCTGGCC TTCCGCGAGC GGGCGATAGC CGATCATGGC
AAGCGGTTGG CGGCTCTCCT CGAAGCCGTA TGCGCGGTGA AGACGGATCT CGCCGCTCAT
GCCGGCCTCG ACCAGGCGAT AAAGGAGGCG GGAGAGGAAC TGGTTCGCTT CCAGGCAGAC
CGCGACCGCT ACATCGCCAA CGAAAAAGCC GCGCAAGAAC TGGATAAGCG CCAGGAGACA
CTGGCGAAAT ACCAGGCGAA GCTCAAGGAG ATAGAAGGGG CGCTTACCGC AAAAAGAGAG
GAACTGCGGC AGTCGGTGGA GGGGTACGAC CAGGCGCGGC ACGAGGCCGT TAAACAGCAG
CAGGTGGAGT TGGTCTCCGC AGTGGCGACG CTCGCCTCGG AGATCGGAGC GGTAGGCGAA
GGCGTTACCC GTCTCGAAGG GGAGACCGCC GCACTGAAAT GCATTGCCGG GGAGATCGAG
AAGAAGCTCG CCGCCGTCGA GGCGTTGAAG GAGCAGGGCG TTCTGGTCAA GTTCCTGCGC
AACCAGGTGT TCAAGAACGT CTCCAGCCAG CTTTCCGAGC GCTTCAGGGA AGAGATCAGT
TTTCGCGCCG ACCGGATCTA CCGCAGCATC TGCGCCTCGG ACGAGGAACT GGTCTGGGGC
GAGAATTACC AGGTCGTGCT GAAGGACATG GCGGAGGGGC AGGTCAGGGA GAGAAGCGAC
GACCAGCTCT CCGGCGGGCA GATGATGAGC GCCGTGGTGG CGCTGCGCCT TGCGCTTTTG
CAGACCATCG GAGCGCGCAT CGCCTTCTTC GACGAGCCTA CTTCGAACCT GGACGCCGAG
CGGCGGGAGA ATCTGGCGCG TGCCTTCCGC GCCATCGACG TCGGACAGGA GGAAGTCACC
GAACACTGGT ACGACCAACT CTTCCTGGTC AGCCACGACG TGAGTTTCAC CGAGATTACG
GATCAGACTA TCCAGCTCGA CTAG
 
Protein sequence
MRIISVHLKN IKSHRDKEIA FSPGINVLSG ANGSGKSTIF EAIGYALFGV SAQDFVSKAD 
RFLTIGAKKG EISVVFEPAP GELYRVTRTV GGAGKWLLAK DNGGGFEIEE HANIQETEQR
IASLLGLATG RPLAEQFKLV IGPFQNDFLG PFVIKQPTKR QDAFDEILGI DTWRKTFDGT
KTLASAIVAK TETIQAEVAA KMEQVSVLPA KEQELRDLRE QALAKEAQLR QKLAEQEQVN
QLFAGMEARK ERIDSVGREV QALEERSISG KEYISTQLLL VEQSKAAAAL VAEAAQGKQA
YDAAETKLRR LRDQEQQKLA LERKLGESDK EQAGLQSALA IENRELAETR TAMDGEKERL
AQEKDALSNA LLALKKSEAA AGEALALVNR SAAQFRELPV HRIEQALPYL FVALDRMAAI
DEQSTERKKL VAAGATLKAE ASELAARQAL LDKVQAERSE LAGRRLSLVE GEEKLGAGDC
PFFHEPCRNL ASGDASGVFE ARIERIDAEI SRLDRQAVEL AAQVAAAQAA ARELAGLEQV
ASELEKAGRE RRKLEQDFIN GFSEIAPAVL VPSLGTWLDS AELGIDLAAE LPALQVELAA
EPQQRKAALS EASDAWRRVV VLIEEGLDAR LKEAQEPVQE CALKLAQLAE KGESLAAKER
ELAASAEKLA FRERAIADHG KRLAALLEAV CAVKTDLAAH AGLDQAIKEA GEELVRFQAD
RDRYIANEKA AQELDKRQET LAKYQAKLKE IEGALTAKRE ELRQSVEGYD QARHEAVKQQ
QVELVSAVAT LASEIGAVGE GVTRLEGETA ALKCIAGEIE KKLAAVEALK EQGVLVKFLR
NQVFKNVSSQ LSERFREEIS FRADRIYRSI CASDEELVWG ENYQVVLKDM AEGQVRERSD
DQLSGGQMMS AVVALRLALL QTIGARIAFF DEPTSNLDAE RRENLARAFR AIDVGQEEVT
EHWYDQLFLV SHDVSFTEIT DQTIQLD