Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1922 |
Symbol | |
ID | 8137256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2229217 |
End bp | 2232180 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869536 |
Product | SMC domain protein |
Protein accession | YP_003021733 |
Protein GI | 253700544 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.664102 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATAA TCTCGGTCCA CCTGAAGAAC ATCAAGTCGC ACCGCGACAA GGAGATCGCC TTCTCTCCAG GCATCAACGT CCTCTCGGGC GCCAACGGCT CGGGCAAGAG CACCATCTTC GAGGCCATCG GTTACGCCCT CTTCGGCGTC TCCGCGCAGG ACTTCGTCTC CAAGGCGGAC CGCTTCCTCA CCATCGGCGC CAAAAAAGGT GAGATCAGTG TCGTCTTCGA ACCGGCCCCT GGTGAGTTGT ACCGGGTGAC CCGCACCGTG GGAGGCGCCG GCAAGTGGCT CCTGGCGAAA GATAACGGCG GCGGCTTCGA GATAGAGGAG CACGCGAACA TCCAGGAGAC CGAGCAGCGG ATCGCCTCGC TTTTGGGGCT TGCCACCGGC CGGCCGCTGG CGGAACAGTT CAAGCTGGTC ATCGGCCCCT TCCAGAACGA CTTCCTCGGC CCGTTCGTCA TCAAGCAGCC GACCAAACGC CAGGACGCCT TCGACGAGAT TCTCGGCATC GACACCTGGC GCAAGACCTT CGACGGCACC AAGACTCTCG CGAGCGCCAT CGTCGCCAAG ACCGAAACCA TCCAGGCGGA GGTCGCCGCG AAGATGGAAC AGGTTTCGGT GCTTCCGGCC AAGGAGCAGG AACTGCGCGA TCTGCGCGAG CAGGCGCTGG CGAAAGAGGC CCAGTTACGG CAAAAACTTG CCGAGCAGGA GCAGGTTAAC CAGCTTTTCG CCGGAATGGA GGCGCGGAAG GAGCGGATCG ATTCGGTCGG GCGCGAGGTG CAAGCCCTGG AAGAGCGGTC GATCTCCGGG AAGGAGTACA TCTCCACCCA GTTGCTGTTG GTCGAGCAAT CGAAAGCGGC GGCAGCGCTC GTCGCGGAAG CCGCCCAGGG GAAGCAGGCC TACGACGCAG CCGAAACAAA GCTCAGGCGT CTGCGCGATC AAGAACAGCA AAAGCTCGCC CTGGAGCGAA AGCTGGGAGA GTCGGATAAG GAACAGGCGG GCCTGCAAAG CGCGCTGGCG ATCGAGAACC GCGAACTGGC CGAGACGCGG ACCGCGATGG ATGGCGAAAA AGAGCGGCTG GCGCAGGAAA AGGACGCGCT CTCAAATGCC CTTCTTGCTC TGAAGAAGTC GGAAGCGGCA GCCGGGGAGG CTTTGGCGCT GGTGAACCGT TCTGCCGCGC AGTTCCGTGA GCTCCCCGTG CATCGCATAG AGCAGGCCTT GCCGTATCTC TTCGTCGCCC TGGACCGCAT GGCGGCCATA GACGAGCAGT CCACGGAGAG AAAGAAACTT GTCGCCGCCG GCGCAACGCT CAAGGCTGAA GCCTCAGAGT TAGCCGCCCG GCAGGCCCTT TTGGATAAGG TCCAGGCCGA GCGCTCCGAG CTGGCCGGGC GCAGGCTTTC GCTGGTGGAA GGGGAGGAAA AGCTGGGAGC GGGGGATTGC CCGTTCTTCC ACGAGCCGTG CCGGAACCTC GCCTCGGGTG ACGCTTCCGG CGTCTTCGAG GCGCGCATCG AACGGATCGA TGCCGAGATC TCGCGTCTCG ACCGGCAGGC GGTGGAACTC GCCGCCCAGG TTGCCGCGGC GCAGGCAGCG GCACGAGAAC TGGCGGGACT AGAGCAGGTG GCGAGCGAGC TGGAGAAGGC AGGGCGCGAG CGCCGGAAGC TGGAGCAGGA TTTCATCAAT GGTTTTTCCG AGATCGCGCC CGCCGTGCTG GTCCCAAGCC TCGGGACCTG GCTCGATTCT GCGGAACTGG GAATCGATCT GGCTGCGGAA CTTCCGGCCT TGCAGGTAGA GCTGGCCGCG GAGCCGCAGC AGCGAAAGGC TGCACTCTCC GAAGCGAGCG ACGCTTGGCG GCGTGTCGTT GTTTTGATCG AGGAAGGTCT GGATGCCAGG CTCAAGGAAG CTCAGGAGCC GGTTCAGGAG TGCGCGCTGA AACTGGCCCA GCTTGCCGAG AAAGGGGAGA GCCTCGCCGC AAAGGAGCGG GAACTCGCCG CCTCTGCCGA GAAGCTGGCC TTCCGCGAGC GGGCGATAGC CGATCATGGC AAGCGGTTGG CGGCTCTCCT CGAAGCCGTA TGCGCGGTGA AGACGGATCT CGCCGCTCAT GCCGGCCTCG ACCAGGCGAT AAAGGAGGCG GGAGAGGAAC TGGTTCGCTT CCAGGCAGAC CGCGACCGCT ACATCGCCAA CGAAAAAGCC GCGCAAGAAC TGGATAAGCG CCAGGAGACA CTGGCGAAAT ACCAGGCGAA GCTCAAGGAG ATAGAAGGGG CGCTTACCGC AAAAAGAGAG GAACTGCGGC AGTCGGTGGA GGGGTACGAC CAGGCGCGGC ACGAGGCCGT TAAACAGCAG CAGGTGGAGT TGGTCTCCGC AGTGGCGACG CTCGCCTCGG AGATCGGAGC GGTAGGCGAA GGCGTTACCC GTCTCGAAGG GGAGACCGCC GCACTGAAAT GCATTGCCGG GGAGATCGAG AAGAAGCTCG CCGCCGTCGA GGCGTTGAAG GAGCAGGGCG TTCTGGTCAA GTTCCTGCGC AACCAGGTGT TCAAGAACGT CTCCAGCCAG CTTTCCGAGC GCTTCAGGGA AGAGATCAGT TTTCGCGCCG ACCGGATCTA CCGCAGCATC TGCGCCTCGG ACGAGGAACT GGTCTGGGGC GAGAATTACC AGGTCGTGCT GAAGGACATG GCGGAGGGGC AGGTCAGGGA GAGAAGCGAC GACCAGCTCT CCGGCGGGCA GATGATGAGC GCCGTGGTGG CGCTGCGCCT TGCGCTTTTG CAGACCATCG GAGCGCGCAT CGCCTTCTTC GACGAGCCTA CTTCGAACCT GGACGCCGAG CGGCGGGAGA ATCTGGCGCG TGCCTTCCGC GCCATCGACG TCGGACAGGA GGAAGTCACC GAACACTGGT ACGACCAACT CTTCCTGGTC AGCCACGACG TGAGTTTCAC CGAGATTACG GATCAGACTA TCCAGCTCGA CTAG
|
Protein sequence | MRIISVHLKN IKSHRDKEIA FSPGINVLSG ANGSGKSTIF EAIGYALFGV SAQDFVSKAD RFLTIGAKKG EISVVFEPAP GELYRVTRTV GGAGKWLLAK DNGGGFEIEE HANIQETEQR IASLLGLATG RPLAEQFKLV IGPFQNDFLG PFVIKQPTKR QDAFDEILGI DTWRKTFDGT KTLASAIVAK TETIQAEVAA KMEQVSVLPA KEQELRDLRE QALAKEAQLR QKLAEQEQVN QLFAGMEARK ERIDSVGREV QALEERSISG KEYISTQLLL VEQSKAAAAL VAEAAQGKQA YDAAETKLRR LRDQEQQKLA LERKLGESDK EQAGLQSALA IENRELAETR TAMDGEKERL AQEKDALSNA LLALKKSEAA AGEALALVNR SAAQFRELPV HRIEQALPYL FVALDRMAAI DEQSTERKKL VAAGATLKAE ASELAARQAL LDKVQAERSE LAGRRLSLVE GEEKLGAGDC PFFHEPCRNL ASGDASGVFE ARIERIDAEI SRLDRQAVEL AAQVAAAQAA ARELAGLEQV ASELEKAGRE RRKLEQDFIN GFSEIAPAVL VPSLGTWLDS AELGIDLAAE LPALQVELAA EPQQRKAALS EASDAWRRVV VLIEEGLDAR LKEAQEPVQE CALKLAQLAE KGESLAAKER ELAASAEKLA FRERAIADHG KRLAALLEAV CAVKTDLAAH AGLDQAIKEA GEELVRFQAD RDRYIANEKA AQELDKRQET LAKYQAKLKE IEGALTAKRE ELRQSVEGYD QARHEAVKQQ QVELVSAVAT LASEIGAVGE GVTRLEGETA ALKCIAGEIE KKLAAVEALK EQGVLVKFLR NQVFKNVSSQ LSERFREEIS FRADRIYRSI CASDEELVWG ENYQVVLKDM AEGQVRERSD DQLSGGQMMS AVVALRLALL QTIGARIAFF DEPTSNLDAE RRENLARAFR AIDVGQEEVT EHWYDQLFLV SHDVSFTEIT DQTIQLD
|
| |