Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0189 |
Symbol | |
ID | 8135492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 223311 |
End bp | 226154 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644867808 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003020032 |
Protein GI | 253698843 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 1.0119000000000001e-25 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAACGG ACAAGATCAT CATCAAAGGT GCCTGCGAGC ACAACCTCAA ATGCATAGAC GTGGAAATTC CCCGCGACAA GTTGGTGGTG ATCACCGGCA TCTCGGGGTC TGGAAAATCC ACCCTCGCCT TCGATACCAT CTACGCAGAG GGGCAGCGCC GCTACGTGGA ATCGCTCTCC GCCTACGCTC GCCAGTTCCT GGAGCAGATG GAAAAACCCG ACGTGGAGTC GATCGAGGGC CTCTCCCCTG CCATCTCCAT AGAGCAGAAA ACGACCAGCA AGAACCCCCG TTCCACGGTA GGCACCGTCA CCGAGATCTA CGATTACCTG CGCCTGCTCT TCGCCCGGAT CGGCAAGCCG CACTGCTACA ACTGCGGCCG CATCATTACC TCCCAGACCG TTTCCCAGAT GGTGGACCAG ATCGCGGCGC TGCCGCAGGG GGCGAAGCTC ACCCTCCTCT CCCCCATCGT GCGCGGCCGG AAAGGGGAGT ACAAAAAGGA GCTGACCCAG TTCAGGAAGG ACGGCTTCGC CCGTGTCGTC GTCGACGGCG AGACCCACGA CCTCTCGGAA GAGATCACCC TGGACAAGAA GAAAAAGCAC GACATCGACA TCGTCGTCGA CCGCCTGGTG GTGAAGCCGG GGATCGAGAA GCGACTCGCC GATTCGCTGG AAACGGCACT CTCGCACGCG GAGGGAATCG TCAAGGTGGC CCTCACCCCT GACGCGGATC GCGGCATCAA GGAAGAAACA CTCCTTTTCT CCGAGTCGGC CGCCTGCATC GAGTGCGGCA TCTCCTATCC CGAGATGACC CCCCGCATGT TCTCCTTCAA CAACCCGTAC GGCGCCTGCC CCGACTGCAC CGGCCTCGGC ACCAGGATGT ACCTCGACAC CAACCTCGTG GTGCCGGACC ACGACCTCAC CCTGGCCGAA GGCGCCGTCG CCCCCTGGGA GACCCGTTTC TCCGGATGGT ACCAGCAGAC CCTCGCCGCC CTGGGTAAAA GCTACGGCTT CGACCTGCAC ACCCCTTACA AGCAGCTCTC CAAGAAGGCG AAGGACGTGA TCCTGAACGG CTCGGGGGGG GAATTGGTCG ATTTCTGGTG GGTGGACGAC GCAGGGAAGC GGCACACCTA CAAGAAGGCC TTCGAGGGGG TGCTGAACAA CCTGGAGCGG CGCCATCGCG AGACCGAGTC CGAGCAGGTG CGGGAGGAAC TGGAAAAGTA CATGGACGTC ATGCCCTGCC CCACCTGCCA AGGGGCGAGG CTCAAGAAGG AGGCTCTTTT CGTCAGGGTT GGGGGGGAGA ACATCCAGCA GGTTACCGCC TATTCCATCC AGGATGCACT CTCCTTCTTC GACTCGCTGG CGCTCAGCGA GAAGGAAGAG GACATCGCCC GGAGGATCCT CAAGGAGATC AGGGAGCGGC TGAACTTCCT GGTCAACGTC GGCCTCGACT ACCTGTCGCT GGACCGCTCC TCGGGAACCC TCTCCGGGGG CGAAGGGCAG AGGATCCGGC TCGCCACCCA GATCGGCTCT TCGCTGGTCG GGGTGCTCTA CATCCTGGAC GAGCCCTCCA TCGGCCTGCA CCAGCGCGAC AACGGCAGGC TGCTGTCGAC CCTGAGGCAC CTGCGCGACA TAGGCAACAC GGTCCTCGTG GTGGAGCACG ACGAGGAAAC CATCTCGGAG GCGGACTGGG TCATCGACAT GGGACCGGGC GCCGGGGTCC ACGGCGGCGA GGTGGTGGCC GAAGGCACCC CCGCCGAGAT CATGGCCAAT CCCCATTCCC TCACCGGGCG CTACCTCTCC GGCGCGCTGA CGATAGCGAT CCCCAAAAAG CGCAGGAAAG GGAGCCGGTT CCTCTCCATC GAGGGGGCCA ACGAGAACAA CCTGAAGGAC GTCTCTGTCG ACCTGCCGCT CGGCGTCATG ACCTGCATCA CCGGGGTGTC GGGGTCGGGG AAGTCCTCGC TCATCATCGA CACCCTCTAT AAGACCCTGA ACCAGCGGCT CTACAAAAGC CGGGAAAAGG CCGGAGCGGT CCGGGCCATC CACGGCATGG AGGTGCTGGA CAAGGTGATC AACATCGACC AGTCCCCAAT CGGCCGCACG CCTCGCTCCA ACCCCGCCAC CTACACCGGC CTCTTCACCG AAATCAGGGA GATCTTCGCT CAGCTCCCCG AGTCGAAGAT GCGCGGCTAC AAGCCCGGGC GCTACTCCTT CAACGTGAAG GGGGGGCGCT GCGAGGCCTG CGCCGGGGAC GGCATCATCA AGATCGAGAT GCACTTTCTC CCCGATGTGT ACGTGCAGTG CGAGGTCTGC AAGGGGGCGC GCTACAACAG GGAGACCCTT GAGGTCCGCT TCAAGGGGCG CTCCATCGCG GAAGTGCTAG ACATGACCGT CTCCCAGGCC CTGGTCTTCC TGGAGCATAT CCCGCGCCTG AAGGCTAAGC TGCAGACCCT GGAGGAGGTG GGCCTTGGTT ACATCAAGCT GGGGCAGTCC GCGACCACCT TGTCAGGCGG GGAGGCGCAG CGCGTCAAGC TCGCCAAGGA GCTTTCCAAA CGGGCCACCG GGCGGACCAT CTATATCCTG GATGAACCGA CCACCGGCCT GCACTTCGCC GACATAGCAA AGCTCTTGGA GGTGCTGCAC AAGCTGGTGG ACGCGGGAAA CAGCATCGTG GTCATCGAGC ACAACCTCGA TGTGATCAAG ACGGCGGACT GGATCGTCGA CCTGGGTCCC GAGGGGGGGG ACCGCGGCGG CGAGGTGATA GCGGTGGGCA CTCCGGAGCA GGTTTCCCGG GTGGAGCGGT CGTACACCGG GCAGTACCTC AAAAAGATGC TGCCACACGG GTAG
|
Protein sequence | MATDKIIIKG ACEHNLKCID VEIPRDKLVV ITGISGSGKS TLAFDTIYAE GQRRYVESLS AYARQFLEQM EKPDVESIEG LSPAISIEQK TTSKNPRSTV GTVTEIYDYL RLLFARIGKP HCYNCGRIIT SQTVSQMVDQ IAALPQGAKL TLLSPIVRGR KGEYKKELTQ FRKDGFARVV VDGETHDLSE EITLDKKKKH DIDIVVDRLV VKPGIEKRLA DSLETALSHA EGIVKVALTP DADRGIKEET LLFSESAACI ECGISYPEMT PRMFSFNNPY GACPDCTGLG TRMYLDTNLV VPDHDLTLAE GAVAPWETRF SGWYQQTLAA LGKSYGFDLH TPYKQLSKKA KDVILNGSGG ELVDFWWVDD AGKRHTYKKA FEGVLNNLER RHRETESEQV REELEKYMDV MPCPTCQGAR LKKEALFVRV GGENIQQVTA YSIQDALSFF DSLALSEKEE DIARRILKEI RERLNFLVNV GLDYLSLDRS SGTLSGGEGQ RIRLATQIGS SLVGVLYILD EPSIGLHQRD NGRLLSTLRH LRDIGNTVLV VEHDEETISE ADWVIDMGPG AGVHGGEVVA EGTPAEIMAN PHSLTGRYLS GALTIAIPKK RRKGSRFLSI EGANENNLKD VSVDLPLGVM TCITGVSGSG KSSLIIDTLY KTLNQRLYKS REKAGAVRAI HGMEVLDKVI NIDQSPIGRT PRSNPATYTG LFTEIREIFA QLPESKMRGY KPGRYSFNVK GGRCEACAGD GIIKIEMHFL PDVYVQCEVC KGARYNRETL EVRFKGRSIA EVLDMTVSQA LVFLEHIPRL KAKLQTLEEV GLGYIKLGQS ATTLSGGEAQ RVKLAKELSK RATGRTIYIL DEPTTGLHFA DIAKLLEVLH KLVDAGNSIV VIEHNLDVIK TADWIVDLGP EGGDRGGEVI AVGTPEQVSR VERSYTGQYL KKMLPHG
|
| |