Gene GM21_0189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0189 
Symbol 
ID8135492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp223311 
End bp226154 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content63% 
IMG OID644867808 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003020032 
Protein GI253698843 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.0119000000000001e-25 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAACGG ACAAGATCAT CATCAAAGGT GCCTGCGAGC ACAACCTCAA ATGCATAGAC 
GTGGAAATTC CCCGCGACAA GTTGGTGGTG ATCACCGGCA TCTCGGGGTC TGGAAAATCC
ACCCTCGCCT TCGATACCAT CTACGCAGAG GGGCAGCGCC GCTACGTGGA ATCGCTCTCC
GCCTACGCTC GCCAGTTCCT GGAGCAGATG GAAAAACCCG ACGTGGAGTC GATCGAGGGC
CTCTCCCCTG CCATCTCCAT AGAGCAGAAA ACGACCAGCA AGAACCCCCG TTCCACGGTA
GGCACCGTCA CCGAGATCTA CGATTACCTG CGCCTGCTCT TCGCCCGGAT CGGCAAGCCG
CACTGCTACA ACTGCGGCCG CATCATTACC TCCCAGACCG TTTCCCAGAT GGTGGACCAG
ATCGCGGCGC TGCCGCAGGG GGCGAAGCTC ACCCTCCTCT CCCCCATCGT GCGCGGCCGG
AAAGGGGAGT ACAAAAAGGA GCTGACCCAG TTCAGGAAGG ACGGCTTCGC CCGTGTCGTC
GTCGACGGCG AGACCCACGA CCTCTCGGAA GAGATCACCC TGGACAAGAA GAAAAAGCAC
GACATCGACA TCGTCGTCGA CCGCCTGGTG GTGAAGCCGG GGATCGAGAA GCGACTCGCC
GATTCGCTGG AAACGGCACT CTCGCACGCG GAGGGAATCG TCAAGGTGGC CCTCACCCCT
GACGCGGATC GCGGCATCAA GGAAGAAACA CTCCTTTTCT CCGAGTCGGC CGCCTGCATC
GAGTGCGGCA TCTCCTATCC CGAGATGACC CCCCGCATGT TCTCCTTCAA CAACCCGTAC
GGCGCCTGCC CCGACTGCAC CGGCCTCGGC ACCAGGATGT ACCTCGACAC CAACCTCGTG
GTGCCGGACC ACGACCTCAC CCTGGCCGAA GGCGCCGTCG CCCCCTGGGA GACCCGTTTC
TCCGGATGGT ACCAGCAGAC CCTCGCCGCC CTGGGTAAAA GCTACGGCTT CGACCTGCAC
ACCCCTTACA AGCAGCTCTC CAAGAAGGCG AAGGACGTGA TCCTGAACGG CTCGGGGGGG
GAATTGGTCG ATTTCTGGTG GGTGGACGAC GCAGGGAAGC GGCACACCTA CAAGAAGGCC
TTCGAGGGGG TGCTGAACAA CCTGGAGCGG CGCCATCGCG AGACCGAGTC CGAGCAGGTG
CGGGAGGAAC TGGAAAAGTA CATGGACGTC ATGCCCTGCC CCACCTGCCA AGGGGCGAGG
CTCAAGAAGG AGGCTCTTTT CGTCAGGGTT GGGGGGGAGA ACATCCAGCA GGTTACCGCC
TATTCCATCC AGGATGCACT CTCCTTCTTC GACTCGCTGG CGCTCAGCGA GAAGGAAGAG
GACATCGCCC GGAGGATCCT CAAGGAGATC AGGGAGCGGC TGAACTTCCT GGTCAACGTC
GGCCTCGACT ACCTGTCGCT GGACCGCTCC TCGGGAACCC TCTCCGGGGG CGAAGGGCAG
AGGATCCGGC TCGCCACCCA GATCGGCTCT TCGCTGGTCG GGGTGCTCTA CATCCTGGAC
GAGCCCTCCA TCGGCCTGCA CCAGCGCGAC AACGGCAGGC TGCTGTCGAC CCTGAGGCAC
CTGCGCGACA TAGGCAACAC GGTCCTCGTG GTGGAGCACG ACGAGGAAAC CATCTCGGAG
GCGGACTGGG TCATCGACAT GGGACCGGGC GCCGGGGTCC ACGGCGGCGA GGTGGTGGCC
GAAGGCACCC CCGCCGAGAT CATGGCCAAT CCCCATTCCC TCACCGGGCG CTACCTCTCC
GGCGCGCTGA CGATAGCGAT CCCCAAAAAG CGCAGGAAAG GGAGCCGGTT CCTCTCCATC
GAGGGGGCCA ACGAGAACAA CCTGAAGGAC GTCTCTGTCG ACCTGCCGCT CGGCGTCATG
ACCTGCATCA CCGGGGTGTC GGGGTCGGGG AAGTCCTCGC TCATCATCGA CACCCTCTAT
AAGACCCTGA ACCAGCGGCT CTACAAAAGC CGGGAAAAGG CCGGAGCGGT CCGGGCCATC
CACGGCATGG AGGTGCTGGA CAAGGTGATC AACATCGACC AGTCCCCAAT CGGCCGCACG
CCTCGCTCCA ACCCCGCCAC CTACACCGGC CTCTTCACCG AAATCAGGGA GATCTTCGCT
CAGCTCCCCG AGTCGAAGAT GCGCGGCTAC AAGCCCGGGC GCTACTCCTT CAACGTGAAG
GGGGGGCGCT GCGAGGCCTG CGCCGGGGAC GGCATCATCA AGATCGAGAT GCACTTTCTC
CCCGATGTGT ACGTGCAGTG CGAGGTCTGC AAGGGGGCGC GCTACAACAG GGAGACCCTT
GAGGTCCGCT TCAAGGGGCG CTCCATCGCG GAAGTGCTAG ACATGACCGT CTCCCAGGCC
CTGGTCTTCC TGGAGCATAT CCCGCGCCTG AAGGCTAAGC TGCAGACCCT GGAGGAGGTG
GGCCTTGGTT ACATCAAGCT GGGGCAGTCC GCGACCACCT TGTCAGGCGG GGAGGCGCAG
CGCGTCAAGC TCGCCAAGGA GCTTTCCAAA CGGGCCACCG GGCGGACCAT CTATATCCTG
GATGAACCGA CCACCGGCCT GCACTTCGCC GACATAGCAA AGCTCTTGGA GGTGCTGCAC
AAGCTGGTGG ACGCGGGAAA CAGCATCGTG GTCATCGAGC ACAACCTCGA TGTGATCAAG
ACGGCGGACT GGATCGTCGA CCTGGGTCCC GAGGGGGGGG ACCGCGGCGG CGAGGTGATA
GCGGTGGGCA CTCCGGAGCA GGTTTCCCGG GTGGAGCGGT CGTACACCGG GCAGTACCTC
AAAAAGATGC TGCCACACGG GTAG
 
Protein sequence
MATDKIIIKG ACEHNLKCID VEIPRDKLVV ITGISGSGKS TLAFDTIYAE GQRRYVESLS 
AYARQFLEQM EKPDVESIEG LSPAISIEQK TTSKNPRSTV GTVTEIYDYL RLLFARIGKP
HCYNCGRIIT SQTVSQMVDQ IAALPQGAKL TLLSPIVRGR KGEYKKELTQ FRKDGFARVV
VDGETHDLSE EITLDKKKKH DIDIVVDRLV VKPGIEKRLA DSLETALSHA EGIVKVALTP
DADRGIKEET LLFSESAACI ECGISYPEMT PRMFSFNNPY GACPDCTGLG TRMYLDTNLV
VPDHDLTLAE GAVAPWETRF SGWYQQTLAA LGKSYGFDLH TPYKQLSKKA KDVILNGSGG
ELVDFWWVDD AGKRHTYKKA FEGVLNNLER RHRETESEQV REELEKYMDV MPCPTCQGAR
LKKEALFVRV GGENIQQVTA YSIQDALSFF DSLALSEKEE DIARRILKEI RERLNFLVNV
GLDYLSLDRS SGTLSGGEGQ RIRLATQIGS SLVGVLYILD EPSIGLHQRD NGRLLSTLRH
LRDIGNTVLV VEHDEETISE ADWVIDMGPG AGVHGGEVVA EGTPAEIMAN PHSLTGRYLS
GALTIAIPKK RRKGSRFLSI EGANENNLKD VSVDLPLGVM TCITGVSGSG KSSLIIDTLY
KTLNQRLYKS REKAGAVRAI HGMEVLDKVI NIDQSPIGRT PRSNPATYTG LFTEIREIFA
QLPESKMRGY KPGRYSFNVK GGRCEACAGD GIIKIEMHFL PDVYVQCEVC KGARYNRETL
EVRFKGRSIA EVLDMTVSQA LVFLEHIPRL KAKLQTLEEV GLGYIKLGQS ATTLSGGEAQ
RVKLAKELSK RATGRTIYIL DEPTTGLHFA DIAKLLEVLH KLVDAGNSIV VIEHNLDVIK
TADWIVDLGP EGGDRGGEVI AVGTPEQVSR VERSYTGQYL KKMLPHG