Gene GM21_0134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0134 
Symbol 
ID8135437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp163019 
End bp165007 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content63% 
IMG OID644867753 
Productexcinuclease ABC subunit B 
Protein accessionYP_003019977 
Protein GI253698788 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAAT TCGAACTGGT AACGAGTTTC GAGCCACGCG GCGACCAACC CAGGGCCATC 
GCAGAACTGG CGGACGGAGT GCTGCGGGGC GACCCGCACC AGGTGCTCCT CGGGGTCACC
GGGTCGGGCA AGACCTTCAC CATGGCCCAG GTGATTGCAC GCTGCAACTG CCCCACCCTG
GTTCTGGCCC CCAACAAGAC TCTGGCCGCT CAGCTTTACG GCGAGTTCAA GGAACTCTTC
CCCAACAACG CCGTCGAGTA TTTCGTCTCC TATTACGACT ACTACCAGCC GGAAGCCTAC
CTCCCCTCCT CAGACACCTT CATCGAGAAG GACTCCTCGA TAAACGACGA GATCGACAAG
TTCCGGCACT CCGCCACCAG GAGCCTTTTG ACCCGCCGCG ACGTCATCAT CGTAGCCTCG
GTTTCCTGCA TCTACGGCAT AGGCTCCCCC GAGTCTTACC AGGAGATGCA GATCCGTTTC
CGCGAAGGGG ACGAGGTCGG GCGCGACGAG ATGCTGCAGC GGCTTGTCGC GATCCAGTAC
CAGCGAAACG ACGTCGATTT CCACCGCGGC TCCTTCCGGG TTCGTGGGGA TACGGTCGAG
GTCTTCCCCG CCCACGACGA CGAGCGGGCG CTCAGGATCG AGTTCTTCGG GGACACGGTG
GACGCCATCT CCGAGATAGA CCCCCTGCGC GGGGTGCAGC TGCAGAAACT TTCCCGCTGC
GCCATCTACC CCGCCTCCCA TTACGTCGCC AGCCGTCAGA CCCTGGAGCG GGCCGTGGAG
CTGATTCGGC TCGAACTGGA GGAGCGGATC CGCTACTTCA ATGCGCAGAA CATGCTCCTT
GAGGCGCAGC GCATCGAGCA GAGAACCTTC TTCGACATCG AGATGATGGA GGAGATGGGC
TTTTGCCAGG GGATCGAGAA CTACTCGCGC CATTTCGACG GTCGCGCCGC GGGGGAACCC
CCTTACACGC TGATCGACTA TTTCCCCAAG GACTTCCTGC TGGTGATCGA CGAGTCCCAC
ATCACCGTTT CGCAGGTGGG GGGGATGTAC CGCGGCGACC GCAGCCGAAA AGAGACCCTG
GTGAACTACG GTTTCAGGCT CCCCTCGGCC TTGGACAACC GCCCGCTCAC CTTCCAGGAG
TTCCAGAAGA AGCTGCATCA GACCATCTAC GTTTCCGCGA CCCCGGCGGA CTACGAGCTG
AAGCAGGCGG GAGGGGTCGT GGTGGAGCAG TTGATCCGCC CGACCGGCCT CATCGACCCG
GCCATCGAGG TGCGCCCGGC CGCGGGGCAG GTGGACGACC TCCTGCACGA GGCGCGCGAG
ACGGCGGCCA GGGGAGAGCG GGTGCTGGTC ACCACCCTTA CCAAGCGGAT GGCCGAGGAA
CTCACCGACT ACTATCGCGA GCTCGGTATC CGCGTCCGTT ACCTTCACTC CGACATCGAC
ACCTTCCAGC GCATGGAGAT CCTCAGGGAC CTAAGGCTCG GCGAGTTCGA CCTGTTGGTC
GGGATCAACC TGCTCAGGGA AGGGCTCGAC CTCCCCGAGG TCTCGCTGGT GGCGATCCTC
GATGCCGACA AGGAAGGCTT CCTCCGCTCC ACCAGGTCGC TGATCCAGAC CTGTGGGCGC
GCGGCGAGGA ACTTGTCCGG ACGCGTGCTC ATGTACGCGG ACAAGGTGAC CGGCTCCATG
CAGGCTGCCA TCGACGAGAC CGTGAGGAGG CGCGCACTGC AGACGGCCTA CAACGAGGAG
CACGGCATCA CGCCGGAGAG CGTGCGGAGG ATCATCGGCA ACGTGCTGCA GGCCCCCGAG
GAGAAGGATT GGGTCACGGT GCCGGCCTCG GCTGAGGAGT TCGTGAGCGC CAAGGAGCTG
GAGAAGACGC TGAAGAGGCT GAGAAAGGAG ATGCTGGCGG CGGCGAAGGC TCAGGAATTC
GAGAGGGCGG CGGAGCTGAG GGACAAGATC AAGCGGCTGG AGGTCGCGGA AATCATGAGA
AGCAATTGA
 
Protein sequence
MDKFELVTSF EPRGDQPRAI AELADGVLRG DPHQVLLGVT GSGKTFTMAQ VIARCNCPTL 
VLAPNKTLAA QLYGEFKELF PNNAVEYFVS YYDYYQPEAY LPSSDTFIEK DSSINDEIDK
FRHSATRSLL TRRDVIIVAS VSCIYGIGSP ESYQEMQIRF REGDEVGRDE MLQRLVAIQY
QRNDVDFHRG SFRVRGDTVE VFPAHDDERA LRIEFFGDTV DAISEIDPLR GVQLQKLSRC
AIYPASHYVA SRQTLERAVE LIRLELEERI RYFNAQNMLL EAQRIEQRTF FDIEMMEEMG
FCQGIENYSR HFDGRAAGEP PYTLIDYFPK DFLLVIDESH ITVSQVGGMY RGDRSRKETL
VNYGFRLPSA LDNRPLTFQE FQKKLHQTIY VSATPADYEL KQAGGVVVEQ LIRPTGLIDP
AIEVRPAAGQ VDDLLHEARE TAARGERVLV TTLTKRMAEE LTDYYRELGI RVRYLHSDID
TFQRMEILRD LRLGEFDLLV GINLLREGLD LPEVSLVAIL DADKEGFLRS TRSLIQTCGR
AARNLSGRVL MYADKVTGSM QAAIDETVRR RALQTAYNEE HGITPESVRR IIGNVLQAPE
EKDWVTVPAS AEEFVSAKEL EKTLKRLRKE MLAAAKAQEF ERAAELRDKI KRLEVAEIMR
SN