Gene GM21_3785 details

Gene Information

Locus tag: GM21_3785
Symbol:
ID: 8139159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4358282
End bp: 4360081
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 64%
IMG OID: 644871404
Product: ATP-dependent DNA helicase RecQ
Protein accession: YP_003023562
Protein GI: 253702373
COG category: [L] Replication, recombination and repair
COG ID: [COG0514] Superfamily II DNA helicase
TIGRFAM ID: [TIGR00614] ATP-dependent DNA helicase, RecQ family
            [TIGR01389] ATP-dependent DNA helicase RecQ
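
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1800 bp) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


length = gene_length_bp(4358282, 4360081)
print(length, protein_length_aa(length))  # prints: 1800 599
```

Both values match the Gene Length and Protein Length fields of this record.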


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 110
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGCAT CCCCCATACA GATTCTCAAC GACGTGTTCG GCTTCAAGTC GTTCCGGTCG 
CCGCAGCATG AGATCGTGGA AACGGTTCTC TCCGGCAGGG ACGCCTTCGT CCTCATGCCG
ACCGGCGGCG GCAAATCGCT CTGCTACCAG ATCCCGGCCC TCTGCTTTCC CGGCACAGCC
TTGGTGGTCT CCCCCCTCAT CTCGCTGATG AAGGACCAGG TCGACGCCCT GCGCGAGAAC
GGCATCTCCG CCGCGTGCTA CAACTCTGCG CTGGGAGAGG CGGAGGCGCG CCGGGTGCTG
GCCCAACTGC ACGCAGGCGA GCTGAAGCTC CTCTACGTCG CGCCCGAGCG GCTTTTGAGC
GACGGCTTCC TGGAAAGGAT CAAGCCGCTT TCCATCTCCC TCTTCGCCAT CGACGAGGCG
CACTGCGTCT CGCAGTGGGG GCACGACTTC CGTCCCGAGT ACGCCCAGCT TGGCGTCTTG
CGCGAGATAT TCCCCGAGAT ACCGATGATC GCGCTTACCG CGACGGCCGA CGCCCAGACC
CGGGGAGACA TACTTTCCCG GCTCGGGCTT CAGGGGGCTA CCTGCTACTG CGCCGGTTTC
GATCGCCCCA ACATCCGTTA CAGCGTCATC GACAAGAATA AGCCCTTCAA CCAGCTCACC
GGCTTCTTGT CGAGCCGCAA GGACGAGGCG GGGATAGTGT ATGCCCTGTC CCGGAAGCGG
GTGGAGGAGG TGGCCAGGAA GCTTTGCGCA GCCGGGATCA AGGCGGCCGC CTATCACGCC
GGCCTCCCCG ATAAGGAGCG GCACCGGGTC CAGGAGGCCT TCCTCAAGGA CGACATCAAG
ATCGTGGTGG CGACGGTCGC CTTCGGCATG GGGATAGACA AGTCCAACGT CCGCTTCGTG
GTGCACTACG ACATGCCCAA GAGCATCGAG AGCTACTATC AGGAGACCGG GCGTGCCGGC
CGGGACGGGC TTCCCGCAGA CGCGCTCCTC TTGTTCGGTT ACGGCGACGT AGCCGTGGCC
CGCGGACTCA TCGGCAACGG CGGCAACGCG GAGCAAAACA GGATCGAGCT GCACAAGCTC
AACTGCATGA CCGGGTTCGC CGAGGCCCAG ACCTGCCGGC GCCGGGTGCT GCTCGGATAT
TTCGGGGACC GGCTGGAGCA GGACTGCGGC AACTGCGACA TCTGCGAGAG CCCGCCCGAG
CGCTTCGACG CCACCGAGGA TGCCCAGAAG GCCCTCTCCT GCGTCTACCG AGTCGGGCAG
CGGTTCGGGA TGGGGCACGT CATCGACGTT TTGCGCGGTT CGCAGAACCA CCGGATCCAG
GAGCTGAAAC ACGACCAGCT TTCCACCTTC GGCATCGGAA AGCAACACTC CCAGGAGTTC
TGGGGCAACC TGCTGCGCCA ATTGATCCAC CTGGGGTATC TGGAGCAGGA CCTGGCGAAT
TTCTCGGTCT TGAAGCTGAC CGAGGGGGCG CGCCCGCTTT TGAGGGGGGA GGTTCGGTTG
GAGCTAGCCA AGCCGCGCGA CACCAAGGTG GTGGAGAAGA AGTCGGCCGC CAAAAAGCCG
AGTTACGACG GCGTGCTGTT CCAGGAGCTG CGTGAGCTAA GGAAGGGGAT CGCGGACGAG
CAGCAGGTGC CGCCCTTCGT GGTGTTCGCC GACGCCACGC TCGCCGAGAT GGCGGCACAG
ATGCCCAAGG ACAAGTGGGA GTTGCTGAAG ATAACCGGGG TGGGGCAGCA CAAGATGGCC
CGCTACGGAG ACGCCTTCCT GCGGGTGATC AGGGAACATC TGGAAAAGGC GGAGAACTGA
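
The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be joined before any analysis. A minimal sketch, using hypothetical helper names, that removes the block spacing and computes the GC fraction; applied to the full sequence, the result could be compared with the 64% GC content reported in the Gene Information section.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, used here as sample input.
first_line = "ATGCCTGCAT CCCCCATACA GATTCTCAAC GACGTGTTCG GCTTCAAGTC GTTCCGGTCG"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
print(f"GC fraction of the first line: {gc_fraction(seq):.1%}")
```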
 
Protein sequence
MPASPIQILN DVFGFKSFRS PQHEIVETVL SGRDAFVLMP TGGGKSLCYQ IPALCFPGTA 
LVVSPLISLM KDQVDALREN GISAACYNSA LGEAEARRVL AQLHAGELKL LYVAPERLLS
DGFLERIKPL SISLFAIDEA HCVSQWGHDF RPEYAQLGVL REIFPEIPMI ALTATADAQT
RGDILSRLGL QGATCYCAGF DRPNIRYSVI DKNKPFNQLT GFLSSRKDEA GIVYALSRKR
VEEVARKLCA AGIKAAAYHA GLPDKERHRV QEAFLKDDIK IVVATVAFGM GIDKSNVRFV
VHYDMPKSIE SYYQETGRAG RDGLPADALL LFGYGDVAVA RGLIGNGGNA EQNRIELHKL
NCMTGFAEAQ TCRRRVLLGY FGDRLEQDCG NCDICESPPE RFDATEDAQK ALSCVYRVGQ
RFGMGHVIDV LRGSQNHRIQ ELKHDQLSTF GIGKQHSQEF WGNLLRQLIH LGYLEQDLAN
FSVLKLTEGA RPLLRGEVRL ELAKPRDTKV VEKKSAAKKP SYDGVLFQEL RELRKGIADE
QQVPPFVVFA DATLAEMAAQ MPKDKWELLK ITGVGQHKMA RYGDAFLRVI REHLEKAEN
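
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used to produce this record. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG. The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCCTGCAT CCCCCATACA GATTCTCAAC"))  # prints: MPASPIQILN
```

Passing the complete gene sequence to `translate` should give a string that can be compared with the protein sequence once its block spacing is removed in the same way.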