Gene GM21_3854 details


Gene Information

Locus tag: GM21_3854
Symbol:
ID: 8139228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4438845
End bp: 4441337
Gene Length: 2493 bp
Protein Length: 830 aa
Translation table: 11
GC content: 64%
IMG OID: 644871471
Product: helicase c2
Protein accession: YP_003023629
Protein GI: 253702440
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1199] Rad3-related DNA helicases
TIGRFAM ID: [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative
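
The coordinate and length fields in this record are related by simple arithmetic, which makes a quick consistency check possible. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above; 1-based inclusive coordinates are assumed.
start_bp = 4438845
end_bp = 4441337

# An inclusive span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2493 830
```

Both values agree with the Gene Length and Protein Length fields listed in the record.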


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.000920192
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAAAAGT CTTTTTCCGA TCAGGCCATT CAGCAGCTCC GCAGCGCAAT CGACGAGGCA 
AACGGCAACG AAGTCTTCTT CCTTGGGCGG ACCGACGAGG CGCGCATCGT CGTTGAGGTC
GAGCCCCTGG CGCGCGGCAA CCGCGACGCG GTCCCCGCCA TCATGATCGC CTGCTCCTTC
GGCGACGTGG TGATCCACAA CCACCCCTCC GGCAACCTCA CCCCCTCCCA GCCCGACATC
GAGATCGCGT CGCTCCTGGG AAACCAGGGG GTCGGATTCT ACATCGTCGA CAACCAGGCC
GACCGCTGCA ACCAGGTCGT CGCCCCCTTC TCCCGCAAGG TGGTGGAATC GCTCTCCTAC
TCCGAGATCG AGCGTTTCTA CGCGCCGGAC GGGGTATTGT CCCAGGCGCT CCCCGGTTAC
GAGCACCGCC CAGAGCAGAC CCGGATGGCG CTGAACCTCT CCGAGGCGTT CAACGACGAG
AAGGTCGCCG TGGTCGAGGC GGGGACCGGC ACCGGGAAGT CGCTCGCCTA TCTGCTTCCG
GCGGCGCTTT GGTCGGTGCG CAACAAGGAG CGGGTGGTGG TTTCCACCAA CACCATCAAC
CTCCAGGAGC AGTTGATCCA GAAGGACATC CCCTTCCTGC AGCAGTACGC CCAGATCAAG
TTCCGCGCCG TCCTGGTGAA GGGGCGCGGC AACTACCTCT GCCTGAGAAA GCTCCACGTG
AACGCCGCCG ACGCCTCCCT TTTCAAGGAC GAGACCGCGC AGGAACTGGA CGCCATCGCC
TCCTGGAGCA AGAAGACCGA CGACGGCTGC CGCTCGGACC TGGCCTTCAT ACCGAAGGAC
GAGGTCTGGG AGGAGCTTTG CTGCGAATCG GACCAGTGCG GACGGGTCAG GTGCCCCGAT
TACGCCCGCT GCTTCTTCTA CAAGGCCAGG CGCGAGGCTG CCGGGGCGGA CCTCCTGGTG
GTGAACCACG CGCTTCTTTT GGCCGACCTT TCCGTGCGTC AGGAAACCGG CTACGACGCG
ACAGCCATCC TACCGCCGTT CACCAGGCTC ATCTTCGACG AGGGGCACCA CCTGGAGGAC
GTCGCCACCA ACTTCCTCTC CAGCCAGGTC TCGCGCCTGG GGCTCGTCAA GCTCCTGGGC
AAACTGCAGC ACCCCAAGAA GGCGCACCGC GGCATCCTGC CGCAGCTCTC TTCCCTCCTC
TCCTCCGCGG TCCCGGACGA TCAGGACGAC ATCTACCTGG AGATTGCCGA GGTCCTTGAG
GACCGGCTGA TCCCGAGGAG GGTGTCCGTA CTCGACGCGG TGACCCGGGG GATGGACGCG
ATCGGCGAGT CCCTTTTTCA GAAGCTCAAG AAGGAGGCGG GCGAACAAAA GCTGCGGGTA
ACCCCCGCCT TGTACGGGAC GCAGCTTTGG CAGGAGGTGA CCGAGCAGGT GGAGATCATG
TGCCAGGCGC TCTCCGAGTA CGCGCTCGCC ATGCAGACCT TCCTGAAGCG GTGCGAGAAG
CTCTCCGACA AGGTGCTGGA GAAGCTCGCC GGGCCGCTCA CCGACCTTCG GGGGGTGAAG
GGGCGTGTGG AGTGCGCGGT CGATGCCCTT CGCTTCTTCA CCGCGCGCGA GGAGGAGCAC
TGCCGGTGGT TCGAGCTGAG AAAAGGGCCG CTTGTGAAGC TCTGCTCTTC GCCGCTGGAA
GTTGCCGAAT CGATCAAGAA GGCGATCCTG GACCGATTCA AGACCGTGGT GCTCACCTCG
GCCACGCTCG CCGTGGGGGA AAAGTTCGAC TTTTTAAAGC GCAGGACCGG GATCGAACTC
CTCCCCAAGG AGCGCGTCAG CACGCTCCTT CTCCCCTCCC CCTTCGACTA CGCCCGGCAG
GCGCTCGTCG GCGCCCCCTC CGACATGCCC GAGCCGACCT CGCCGCTTTT CGAAGGAAGG
CTTTGCGAGC ACCTTTTGAA GGCGCTCAAG ATCTCCCAGG GGCGCGCCTT CGTCCTCTTC
ACCTCCTATG ATCTGTTGAT CCGGGTCTTC AACCGCCTGG CGAAGCCGCT CAAGGCGGCC
GGGCTCACCC CGATGCGCCA GGGGGAGACC AACCGCCACA TGCTTCTTTC CAACTACAGA
AGCGCGGTCA ACCCGGTCCT CTTCGGCACC GATTCCTTCT GGGAGGGGGT GGACGTTCAG
GGGCGGGGGC TGGAGCTGGT GGTGATCACC CGGCTTCCCT TCCGGGTTCC GACCGAGCCG
ATCCTGGAAG CGAGAAGCGA GCACATCGCG GCGCTCGGGG GAGATCCCTT CATGAGCTAC
ACGGTCCCCC AGGCGGTGAT CAAGTTCAAG CAGGGGTTCG GGAGGCTGAT CAGGAGCAAG
GAGGACCGGG GCGCGGTGCT GATCCTCGAC TCGCGGGTCT TGACCAAGAA CTACGGCAAG
GTTTTCCTGA CCGCCCTGCA CGGGGTGGAG GTGGTGCGCG GGGAAGAGGC GCTACTCTGC
GAAAAGCTGG AAGCGTTCTT CGGAAAAACC TAA
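
The gene sequence is listed in blocks of ten bases, so any downstream use first needs the spaces and line breaks removed. The following is a minimal sketch of that cleanup, plus a GC fraction calculation of the kind that could be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool; only the first line of the listing is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Return the fraction of G and C bases in a (possibly blocked) sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGCAAAAGT CTTTTTCCGA TCAGGCCATT CAGCAGCTCC GCAGCGCAAT CGACGAGGCA"

print(len(clean_sequence(first_line)))      # number of bases on that line
print(round(gc_fraction(first_line), 2))    # GC fraction of that line only
```

Passing the full listing instead of `first_line` would give the GC fraction for the whole gene.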
 
Protein sequence
MQKSFSDQAI QQLRSAIDEA NGNEVFFLGR TDEARIVVEV EPLARGNRDA VPAIMIACSF 
GDVVIHNHPS GNLTPSQPDI EIASLLGNQG VGFYIVDNQA DRCNQVVAPF SRKVVESLSY
SEIERFYAPD GVLSQALPGY EHRPEQTRMA LNLSEAFNDE KVAVVEAGTG TGKSLAYLLP
AALWSVRNKE RVVVSTNTIN LQEQLIQKDI PFLQQYAQIK FRAVLVKGRG NYLCLRKLHV
NAADASLFKD ETAQELDAIA SWSKKTDDGC RSDLAFIPKD EVWEELCCES DQCGRVRCPD
YARCFFYKAR REAAGADLLV VNHALLLADL SVRQETGYDA TAILPPFTRL IFDEGHHLED
VATNFLSSQV SRLGLVKLLG KLQHPKKAHR GILPQLSSLL SSAVPDDQDD IYLEIAEVLE
DRLIPRRVSV LDAVTRGMDA IGESLFQKLK KEAGEQKLRV TPALYGTQLW QEVTEQVEIM
CQALSEYALA MQTFLKRCEK LSDKVLEKLA GPLTDLRGVK GRVECAVDAL RFFTAREEEH
CRWFELRKGP LVKLCSSPLE VAESIKKAIL DRFKTVVLTS ATLAVGEKFD FLKRRTGIEL
LPKERVSTLL LPSPFDYARQ ALVGAPSDMP EPTSPLFEGR LCEHLLKALK ISQGRAFVLF
TSYDLLIRVF NRLAKPLKAA GLTPMRQGET NRHMLLSNYR SAVNPVLFGT DSFWEGVDVQ
GRGLELVVIT RLPFRVPTEP ILEARSEHIA ALGGDPFMSY TVPQAVIKFK QGFGRLIRSK
EDRGAVLILD SRVLTKNYGK VFLTALHGVE VVRGEEALLC EKLEAFFGKT
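
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a rough illustration, the sketch below builds a codon table from the standard NCBI ordering (bases in the order T, C, A, G) and translates until the first stop codon. It assumes that table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as start codons; alternative start codons are not handled here. The function name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(block: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCAAAAGT CTTTTTCCGA TCAGGCCATT"))  # MQKSFSDQAI
```

The result matches the first ten residues of the protein sequence listed above.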