Gene GM21_3707 details

Gene Information

Locus tag: GM21_3707
Symbol:
ID: 8139081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4272421
End bp: 4273566
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 62%
IMG OID: 644871327
Product: DEAD/DEAH box helicase domain protein
Protein accession: YP_003023485
Protein GI: 253702296
COG category: [J] Translation, ribosomal structure and biogenesis
              [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG0513] Superfamily II DNA and RNA helicases
TIGRFAM ID:
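
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; neither is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4272421
END_BP = 4273566
GENE_LENGTH_BP = 1146
PROTEIN_LENGTH_AA = 381


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))    # 1146
    print(protein_length(GENE_LENGTH_BP))   # 381
```

Under these assumptions, 4273566 - 4272421 + 1 gives the 1146 bp gene length, and 1146 / 3 - 1 gives the 381 aa protein length.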


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATTTG CATCCCTTGG ACTGCACCCG GAGCTGTTGA GCGCGATCGC GGAACAGCCG 
GAATTCAAGC GTCCCTACCC CATCCAGAGC GAGGTCATTC CCGCCGTTTT AAAGGGAAGG
GATCTGATCG CCATCGCCAA GACCGGTTCC GGCAAGACGG CGAGCTTCAT CCTGCCGCTC
CTGCAACTCA TCCACGGCCA GGATGCGGGG GAGCGGCGCA GGCTCAGGGC CTTGGTCCTG
GTGCCGACGC GCGAGCTCGC CGCCCAGATC GAGGAGGTCG CTAAACAGCT CGGGAGCCAT
CTGGAGCCTC GTGTCAAGAC CGGCGCGGTC TTCGGCGGGG TCGCCATCAA CCCGCAGATG
ATCCAACTGA AAGGGATCGA GCTGCTCATC GCCACTCCCG GCCGCCTGCT TGAACTCGTG
GCCAAAAACA GCGTGAAGCT TTCCTCGGTC GCCACCTTAG TCCTTGACGA AGCCGACCGG
CTCTACGCGG AAGACTTTCA GGACGAAATG CAGCAGATCC TCGCCCTGCT CCCGGCAAAA
CGGCAGAATT TGCTCTTTTC AGCGACCATT CCCCCGGAAG TGGAGCGGTT GGCGGCGAGC
CTGCTGAGCG ATCCGATGCG GATCGAAATC GAGGCCAAGG CGTCCGAAAC GGAGCTGATT
TCCCAGCAGA TTTACCTGGT GGACTCAAGC CGCAAGGGCC CCCTGCTCAG GTATCTGATC
AAAAGCGGCG ATTGGAAACA GGTGTTGGTC TTCACTTCGT CGCAGAAGCG GGCCGACAAC
GTCACCAGGA AGCTCGTCGC CAACGGGATC AGCGCCTCCA CATTTCACAG CGGCATGAGC
CAAGGTGGAA GGACCGCCGC CCTGGCCAAG TTCAAGACGG GTGAACTGCG GGTGCTGGTG
GCAACCGACC TCGCCTCCCG CGGGATAGAC GTGCAGTCTC TTCCACATGT GGTCAATTAC
GAACTTCCCC GGTCGCCCAT CGACTACCAG CACCGCATAG GGAGAACCGG CAGGGCCGAA
ACCGCCGGGG TGGCCGTGAC GCTGCTCTGC CCGGAAGATC TGGCGCACTT CAAGGTGATC
GAGAAACGGC TCGGGCAACG ACTGGCACGT ATCGACACTG CCGAGCTCGA TCTCTCCGCT
TACTAG
 
Protein sequence
MSFASLGLHP ELLSAIAEQP EFKRPYPIQS EVIPAVLKGR DLIAIAKTGS GKTASFILPL 
LQLIHGQDAG ERRRLRALVL VPTRELAAQI EEVAKQLGSH LEPRVKTGAV FGGVAINPQM
IQLKGIELLI ATPGRLLELV AKNSVKLSSV ATLVLDEADR LYAEDFQDEM QQILALLPAK
RQNLLFSATI PPEVERLAAS LLSDPMRIEI EAKASETELI SQQIYLVDSS RKGPLLRYLI
KSGDWKQVLV FTSSQKRADN VTRKLVANGI SASTFHSGMS QGGRTAALAK FKTGELRVLV
ATDLASRGID VQSLPHVVNY ELPRSPIDYQ HRIGRTGRAE TAGVAVTLLC PEDLAHFKVI
EKRLGQRLAR IDTAELDLSA Y
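
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC percentage computed, and its codons translated. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical and not part of this record. The codon table uses the standard amino acid assignments, which translation table 11 shares; alternative start codons are not handled here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses these same assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(sequence: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean_sequence(sequence)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    first_bases = "ATGTCATTTG CATCCCTTGG ACTGCACCCG"
    print(translate(first_bases))  # MSFASLGLHP
```

The first ten codons translate to MSFASLGLHP, which matches the start of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.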