Gene GM21_4133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4133 
Symbol 
ID8139507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4724809 
End bp4727205 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content65% 
IMG OID644871748 
ProductDEAD_2 domain protein 
Protein accessionYP_003023906 
Protein GI253702717 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCTG CACCTAATCT CATCAACATC CCGGTAGGGT ATTTCGCCCT TCCCGTTCCC 
CGCACCGGGA GCATCGAGCC CCGTTCCGGC TACGACCGCT CCACCGCGGA GGGGCGCGAA
ATTCACCTTA GGATCCAGAA AAAGCGCGCG CAGGCGGACA GCAGCTACCA GGCCGAAGTG
CCGGTCAGCC GCGTCTTCGA AAGGGGAGGG TACCAGTTCC AGGTGAGCGG CCGCCTGGAC
GGGGTCTTCC GCCACGCCCC CCCCTTGATC GAGGAGATCA AGAGCTGCTT CAACCTGTGG
GAGCTCAGGC GCCGGTTGGC TCACGCCGGC ACGGACGAGC CTTATTCGCT GCAGCTTCTC
ACCTACGGCT ACTTCCATTG GCTGGAGCAC GGCGTGATAC CGCGCCTCTC CTTCCACCTC
GTGTCCTCCC GCAACGGCGC CTCCGAGGAT CTCCCGGTGC GACTGGACCT GGAGGCGTAC
CGGGTCTGGC TGGAGCTGCG CCTGGACGAG CTGGCGCGGG AGGCGGACGA GGCCTCGAAG
CGCGCCCGAA GAAGGAGGAA GGCGGCAAAG GAGTTCCCTT TTCCCTTCGC CGCCCCCCGC
CCCGGGCAGC TTGAGCTGAT GGGGGAGATA GAGTCCGGCA TGGTGGAGGG GCGTGCCATG
CTGATCCAGG CGCCGACCGG GCTGGGAAAG ACCGCGGGGG TCCTGCACCC GGTGCTGAAG
GAGGCGCTCG GGCGCGGCCA GACGGTCTGC TACGTGACGC CGAAAAACAG TCAGCACGAG
GTGGCGGAGG ACGCCATGAA GCGTTTTCGG CAGTCGGGGG TGCGGCTGCG TTCGCTCACC
GTGACCGCGA AGGGCAGGAT CTGTTTCATG AACGAACCTG TCTGCACGCC GGACTACTGC
GAGTACGCGC GCGATTACTA CGGAAAGCTC GCCCGGCACG GCGTAGTCGA GCTGATGGCC
GGGAAAAGGA AACTGAAGGA GCGGACTTTC CGCGATTTCG GGGAGAAGTA CCAGCTTTGC
CCCTTCGAGT TGCAGATCGA AGCGGTGCCG CTGGCCGACG TCGTCATCTG CGACTACAAC
TACGTCTTCG CCCCCAGGTC CGCGCTGGGG CGGGCCGCGG CACTCGCCGT GGAGCAGAGC
GGGAAGCCGA ACCTGGTGAT AGACGAGGCG CATAACCTCC CCTCCCGCGC CATGGATTAC
TACTCCCCCA GGCTTTCCAG CGCGCTTTTG GATGGGATGC GGGGCGATCT GGCGGAAATT
CCCAACCCAT TTCGGCGCGA CGCACTGGAG CTTTTGGATG AATGCCTGGC GGCGGTGGCG
TCCTGCGGCG GCAGCGGAAA AGGGATGCGC CCGGTAAAGA TAGAGCCCCC CCTGGCCCCT
TTCCTGGAGC TGGACGGGAG GCTGCGCGCC CTGCTGTCGC GCTACCTGGA GGCGGATGTG
GAGATCAGGC AAAAGGACCC GGTCTTGAAG CTCTCCCATT ACTGGGGGGA GTTCGCCGAG
ATCCTGGAGT TCGCCACAGC CTCGAAGCGG CGGGAGTTTT TCACCTCCTA TGAGCCTCAT
TCCGGCGGAG GGAGCGTCAA GATAACCTGC TGCGACGCCT CCGCCCTCAT TGCGGACCGG
TACGGGGAGT ACCAGCAGGT GGTGGCGTTC TCGGCGACCC TGAAGCCGTT CGATTACTAC
GCGAAGCTTT CCGGGCTCGA CCCGGAACAG GTGCGCTGCG CCGAGTTCCA GAGCCCCTTT
CCCAGCAGCC TGCGCAAGCT GATGATCATC CCGCAGATCT CGACCAGATA CTCCCAGCGC
GAACGCAATT ACGGCCGGAT CGCCGATGCC CTGGCCCGGG TAGTGGCGCT GAAAAGCGGA
AACTACCTGG CATTTTTCCC GAGTTTCGTT TTCCTTGAGC GGGTGGCGGA GCTCTTTCGG
GCGCCGGAAG GGTTCGAGGT GCTGCTCCAG GAGAGGAAGA TGAGCGGGGC GAGGGTGACC
GAGATCCTGG AGCGGCTGCG CGCCGGGAAC GCCCCGACCC TTGTCTTCGC CGTGCAGGGA
GGCTCCCTTT CCGAAGGGGT GGATTACGCG GGGGACATGG TGATAGGCGC CTTCGTGGTA
GGTCCCCCCC TTCCCAACTT CGACCTGGAG CGGGAGGAGA TGAGGGGGTA CTACCAGCGC
CATTACGGCA ACGGCTACCA GTACGCCTAC ACCATTCCGG CCATGGCCAA GGCGATCCAG
GCGGCGGGGC GGGTGATCCG CTCGGAAACG GACCGCGGCC TGATCGTCTT GATGGACGAC
CGTTTCCTGC GAGAGGAGTA CAGCTCAGCC ATGCCTGGGG ATTGGTTCGA GTCCGGGGCC
AGCGAGCTGG TCTCGGAGGC CATATTGAGC GATATCAGGG AGTTTTGGGG ACGGTGA
 
Protein sequence
MKPAPNLINI PVGYFALPVP RTGSIEPRSG YDRSTAEGRE IHLRIQKKRA QADSSYQAEV 
PVSRVFERGG YQFQVSGRLD GVFRHAPPLI EEIKSCFNLW ELRRRLAHAG TDEPYSLQLL
TYGYFHWLEH GVIPRLSFHL VSSRNGASED LPVRLDLEAY RVWLELRLDE LAREADEASK
RARRRRKAAK EFPFPFAAPR PGQLELMGEI ESGMVEGRAM LIQAPTGLGK TAGVLHPVLK
EALGRGQTVC YVTPKNSQHE VAEDAMKRFR QSGVRLRSLT VTAKGRICFM NEPVCTPDYC
EYARDYYGKL ARHGVVELMA GKRKLKERTF RDFGEKYQLC PFELQIEAVP LADVVICDYN
YVFAPRSALG RAAALAVEQS GKPNLVIDEA HNLPSRAMDY YSPRLSSALL DGMRGDLAEI
PNPFRRDALE LLDECLAAVA SCGGSGKGMR PVKIEPPLAP FLELDGRLRA LLSRYLEADV
EIRQKDPVLK LSHYWGEFAE ILEFATASKR REFFTSYEPH SGGGSVKITC CDASALIADR
YGEYQQVVAF SATLKPFDYY AKLSGLDPEQ VRCAEFQSPF PSSLRKLMII PQISTRYSQR
ERNYGRIADA LARVVALKSG NYLAFFPSFV FLERVAELFR APEGFEVLLQ ERKMSGARVT
EILERLRAGN APTLVFAVQG GSLSEGVDYA GDMVIGAFVV GPPLPNFDLE REEMRGYYQR
HYGNGYQYAY TIPAMAKAIQ AAGRVIRSET DRGLIVLMDD RFLREEYSSA MPGDWFESGA
SELVSEAILS DIREFWGR