Gene GM21_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0821 
Symbol 
ID8136137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp974028 
End bp977156 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content53% 
IMG OID644868435 
Producttype III restriction protein res subunit 
Protein accessionYP_003020649 
Protein GI253699460 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value0.0473136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACAAGC TCCCCCATGG CATATATGAA GCCCTTATAG ATGAGTCACT TCGGGACGCG 
CTAAATCAGC GGCCTGAACT GCGGGGAGTG TTTGGAAAAA TCGACCCAGA AGAACAACCA
TCTCGTTATG CCGCCTTCGT GGGAAAGGTT TTGGAACAGG CTCTGCGGGA AGAGTCCGAC
CCGGAAAAGA GGCTTAAACT CTGCAATCAG CTTCTGGGGG TGGTCACGAA AGAGCCTGGC
CATAGCCATC TGGACGGGCA CCGTCTTGTT TCTGAACCAA AGCAGGTTCT GCTGGAAATA
ACCCCGCCGC ACTATGCTAC TCAAGGCATA CCCCGCCCTC ATACACCCCT AACAGAAAGT
AGCCTTTTTA CAGGTTCGCC CCAGGAACCG CAACTCGCCC ATGAACTGCT GGAGGAGATG
CGCTCAGCTG ATGCAGTCGA AATCCTGGTG TCCTTCATCA AATGGTCTGG CTTGCGCTTG
CTTATGCCAG CTTTTGAAGA TTTGCGCGAC CGTAACGTGC CGGTTCGTCT GATCACCACT
TCATATATGG GGGCTTCTGA TGCTCCTGCT GTTGAGTGGC TGGCAAGGAT GCCGAATGTT
GAGGTTCGCA TCTCCTATGA TACCGATCGG ACGAGGCTTC ATGCCAAAGC CTACCACTTC
CGACGCAACA GCGGATTCTC CACGGCGTAC ATTGGCTCAG CCAACATGTC CCATGCAGCC
ATGACCAGCG GATTGGAGTG GAACCTCAAG GTGACCGCCC AAGACATGGG GCACATTATC
GAGAAATTTT CGGTCGAGTT CGAAACCTAT TGGAATAGCC GCGAGTTTGT TCCCTTTGAC
CCAGAGCAGC CGACCCTTTT CCGCACGGCC ATTAACAGAG CCCGCCACCG CAGTCAAGAC
AACCCTGCTA TCTTCTTTGA TCTCCGGCCG CACCCTTTTC AGGAGCGCAT CCTTGAAGCG
CTGGCAAGGG AGCGAACTTC GCATGACCGT TGGCGCAACC TGGTCATAGC TGCAACCGGT
ACGGGCAAGA CGGTAATTGC TGCCTTTGAC TTCAAGAATT TCTTCGAGGC AAAGAGGCGC
CAGGCCCGTC TCCTGTTCAT TGCTCATCGC CTGGAGATTC TCCAGAAGGC GCAGGGGACC
TTTCGCAACG TCCTTCGAGA CCAGAACTTT GGGGAGTTGC TGGTCGGCGA GTATCAAGCC
GTTCGCCTGG AGCATCTGTT CTGTTCTGTT GGGATGCTGG CGAATAGGCG CCTGTGGGAA
CAGGTGGGCA GCGGTTTCTA CGACTATATC GTCATCGACG AGGCTCATCA CAGTACCGCG
TCCAGCTACC GTCCTATCTT CGAAAACTTT GCACCTGAGA TCCTCCTCGG CCTGACCGCC
ACACCAGAGC GGATGGATGG CGGTAACGTG GCTGCCGATT TTGGCAACCG TTTTGCCGCG
GAGATACGCC TTCCTGAGGC GTTGGAAGAA AAGCTCCTCT GCCCTTTTCA TTATTTCGGC
GTTGCTGATC CGATTGCAAT AAGTGGAGAA CAGTTCTGGC GCAATGGCAA ATACAATGAG
TCTGCCCTTG AAAACGTATA TGTCATGGAC AATGTACGGG CAAAGCAGCG CGTTGATGCC
ATCATAACTG CACTTACTCG CTATGAACCC GATCTCAGCA ACGTCAAAGG GGTCGGCTTT
TGCGTCACAA TCAGGCATGC ACATTTTATG GCCGAACAGT TCTCGAAGAG AGGGATACCA
TCTGGTGCCT TTGTCTCTGG CGTCGAAGAT GATCGGTGTA GCAAACTATT GGAAGACCTG
AGTACAGGGA GGCTTACCTT CCTCTTCACC GTGGACAAAC TCAGCGAAGG TGTGGATATT
CCCCTGCTGA ACACGGTTCT CTTTCTACGT CCCACTGAGA GTCTGACGGT CTTTTTGCAA
CAGCTAGGTC GAGGACTGCG CCACGCACCA GGCAAAGATT GCCTCACAGT TATTGACCTT
GTCGGACAGG CGCACCGTCG CTACCGTGCC GACATCAAGC TAAAAGCGCT CATGCCTCGG
CACCGTTACT CCATAGATAG GGAAGTTGAG GCCGATTTCC CTCATCTACC GTCAGGCTGT
TCGATTCAGC TCGACCGGCT TTCTCGCAAA TACATTCTAG ACAATATCCG TGAAAACTTC
GGCAGACTGG CCGTTCAGGT TACGGACCGG CTACAGACCT TCACCATGGA GACCGGACAG
GAACTCACCT TCGGCAACTT CGTTCGTTTC CACGACTATG AGCCCGAAGT ACTGCTGGTA
AAGGAGTCTT GGTCTGAGTG GAAATCGCGG GCTCAGTTGG CACCGATTCC AGATGATCCT
GATCTCGCAC GGCTAAGAAA GAGCCTTGTG AAAGTTGCCT TCATCAATGG ACCGCGGGAA
GTGGGCCTGT TGCGTGCAGT ACTTGGCAAG GTTTCCCAAG GTGCTGTCGA TGAAGCACTG
GCCCTTGCTG GAGATTCAAT CCCGTCCATC TATTACCGGA TATGGGGGGG CAAGGGAAGC
AAGCTTGGTA TTAGCAGCCT TCGTGAGGCA TTTACACGGC TAGCTGGAAA TCCTTCCATA
CTTAGCGATA TGGATGAAAT ATTGAGTTGG TCACTTGAAA TAACGGAGAT CGGCGGAGAG
ATACCTGTTT TGCCCTTTGC TTGTCCACTC GAACTGCACG CACAGTACGG CGGCATGGAG
ATTCAAGCGG CGTTTGGCAA GGCGACACTT GAGACATCGG GACAGACCGG GGTCGGAGTT
TTCCATTTCT CCGAGCAGAA AGCCTATGCA TTGCTGGTCA CCTTTCAGAA GACTGAAAAG
GAGTTCTCCC CGAGCACCAT GTATGCTGAC TATCCAATAA GCCGACAGCT GATGCATTGG
GAGTCACAGG CAAATACCGC GCAACATCAC GCTGATGGTC AGAATTTAAT TCACCATGCA
GAGCGAGGCT ACACAATTCT AATCTTTGCC AGAGGCCAGA AAAAGCGGAA CGGAGTCACG
GTGCCGTTCA CATATCTTGG CCCGGCAGAG CGGGTTAGCT ACGAGAGTGA GAGACCGATC
AAAATGGTCT GGAAGCTAAG GCACCAGATG CCAGTGGAGA TGTTTGAAGA TAATCGGCGA
GGCGGGTGA
 
Protein sequence
MNKLPHGIYE ALIDESLRDA LNQRPELRGV FGKIDPEEQP SRYAAFVGKV LEQALREESD 
PEKRLKLCNQ LLGVVTKEPG HSHLDGHRLV SEPKQVLLEI TPPHYATQGI PRPHTPLTES
SLFTGSPQEP QLAHELLEEM RSADAVEILV SFIKWSGLRL LMPAFEDLRD RNVPVRLITT
SYMGASDAPA VEWLARMPNV EVRISYDTDR TRLHAKAYHF RRNSGFSTAY IGSANMSHAA
MTSGLEWNLK VTAQDMGHII EKFSVEFETY WNSREFVPFD PEQPTLFRTA INRARHRSQD
NPAIFFDLRP HPFQERILEA LARERTSHDR WRNLVIAATG TGKTVIAAFD FKNFFEAKRR
QARLLFIAHR LEILQKAQGT FRNVLRDQNF GELLVGEYQA VRLEHLFCSV GMLANRRLWE
QVGSGFYDYI VIDEAHHSTA SSYRPIFENF APEILLGLTA TPERMDGGNV AADFGNRFAA
EIRLPEALEE KLLCPFHYFG VADPIAISGE QFWRNGKYNE SALENVYVMD NVRAKQRVDA
IITALTRYEP DLSNVKGVGF CVTIRHAHFM AEQFSKRGIP SGAFVSGVED DRCSKLLEDL
STGRLTFLFT VDKLSEGVDI PLLNTVLFLR PTESLTVFLQ QLGRGLRHAP GKDCLTVIDL
VGQAHRRYRA DIKLKALMPR HRYSIDREVE ADFPHLPSGC SIQLDRLSRK YILDNIRENF
GRLAVQVTDR LQTFTMETGQ ELTFGNFVRF HDYEPEVLLV KESWSEWKSR AQLAPIPDDP
DLARLRKSLV KVAFINGPRE VGLLRAVLGK VSQGAVDEAL ALAGDSIPSI YYRIWGGKGS
KLGISSLREA FTRLAGNPSI LSDMDEILSW SLEITEIGGE IPVLPFACPL ELHAQYGGME
IQAAFGKATL ETSGQTGVGV FHFSEQKAYA LLVTFQKTEK EFSPSTMYAD YPISRQLMHW
ESQANTAQHH ADGQNLIHHA ERGYTILIFA RGQKKRNGVT VPFTYLGPAE RVSYESERPI
KMVWKLRHQM PVEMFEDNRR GG