Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0821 |
Symbol | |
ID | 8136137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 974028 |
End bp | 977156 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644868435 |
Product | type III restriction protein res subunit |
Protein accession | YP_003020649 |
Protein GI | 253699460 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.0473136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACAAGC TCCCCCATGG CATATATGAA GCCCTTATAG ATGAGTCACT TCGGGACGCG CTAAATCAGC GGCCTGAACT GCGGGGAGTG TTTGGAAAAA TCGACCCAGA AGAACAACCA TCTCGTTATG CCGCCTTCGT GGGAAAGGTT TTGGAACAGG CTCTGCGGGA AGAGTCCGAC CCGGAAAAGA GGCTTAAACT CTGCAATCAG CTTCTGGGGG TGGTCACGAA AGAGCCTGGC CATAGCCATC TGGACGGGCA CCGTCTTGTT TCTGAACCAA AGCAGGTTCT GCTGGAAATA ACCCCGCCGC ACTATGCTAC TCAAGGCATA CCCCGCCCTC ATACACCCCT AACAGAAAGT AGCCTTTTTA CAGGTTCGCC CCAGGAACCG CAACTCGCCC ATGAACTGCT GGAGGAGATG CGCTCAGCTG ATGCAGTCGA AATCCTGGTG TCCTTCATCA AATGGTCTGG CTTGCGCTTG CTTATGCCAG CTTTTGAAGA TTTGCGCGAC CGTAACGTGC CGGTTCGTCT GATCACCACT TCATATATGG GGGCTTCTGA TGCTCCTGCT GTTGAGTGGC TGGCAAGGAT GCCGAATGTT GAGGTTCGCA TCTCCTATGA TACCGATCGG ACGAGGCTTC ATGCCAAAGC CTACCACTTC CGACGCAACA GCGGATTCTC CACGGCGTAC ATTGGCTCAG CCAACATGTC CCATGCAGCC ATGACCAGCG GATTGGAGTG GAACCTCAAG GTGACCGCCC AAGACATGGG GCACATTATC GAGAAATTTT CGGTCGAGTT CGAAACCTAT TGGAATAGCC GCGAGTTTGT TCCCTTTGAC CCAGAGCAGC CGACCCTTTT CCGCACGGCC ATTAACAGAG CCCGCCACCG CAGTCAAGAC AACCCTGCTA TCTTCTTTGA TCTCCGGCCG CACCCTTTTC AGGAGCGCAT CCTTGAAGCG CTGGCAAGGG AGCGAACTTC GCATGACCGT TGGCGCAACC TGGTCATAGC TGCAACCGGT ACGGGCAAGA CGGTAATTGC TGCCTTTGAC TTCAAGAATT TCTTCGAGGC AAAGAGGCGC CAGGCCCGTC TCCTGTTCAT TGCTCATCGC CTGGAGATTC TCCAGAAGGC GCAGGGGACC TTTCGCAACG TCCTTCGAGA CCAGAACTTT GGGGAGTTGC TGGTCGGCGA GTATCAAGCC GTTCGCCTGG AGCATCTGTT CTGTTCTGTT GGGATGCTGG CGAATAGGCG CCTGTGGGAA CAGGTGGGCA GCGGTTTCTA CGACTATATC GTCATCGACG AGGCTCATCA CAGTACCGCG TCCAGCTACC GTCCTATCTT CGAAAACTTT GCACCTGAGA TCCTCCTCGG CCTGACCGCC ACACCAGAGC GGATGGATGG CGGTAACGTG GCTGCCGATT TTGGCAACCG TTTTGCCGCG GAGATACGCC TTCCTGAGGC GTTGGAAGAA AAGCTCCTCT GCCCTTTTCA TTATTTCGGC GTTGCTGATC CGATTGCAAT AAGTGGAGAA CAGTTCTGGC GCAATGGCAA ATACAATGAG TCTGCCCTTG AAAACGTATA TGTCATGGAC AATGTACGGG CAAAGCAGCG CGTTGATGCC ATCATAACTG CACTTACTCG CTATGAACCC GATCTCAGCA ACGTCAAAGG GGTCGGCTTT TGCGTCACAA TCAGGCATGC ACATTTTATG GCCGAACAGT TCTCGAAGAG AGGGATACCA TCTGGTGCCT TTGTCTCTGG CGTCGAAGAT GATCGGTGTA GCAAACTATT GGAAGACCTG AGTACAGGGA GGCTTACCTT CCTCTTCACC GTGGACAAAC TCAGCGAAGG TGTGGATATT CCCCTGCTGA ACACGGTTCT CTTTCTACGT CCCACTGAGA GTCTGACGGT CTTTTTGCAA CAGCTAGGTC GAGGACTGCG CCACGCACCA GGCAAAGATT GCCTCACAGT TATTGACCTT GTCGGACAGG CGCACCGTCG CTACCGTGCC GACATCAAGC TAAAAGCGCT CATGCCTCGG CACCGTTACT CCATAGATAG GGAAGTTGAG GCCGATTTCC CTCATCTACC GTCAGGCTGT TCGATTCAGC TCGACCGGCT TTCTCGCAAA TACATTCTAG ACAATATCCG TGAAAACTTC GGCAGACTGG CCGTTCAGGT TACGGACCGG CTACAGACCT TCACCATGGA GACCGGACAG GAACTCACCT TCGGCAACTT CGTTCGTTTC CACGACTATG AGCCCGAAGT ACTGCTGGTA AAGGAGTCTT GGTCTGAGTG GAAATCGCGG GCTCAGTTGG CACCGATTCC AGATGATCCT GATCTCGCAC GGCTAAGAAA GAGCCTTGTG AAAGTTGCCT TCATCAATGG ACCGCGGGAA GTGGGCCTGT TGCGTGCAGT ACTTGGCAAG GTTTCCCAAG GTGCTGTCGA TGAAGCACTG GCCCTTGCTG GAGATTCAAT CCCGTCCATC TATTACCGGA TATGGGGGGG CAAGGGAAGC AAGCTTGGTA TTAGCAGCCT TCGTGAGGCA TTTACACGGC TAGCTGGAAA TCCTTCCATA CTTAGCGATA TGGATGAAAT ATTGAGTTGG TCACTTGAAA TAACGGAGAT CGGCGGAGAG ATACCTGTTT TGCCCTTTGC TTGTCCACTC GAACTGCACG CACAGTACGG CGGCATGGAG ATTCAAGCGG CGTTTGGCAA GGCGACACTT GAGACATCGG GACAGACCGG GGTCGGAGTT TTCCATTTCT CCGAGCAGAA AGCCTATGCA TTGCTGGTCA CCTTTCAGAA GACTGAAAAG GAGTTCTCCC CGAGCACCAT GTATGCTGAC TATCCAATAA GCCGACAGCT GATGCATTGG GAGTCACAGG CAAATACCGC GCAACATCAC GCTGATGGTC AGAATTTAAT TCACCATGCA GAGCGAGGCT ACACAATTCT AATCTTTGCC AGAGGCCAGA AAAAGCGGAA CGGAGTCACG GTGCCGTTCA CATATCTTGG CCCGGCAGAG CGGGTTAGCT ACGAGAGTGA GAGACCGATC AAAATGGTCT GGAAGCTAAG GCACCAGATG CCAGTGGAGA TGTTTGAAGA TAATCGGCGA GGCGGGTGA
|
Protein sequence | MNKLPHGIYE ALIDESLRDA LNQRPELRGV FGKIDPEEQP SRYAAFVGKV LEQALREESD PEKRLKLCNQ LLGVVTKEPG HSHLDGHRLV SEPKQVLLEI TPPHYATQGI PRPHTPLTES SLFTGSPQEP QLAHELLEEM RSADAVEILV SFIKWSGLRL LMPAFEDLRD RNVPVRLITT SYMGASDAPA VEWLARMPNV EVRISYDTDR TRLHAKAYHF RRNSGFSTAY IGSANMSHAA MTSGLEWNLK VTAQDMGHII EKFSVEFETY WNSREFVPFD PEQPTLFRTA INRARHRSQD NPAIFFDLRP HPFQERILEA LARERTSHDR WRNLVIAATG TGKTVIAAFD FKNFFEAKRR QARLLFIAHR LEILQKAQGT FRNVLRDQNF GELLVGEYQA VRLEHLFCSV GMLANRRLWE QVGSGFYDYI VIDEAHHSTA SSYRPIFENF APEILLGLTA TPERMDGGNV AADFGNRFAA EIRLPEALEE KLLCPFHYFG VADPIAISGE QFWRNGKYNE SALENVYVMD NVRAKQRVDA IITALTRYEP DLSNVKGVGF CVTIRHAHFM AEQFSKRGIP SGAFVSGVED DRCSKLLEDL STGRLTFLFT VDKLSEGVDI PLLNTVLFLR PTESLTVFLQ QLGRGLRHAP GKDCLTVIDL VGQAHRRYRA DIKLKALMPR HRYSIDREVE ADFPHLPSGC SIQLDRLSRK YILDNIRENF GRLAVQVTDR LQTFTMETGQ ELTFGNFVRF HDYEPEVLLV KESWSEWKSR AQLAPIPDDP DLARLRKSLV KVAFINGPRE VGLLRAVLGK VSQGAVDEAL ALAGDSIPSI YYRIWGGKGS KLGISSLREA FTRLAGNPSI LSDMDEILSW SLEITEIGGE IPVLPFACPL ELHAQYGGME IQAAFGKATL ETSGQTGVGV FHFSEQKAYA LLVTFQKTEK EFSPSTMYAD YPISRQLMHW ESQANTAQHH ADGQNLIHHA ERGYTILIFA RGQKKRNGVT VPFTYLGPAE RVSYESERPI KMVWKLRHQM PVEMFEDNRR GG
|
| |