Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0429 |
Symbol | |
ID | 8135738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 506444 |
End bp | 507658 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644868047 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_003020267 |
Protein GI | 253699078 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 129 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCGA AGGTGAAGCC GGGTTATAAG CAGACCGAGG TGGGTGTGAT TCCGGAGGAG TGGGATTGTT GCATGCTTCG GGATGGTATT GTCCTTCTGT CTGGACATCA CATCTTGGCT CATTACTGCA ATATGAGTGG CTGCGGTGTT CCATACCTTA CGGGGCCGGC AGACTTTCGT AATGGCGCTA TTGCTAACAC CAAGTTCACG AATAAGCCTG CCACATTATG CAGTGATGGT GACATTCTTG TTACGGTAAA AGGTTCTGGT TCTGGAACTA TTGTAGTGGC TGACAAAATG TATTGCATCA GCCGACAACT GATGGCAATT AGGCCGTTGG AATGGAATTC TATATTCCTC TATTACTCAC TCCTTCAGAA CGCATTACAC TTTAAAGCCG CCTCTGCTGG ACTGATTCCG GGATTGTCCC GCTCAGACAT TCTAGAACAG TTGGTTCCGT TACCACCGCT CCCCGCACAA AACACGATTG CGGATGCATT GAGCGATGTA GATGTGTTGC TGGGGGCGCT GGACCGGCTT ATTGCCAAGA AGCGTGACCT CAAACAGGCC GCCATGCAGC AACTCCTTAC GGGTGAAACC AGACTCCCTG GGTTTCATGG CGAGTGGGCG GTGAAGCGGT TGGGAGATCT CGGGACCTTC TTGAAAGGTA ATGGCATCAG GAAAGACGAA GCGATGAGCG GCGCGCTGCC CTGTGTTCGT TATGGCGAGA TTTACACGCA CCACAACAAT TACGTGAAGT CATTCAACTC TTGGATCTCT CCAGAGGTGG CTGTCTCTGC AACACGTCTA AAAAAAGGCG ATTTGCTATT TGCTGGCTCT GGTGAAACCA AGGAGGAAAT CGGAAAATGT GTAGCTTGTA TAGATGATTG CGACGCATAT GCGGGTGGAG ACATAGTAAT TCTTCGCTTA GCCGCGGCCC ATCCTTTGTT CATGGGCTAT TACTGCAACA TCGCGACTGT AAATGCTCAG AAGGCCAGTA GAGCACAGGG GGATGCCGTG GTGCACATTG GTGCGGTTGC CCTGTCCAGT GTGTTGGTTT CAGTTCCGTC AGTAAGTGAG CAAGTTGCCA TCGCAGAGGT GCTATTCGAC ATGGACGCAG AACTCGCGGG TTTGGAGCAG CGTCGCGACA AGACTCGTTC CCTAAAGCAG TCCATTATGC AGGAATTACT CACCGGAAAA ACGCGCCTTA TCTGA
|
Protein sequence | MSAKVKPGYK QTEVGVIPEE WDCCMLRDGI VLLSGHHILA HYCNMSGCGV PYLTGPADFR NGAIANTKFT NKPATLCSDG DILVTVKGSG SGTIVVADKM YCISRQLMAI RPLEWNSIFL YYSLLQNALH FKAASAGLIP GLSRSDILEQ LVPLPPLPAQ NTIADALSDV DVLLGALDRL IAKKRDLKQA AMQQLLTGET RLPGFHGEWA VKRLGDLGTF LKGNGIRKDE AMSGALPCVR YGEIYTHHNN YVKSFNSWIS PEVAVSATRL KKGDLLFAGS GETKEEIGKC VACIDDCDAY AGGDIVILRL AAAHPLFMGY YCNIATVNAQ KASRAQGDAV VHIGAVALSS VLVSVPSVSE QVAIAEVLFD MDAELAGLEQ RRDKTRSLKQ SIMQELLTGK TRLI
|
| |