Gene Information |
Locus tag | GM21_1682 |
Symbol | |
ID | 8137013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1962396 |
End bp | 1964165 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869294 |
Product | Rhs element Vgr protein |
Protein accession | YP_003021494 |
Protein GI | 253700305 |
COG category | [R] General function prediction only |
COG ID | [COG3500] Phage protein D |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein |
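The coordinate and length fields in the gene record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon. Neither assumption is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for GM21_1682.
length = gene_length(1962396, 1964165)
print(length)                  # 1770, matching "Gene Length"
print(protein_length(length))  # 589, matching "Protein Length"
```

Because the strand is "-", the start codon would sit at the End bp coordinate on the reference strand under these assumptions; the length arithmetic is unaffected.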
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 110 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACTG AGTCGACCAT CCCCACGCCG GCGACGCCGG ACGTCTGCAC GATCGATCTC CTCGTCGAGG GAAGCGCGAT TCCCGGCGAG TACCACGTCC TTTCAGTTGC GGTGAGAAAG GAGATCAACC GCATCCCGAC GGCTACCCTG GTGCTGCGCG ACGGGGAGGC AGCGAAGGCC ACCTTCCAGG TCAGCAACAG CGATCATTTC CTACCAGGAA ACAAGGTGGA GATCAGGCTG GGATACCGGT CGAACAACGA AACGGTGTTC AAGGGGGTGG TGATAAAGCA GGGGATCAGC ATCCGCAAGA GCGGCAGCAT CCTGACGGTT GAGTGCCGCG ACGAGGCGGT GAAGATGACC TGCGGCGCCA AGAGCCGCTA CTACACAGGC ATGAAGGACA GCGACATCTT GGAGCAGATC ATCGCTTCGT ATCGGCTCGA CAAGGACGTG CAGGCGACCA AGCCCGACCT TAAGGAAGTG ACGCAGTACA ACGCGACCGA CTGGGATTTC CTCCTCTGCC GGGCGGAGGC CAACGGCCAG GTGGTGATCG TCAGCGACGG CAAGGTGAGC GTAACCCAGC CTGCCGCAAG CGAGGAACCG GTCCTTTCGG TGGGGTACGG CAGCACCCTT TTGGAGCTGG ACGCGGAGAT AGACGCGCGC AGGCAAAGCA CGGGGATCGT GGCCCGCAGT TGGAGCGGGA CGGACCAGGA CGTGCTGGAA GCCGAGGCGA AGGAGCCGGC GAAGACCGTG GCGGGAAACC TGGCCCCGGA CACGCTGGCA AAGGTTTTGG GGGGCGACCC CCACGAGATG AGGCACGAAG GCAAACTCAC CACCCCCGAA CTGCAGGCGT GGGCCGACGG GCGGCTCCTC AGGGAGCGCC TGGCCAAGGT GCGCGGCAGG GCGAAGTTCC AGGGGTTCGC CAAGGTGGCT CCGGGGAAGG TCATGGAGGT GAGCGGCATC GGCGAGCGGT TCCAGGGGAG GTTCTACGTG GCGGGCGTGC GCCACGTGGT GGACAAGGGG AACTGGGAGA CCGACGTGCA GTTCGGTTTG AGCACCGAGA CCTTCGCCGA GACCTTCGAC CTCCGCCCGC TCCCCGCATC GGGGCTTCTT CCGGCGGTGA GCGGACTGCA GATGGGAGTG GTGACGGTCC TGGAGAACGA CCCGCAGGGG GAGGACCGGA TCAAGGTCCG CCTGCCGCTG GTGAACAAGG CGGAGGAAGG GCTCTGGGCG CGGCTGGCGA CGCTCGACGC TGGCAACAAG AGAGGGACCT TTTTCCGCCC CGAGGTCGGC GACGAGGTGG TGGTCGGTTT CCTGGGGGAC GACCCCTGCC ACCCGGTGGT GCTGGGGATG TGCCACAGCA GCGCGAAGCC CGCCCCGGAA CCCGCCAAGG ACAAGAACCA CCGCAAAGGG TACGTCAGCC GGTCGAAGCT CAAGTTCACC TTCGACGACC AGAACAAGGT GGTGCTCCTG GAGACGCCGG GCGGCAACAG GCTGGCGCTT TCGGAGGCGG ACAAGGGGAT CGTCATCAAG GATCAAAACG GCAACAAGAT CATCCTCGAC AACACCGGGG TGCGCATAGA GAGCAGCAAG GACCTGACAC TTAAGGCGGC GAAAAACGTG AACATCGAGG CATCGGCCCG CCTGAATCTG AAGGCGCAGA CCTCCTTCAA GGCGGAGGGG GCTGCCAGCG CAGAGGTCTC GGGCGCAAGC ACCACGGTCA AGGGAAGCGC CAAGACGGTG ATTCAGGGGG GGATCGTGCA GATAAATTAG
|
Protein sequence | MSTESTIPTP ATPDVCTIDL LVEGSAIPGE YHVLSVAVRK EINRIPTATL VLRDGEAAKA TFQVSNSDHF LPGNKVEIRL GYRSNNETVF KGVVIKQGIS IRKSGSILTV ECRDEAVKMT CGAKSRYYTG MKDSDILEQI IASYRLDKDV QATKPDLKEV TQYNATDWDF LLCRAEANGQ VVIVSDGKVS VTQPAASEEP VLSVGYGSTL LELDAEIDAR RQSTGIVARS WSGTDQDVLE AEAKEPAKTV AGNLAPDTLA KVLGGDPHEM RHEGKLTTPE LQAWADGRLL RERLAKVRGR AKFQGFAKVA PGKVMEVSGI GERFQGRFYV AGVRHVVDKG NWETDVQFGL STETFAETFD LRPLPASGLL PAVSGLQMGV VTVLENDPQG EDRIKVRLPL VNKAEEGLWA RLATLDAGNK RGTFFRPEVG DEVVVGFLGD DPCHPVVLGM CHSSAKPAPE PAKDKNHRKG YVSRSKLKFT FDDQNKVVLL ETPGGNRLAL SEADKGIVIK DQNGNKIILD NTGVRIESSK DLTLKAAKNV NIEASARLNL KAQTSFKAEG AASAEVSGAS TTVKGSAKTV IQGGIVQIN
|
| |
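The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch, with hypothetical helper names, that strips the spacing, computes GC fraction, and splits a CDS into codons. It is demonstrated on the first 30 bases of the gene sequence only; running it on the full sequence would be the way to compare against the reported 64% GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a coding sequence into consecutive 3-base codons."""
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


# First three blocks of the gene sequence listed in the record.
prefix = clean_sequence("GTGAGCACTG AGTCGACCAT CCCCACGCCG")
print(len(prefix))        # 30
print(codons(prefix)[0])  # GTG
print(gc_content(prefix))
```

The first codon is GTG rather than ATG. Under translation table 11 (the bacterial code listed in the record), GTG is an accepted alternative start codon and is read as methionine at the initiation position, which is why the protein sequence begins with M.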