Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1677 |
Symbol | |
ID | 8137008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1953865 |
End bp | 1957260 |
Gene Length | 3396 bp |
Protein Length | 1131 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869289 |
Product | hypothetical protein |
Protein accession | YP_003021489 |
Protein GI | 253700300 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 129 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAAC CTTCCCCTGT CATAGCGAAC CTGCCGCAGC TTCCCCCCGA GGAGGATTTC CACCGCCTGC GCCGCGAGGG GATCGGCTTC ATCGTGCAGA TGGGAAGCCG CCTCTGGACC GAGTACAACG AGACCGACTC CGGAATCGCC ATACTGGAGG CGCTCTGTTA CGCCGTAACC GACCTAGGCT ACCGCAGCGG CTGGGAGATC CGGGACCTCC TGGCGCCTCC CCTCCCCTCC CCCGACCCGG CGCACCCCTT CCCCAACCAG CCCTTCTTCA CCGCTCGGGA GATCCTCACG GTCAACCCGT GGACCCCCGA CGACTTCAGG CGGCTTTTGA TCGACCAGGA GGCGGTGCGA AACGCCTGGA TCGCCTGCAA GGAGTGCGCG TGCGATACCA GCTACTACGC CTGGTGCGAG GGAGAGAGGC TCGCCCTTTC CTACCAGTTG CCGGAAAACC GAAGGCTCAG GCCCAAGGAG GTTTGGCCGC TGGGGCTGTA CGAGGCGCTC TTGGAATTGG AGGCGGACGC GGACCTTGGA GACCTGAACG ACCGCAAGGT CGAAGCCGCC GTCACGCTGG AGGACGCAAA CGGCAAGCAC CCGCTCACGA CGGAGCTCCG CTTTCACGAC ATGGCCCTCT CGGACCGGGT CGGCTGGGGA CTTTTTCTGG GGAGCGACGA CGCATTCGCC GGACGCAACG GCCAATCCTT CAACCTGAAG CTGATCGGCT TCGGTGCTAC GCGCAATTAC GACCTCTTGA CGGACCCGAA CCTGGACGAC GCCGGGCGCA ACGACTACCT GCGCCGGCAC TGGCGCGACC TCTTCTACCT CGCCCTGGAA ATCGAGATGG TACCGACCGG CAAGAAGATC GTGCTGCATG CGACGCTGAG GTTCTTGGGG GACGCTGCGG CGAGGGGTGC CGCGACGGTG GCGGCGCTTA AGGGGATCTT GGAGGAGACG GGCGTGAACG GGCTCACGCA GCGCTACCGC AAAAAAGAGC TGCAAAAGGC GGCCGGTGTG GCGCGGGCGA AGGAATCGCT TTTCTCGCAC CGGAACCTCG ACGAGGAGTT CTGCCGGGTG AAGCTGATCG GGATCGAGGA GGTGGCCGCC TGCGCCGACG TGGAGGTATC CCCCGAGGTC GACATCGAGC TGGTGCAGGC GCGGATCTGG TTCGAGATCG AGCAGTACCT GAACCAGGCG GTCCCGTTCT ACACGCTCCG TGAGATGCTG GAGCAGGGTT TTCCGGTGGA GGAGATCTTC AACGGGCCGG CGCTTAAGAG CGGCTTCATC AGGACGACCG ACCTGGAGCA GGCGACGCTG AGATCCGTCC TCTGCGTCTC CGACCTCCTT AACCGGCTGA TGGAGATAGA CGGCGTCCTA GCGGTGAACC ACCTGCAGCT CACCAAGTAC GACCCGGAAG GAAAGGCGGT CAAGGGTGCC GCCGACCCGG CCTGGACGAG CGACGGCAAA CCGATCTTCG ACCCCGGCAA GATCAGCGCC TCCTGGCTTT TGTACCTGAG CCCCCAGCAC CTGCCCAGGC TTTACCGCAA CGCCTCGCGC TTTCTCTTCT ATAAGAACGG GCTCCCCTTT CTCCCCCGGA TGGATGAGGC CCTGGCGACA CTTACCCAGT TGCGCGGTGA GGCCGAGCGG ATGCGGGTGA AGAACGCCCC GAACGATCTC CCCATTCCAG CGGGGAACTA CCGGGACCCG GCGGCGTACT TCCCGGTCCA GTACAGCTTT CCCCTCACCT ACGGCATCGG GGTGGACCAA CTTCCTGCCA ACGCGAACGC GAAGCGAAGG GCGCAGGCGA AGCAGTTTAA GGGCTACCTC ATGGTGTTCG AGCAGCTCCT CGCCGACGCC CTGGAGCAGC TCGCGCACAC GGCCGACCTC TTCTCGCTGG ACCCGCTGGT GAAGCGGACC TACTTCGCCG CACACCTGAG CGAGGCGTTG ATCCAGGGGT ACAACGAGCT CTCCACCATC ACCCAGGCGA CCCTTGAAGC GCTGCTCGAA AAGGAGCCGG AATTCCTCAA GCGGCGCAAC AGGTTTCTCG ATCACGTGCT GGCGCGGTTC GGCGGGGAAT ACAGGGAATT CACCCTGCTC CTGGAGAAGC TGCAGGGACA GCAGGTAGCG CTTGGAGCGC TTATCGGCGA CAAGATCGAC TTCATCACCG CCTACCCGGT CGTAAGCCGC GACCGGGCCA AGGCTTTCAA CAGGGAGCTT GCCTGCGCGC CGGGGAACGA CCCCGCGATC AAGCGGCGCA TCGCGCTGCT TCTTGGGAAA AAGGAGTTGA GCGACCGGAT CATCGTGGTC GAGCACCTGC TTTTGCGCCC GAAATTCCCC GGGGACGCGC TCTACCCCGC ATGCAGCGAT GGCGCCTGCC GGCTATGCGG AGAAGAGGAC CCCTATTCAT TCAGGCTCAC CCTGGCGATG CCGGGGTGGA TGGAACCGTT TGACTCGGAC CTGGTGATGA GGGAGTACGC CGACCGGGTC ATAAGACAGG AGCTGCCGTC GCACCTGGTG GGGAAGATCT GCTGGGTCGG CAACGACGGC TTCGAGGAGG ACCCGTGCGA CCCCGTCATA TTGGAGCTGA CGCGATTAAT CGAGGAGAAG GGGAACGGCA TAGCCGGGGT CCGCCCCACC GAGGACGAGG CCTGCGCCTG CGCCCTAGGG GCGTACCACG AATTCTCCCA GGTATTCCGC GAGTGGTACC AGGACAAGGT GCTGCGGCAT ATCCACCCGG ACGCCCTGAA GCAGCAACTG GAGCTGTTGT TCGGCCAGAA AGTGGATCGC GCCACCATAC CCTGCGCCGC CGTCTGGGAC GACGAGCTGT GGGCGGAGGT GACGAAGCTT TTGGTCGGGC GTTTCCTGGA GATCGCCCTT TACGGCTTCC AGTTCGACCG CTTCCAGGCG GCCTGGTGCG CCTGGCTGGA AGCCGACGCC GCATTCGACT GGACGGAAGA GCGCCTGCAG GAACGGCTGC AGGCTATCCT TACCGAGAAT CTCCTCTCCA GTTCCGCCGA CCTCGGATCC CCGGCCGGCC GGATCTGCCG CTGCGCCGAA CGGATCCTGC GCAGCTACGG CGCCACGTTC GACCTCTGGA TGCAGGGTCT CGTAGCTTCG GACAGCTTCG ATCCCGACGC CCCTTTGCCC CCCTTCCCGC TCGATCCACC GCCGGAGTGC GCCGGACTCG GTTTCAAGGC GGGTACCATG GCGCGGTTGA AGGAGCTCGT CGAAGACAGG TACGGCGCCT ACAGGAATGT CTCCTACCGG CTCCGGGTCG TGCTGGACCT CCTGGGGAGG CTGCGGAACG TTTACCCTCC GGCGACCCTG CACGACTGCG ACGAAGGCGG CGACAAAAAC CCGGTGCGGC TGGGGACAAC AGCTTTAGGA AACTGA
|
Protein sequence | MSQPSPVIAN LPQLPPEEDF HRLRREGIGF IVQMGSRLWT EYNETDSGIA ILEALCYAVT DLGYRSGWEI RDLLAPPLPS PDPAHPFPNQ PFFTAREILT VNPWTPDDFR RLLIDQEAVR NAWIACKECA CDTSYYAWCE GERLALSYQL PENRRLRPKE VWPLGLYEAL LELEADADLG DLNDRKVEAA VTLEDANGKH PLTTELRFHD MALSDRVGWG LFLGSDDAFA GRNGQSFNLK LIGFGATRNY DLLTDPNLDD AGRNDYLRRH WRDLFYLALE IEMVPTGKKI VLHATLRFLG DAAARGAATV AALKGILEET GVNGLTQRYR KKELQKAAGV ARAKESLFSH RNLDEEFCRV KLIGIEEVAA CADVEVSPEV DIELVQARIW FEIEQYLNQA VPFYTLREML EQGFPVEEIF NGPALKSGFI RTTDLEQATL RSVLCVSDLL NRLMEIDGVL AVNHLQLTKY DPEGKAVKGA ADPAWTSDGK PIFDPGKISA SWLLYLSPQH LPRLYRNASR FLFYKNGLPF LPRMDEALAT LTQLRGEAER MRVKNAPNDL PIPAGNYRDP AAYFPVQYSF PLTYGIGVDQ LPANANAKRR AQAKQFKGYL MVFEQLLADA LEQLAHTADL FSLDPLVKRT YFAAHLSEAL IQGYNELSTI TQATLEALLE KEPEFLKRRN RFLDHVLARF GGEYREFTLL LEKLQGQQVA LGALIGDKID FITAYPVVSR DRAKAFNREL ACAPGNDPAI KRRIALLLGK KELSDRIIVV EHLLLRPKFP GDALYPACSD GACRLCGEED PYSFRLTLAM PGWMEPFDSD LVMREYADRV IRQELPSHLV GKICWVGNDG FEEDPCDPVI LELTRLIEEK GNGIAGVRPT EDEACACALG AYHEFSQVFR EWYQDKVLRH IHPDALKQQL ELLFGQKVDR ATIPCAAVWD DELWAEVTKL LVGRFLEIAL YGFQFDRFQA AWCAWLEADA AFDWTEERLQ ERLQAILTEN LLSSSADLGS PAGRICRCAE RILRSYGATF DLWMQGLVAS DSFDPDAPLP PFPLDPPPEC AGLGFKAGTM ARLKELVEDR YGAYRNVSYR LRVVLDLLGR LRNVYPPATL HDCDEGGDKN PVRLGTTALG N
|
| |