Gene Information |
Locus tag | GM21_1678 |
Symbol | |
ID | 8137009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1957257 |
End bp | 1960964 |
Gene Length | 3708 bp |
Protein Length | 1235 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869290 |
Product | hypothetical protein |
Protein accession | YP_003021490 |
Protein GI | 253700301 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02243] conserved hypothetical protein, phage tail-like region |
|
|
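The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for GM21_1678.
length = gene_length_bp(1957257, 1960964)
print(length)                     # 3708
print(protein_length_aa(length))  # 1235
```

Both results match the Gene Length (3708 bp) and Protein Length (1235 aa) fields.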
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 121 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAA GCGACTGCAA ACAAAACCGC GATCCGCTGC AACCGGTACG GGAGGGAACC AGCCGGGGAG AGCGGCTCTC CCCCGCGCTG GACCCGGGCT ACGCGCCGGT GGACGAGCGG ACCGCGGCAC ACCACATGGT GTTCGCGCAG GCGTACTCGG CCTTTCTCAA CTACTTCAAC GCCGACAACG CGCTCGACGG TGACTGGCAA CCTTTCTTCA GCGGCGACGT TTCGGTGCAA CTGGCGCTGG CGGCGGTGCA GGACGTCGAC GCCTACAGAA CCCGCATCAA GGAGTACCTC GACTTCCTGA AGGATCTTGA CTTCCAGTCG GACGAGGCGA AGGCCAAAGA GCACCTGGGC TACCTGTTCA GCGCCGCCGC GACGCTCGCG CGCCAGTTCG ACCTCTTTAG AAGCGGGCTT CCAGCCGACC TGCCGCTGAA ATCCACCCTG CAGGCCCTGA TCCAGACCCA GCTCGCTCCC GCCTTCAACC GGCTCACCGC CTATTTCAAG GCCGACCAGG CCCTCCCCCA GCCCGACCGC CTCATCGCGG AAGCGGCCCC CGACCTGGTG GTTTTGGGGA AGAGCGCAGT CGCGTTCGCT GAGGTTTACC AGGGTGGGCT GTCGGGGGAA TGGATCACGG ACGCCTCGGC GGACTGGGCC GCTTACACGA ACGCCGTACC CCCCGATGCG TCCGTGTACG GTTCGGGGAT TGAGCCTTTC CAGCGCATCA ACCACATAGC CACCCACAAC CTCTTCACCT CGGCCCTGGA CCAGTTTCTC AAGGCGTACG CCCGGACCGC GGCGGAAGCC GCCAAGGGAC TCGAAGCCTC GCTCACCGAC CGGGACGACC ACCACCCCCA TTACGCTCTC TTCCTGACCT TTCTGAGGCT CTTCGCCTAC GTGCGCGACG CCGCCAACAC CCTGACCGGC CGCCATCTCG ACTTCTATTA CCGCGACATC CTGCAACTCA AGGAAAAGGA GGCCAGGCCC AGCCACGCGC ACCTCCTTAT CGAACTCGCC AAGCAGGCAC CCGAGCACCT GGTGCGGGCC GGCGAACTCT TCAAGGCCGG AAAGGACGCG CTCGGGCACG ACGCCTTTTT CGCCAACGAC CGCGACTTCG TTGCGAACCA GGCGCAGGTG GCCGCACTAA AGACGGTGTA CCGGCACGGC ACGGAGAAGG TTGGGGTCAC CGCCCCCAGT CCCCTGCAGC AGGGACGGAT CTTCGCCTCG CCGGTCGCCA ACTCGGAAGA CGGCATCGGC GGCGAACTGA CCTCCGAGGA CAAGTCGTGG CACCCCTTCC ACAACAAGAA ATACCAGGAC GGGACCCTGG CAAAGATAGA CATGCCTAAA GTGGAGATCG GGTTCGGCAT CGCGTCACAC TACCTCTACC TTGCCCAGGG GGAAAGGAGC GTGACGCTCA GTTTCCAGAG TCAGCCGGGA GTGTCCCGGG ACTACAAGAA CGACGTTGCC TGCCTGTTCA GCTCGGAAAA GGGTTGGCTG GAAAAGGCGC CGCTCTCCTT CGCCCAGGAG ACCGGGGGGC TGGCGCTCCG GGTCGCCCTC ACCGGGGCGG ACCCAGCCGT CGTCGCCTAT TCCGCAAAGC TCCACGGCTA CAGCTTCGCC ACCGAACTTC CGGTGCTGCT GATACGGCTC AAGCACCGCG ACGACGCCCC CTTCGCCTAC CCGGAGTTGG AGCCGCTCAC GCTGAACCGG GTGAAACTCG CCGTAGAGGT GACGGGGCTG AAGACAGCGG CCCTGTCCAA CGACTTCGGT CCGGTGGACG CCTCCAAGCC CTTTCAGCCT 
TTCGGCTCTT CCCCGGCAGC CAACAGCTCC TTCGTCATCG GCTCAGGCGA GATGTTCCAG AAGTCGCTCA CGAAGGCAGC GGTGAACGTC CAGTGGCAAA GCGCCCCCGC CCCCTTCAAA GGCAAGAGCG TGACTGCGAT GACTGAATAC CTGCAAGGTG GGGTCTGGCG CCAGTTCAAG ACGAGCCTTG AGGGAGACGT CACCAGCACC ACCTTCGACC TCTTGGGCGA ACCGCAGTCG GCGGACGCCC CCCCCTATTC CGATTCCCCC GCCGCGCCGG AAAACGAGCA GTACCAGACC TCCTCGCGAA ACGGCTTCGT CCGCCTGAGG CTCGCCGCCG ACTTCGGCCA AAGCGACTAC GAGCAGGCGC TCATCGATTA CATCAAGCGC ATCACCGACG GCGACCCCAA TCCGGGGTCC AAGCCGATCG CCCCGACAGG CCCATACGTC ACCGGCCTGA CGCTTGACTA CGGCGCGCAA CAGACCATAA CCCTGGACCA GGCGGACCTG GACAAGTTCA AGGCGAGGCA GGCCCTCTTT TTCCACCTGA ACCCCTTCGG TTACACGGAG CAGCACCCAT ATTTGAAGAC GACGGTGCCG GCTGCTCACC CCTCCATGAA GCTGGTGCCG CAGTTCAAGC ACGCAAACCG CGAGGACCCG ACTCTTCCCA AGGGGGTGCC GGTGCCGCAC GAGGCCGAGT TCTACATAGG GGTCGCAGGC CTCGCTCCGC CGCAGAACCT GGCGCTATTG TTCCAGGTGG TCGACGGGAG CGCCGACCCG CTCTCCATCA AGCCGAAGCC GCATATCGAC TGGAGCTACC TGCAGCAAAA CGAGTGGGTC CCCTTCGGAC ACGGCGAGGT TGAAGACCAA ACGGGCGAAC TGATCGACTC GGGCATCGTC ACCCTCTCCA TCCCCCGCGG CGCCTCCAGC GACAACACGC TGCTCCCGGC AGAGCTGCAT TGGGTGCGCG CGGCGGTGAA AAGCAACAGC GATTCGGTCT GCCGCCTGAT CCTCGTCGCC GCCCAGGCGC TGCAGGCGAC GTTTGAGGAC CAGGGAAACG ACCCCGCTTT CGTCGGGGGC ATGCTCCCCG CAGGGAGCAT CGCGAAATTG GCGCAGCCCG ACGCCGACGT GAAGAAGGCG AGCCAGCCCT TCGCCGGCTT CGGGGGGCGG GGGAAGGAGG CGCCCGCCGA CTTCTACACC AGGGTCTCGG AGCGGCTGCG CCACAAGGAC CGCGCCATCG CGCAGTGGGA CTACGAGCGG CTGGTGCTGG AAGGGTTCCC CCAGATCTAC CGGGTCAAGT GCCTGAACCA CACGCAGTAC GAGCCGGACG CGACCGGGAG CGGGATCTAC CGGGAACTCG CCCCCGGGCA CGTCACCGTG GTCACCATCC CGGATCTGCG CTTCGCAGCC CTTAGGGACC CGCTGCGCCC GTACACGAGC CTTGGGCTCC TGGACAAGAT CTCCGCATAC CTTGCGCAGC GGCTTTCCTG CTTCGTGCGG CTGCACGTAA GGAACCCGCT CTTCGAGGAG GTGCAGGTCG ACTGCAAGGT AACCCTGCAA CCGGGGCTGG ACCAGAGCTT TTACGAACTG AAGGTCAAGG AGGCGATCAC CCGTTTCCTT TCCCCCTGGG CCTTCCCCGG CGGGGGGAAT CCATCCTTCG GCGGCAAGGT GAGGAAGTCT GTCCTCATCG ACTTCGTGGA GGAACTCCCT TACGTGGAAT GCGTGATGGA CGTGAGGCTC CTGCACAGCT ACATCGACGC CTATGGCTTC GCCCGCACCG ACGAAGTCGA CGAGGCGGCC GGTTCCACCG 
CCGTCTCCAT CCTGGTCTCG GCCCGCAAAC ACCTGGTCGC GACCATTGAC CCAGCCGAGG AGGAGACTCC CGGTGAACTC TGCCGGTGCC CCGCATGA
|
Protein sequence | MSRSDCKQNR DPLQPVREGT SRGERLSPAL DPGYAPVDER TAAHHMVFAQ AYSAFLNYFN ADNALDGDWQ PFFSGDVSVQ LALAAVQDVD AYRTRIKEYL DFLKDLDFQS DEAKAKEHLG YLFSAAATLA RQFDLFRSGL PADLPLKSTL QALIQTQLAP AFNRLTAYFK ADQALPQPDR LIAEAAPDLV VLGKSAVAFA EVYQGGLSGE WITDASADWA AYTNAVPPDA SVYGSGIEPF QRINHIATHN LFTSALDQFL KAYARTAAEA AKGLEASLTD RDDHHPHYAL FLTFLRLFAY VRDAANTLTG RHLDFYYRDI LQLKEKEARP SHAHLLIELA KQAPEHLVRA GELFKAGKDA LGHDAFFAND RDFVANQAQV AALKTVYRHG TEKVGVTAPS PLQQGRIFAS PVANSEDGIG GELTSEDKSW HPFHNKKYQD GTLAKIDMPK VEIGFGIASH YLYLAQGERS VTLSFQSQPG VSRDYKNDVA CLFSSEKGWL EKAPLSFAQE TGGLALRVAL TGADPAVVAY SAKLHGYSFA TELPVLLIRL KHRDDAPFAY PELEPLTLNR VKLAVEVTGL KTAALSNDFG PVDASKPFQP FGSSPAANSS FVIGSGEMFQ KSLTKAAVNV QWQSAPAPFK GKSVTAMTEY LQGGVWRQFK TSLEGDVTST TFDLLGEPQS ADAPPYSDSP AAPENEQYQT SSRNGFVRLR LAADFGQSDY EQALIDYIKR ITDGDPNPGS KPIAPTGPYV TGLTLDYGAQ QTITLDQADL DKFKARQALF FHLNPFGYTE QHPYLKTTVP AAHPSMKLVP QFKHANREDP TLPKGVPVPH EAEFYIGVAG LAPPQNLALL FQVVDGSADP LSIKPKPHID WSYLQQNEWV PFGHGEVEDQ TGELIDSGIV TLSIPRGASS DNTLLPAELH WVRAAVKSNS DSVCRLILVA AQALQATFED QGNDPAFVGG MLPAGSIAKL AQPDADVKKA SQPFAGFGGR GKEAPADFYT RVSERLRHKD RAIAQWDYER LVLEGFPQIY RVKCLNHTQY EPDATGSGIY RELAPGHVTV VTIPDLRFAA LRDPLRPYTS LGLLDKISAY LAQRLSCFVR LHVRNPLFEE VQVDCKVTLQ PGLDQSFYEL KVKEAITRFL SPWAFPGGGN PSFGGKVRKS VLIDFVEELP YVECVMDVRL LHSYIDAYGF ARTDEVDEAA GSTAVSILVS ARKHLVATID PAEEETPGEL CRCPA
|
| |
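As a rough consistency check between the two sequence fields, the start and end of the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal pure-Python translator, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene is on the minus strand) and relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, differing only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence as given in the record.
print(translate("ATGAGCCGAA GCGACTGCAA ACAAAACCGC"))  # MSRSDCKQNR
print(translate("TGCCGGTGCC CCGCATGA"))               # CRCPA
```

The two outputs agree with the first ten and last five residues of the protein sequence field.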