Gene Information |
Locus tag | GM21_1709 |
Symbol | |
ID | 8137040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1989968 |
End bp | 1991548 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644869321 |
Product | hypothetical protein |
Protein accession | YP_003021521 |
Protein GI | 253700332 |
COG category | |
COG ID | |
TIGRFAM ID | |
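
The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the listed Gene Length matches that convention) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1989968
end_bp = 1991548

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be a stop
# codon, which encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1581
print(protein_length)  # 526
```

Both results agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields.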
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 2.98029e-32 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCTAGGCA TAATGAGGAA GTACAAGCAA TCGATACTCA TAAAGATTGT ATTCGTCGTG ATCGTACTAT CTTTCGTAGG GACCATCTTC CTCGTCTGGG GCAGAGGCGG CGAGGGTTCG GCTGACGGTC CTGGTGGATA TGCCGCAACG GTTGACGGCA CGAAGATCTC GATGGATGAC TTCCAGAAGA ACTACTACCG CACCAGGAAC CTGTACGAGC AGATCTACGG CCGCTCGCTG ACCCCCGAGA TGGAAAAGCA GATGGGGCTC AAAAAGACGA CCATAGGTAG CATGGTGGAC AACGTCCTCA CCCTCAAAGA AGCCAAGAAG ATGGGCATCA AGGTGAATAA GGACGAGGTG GCAGCGGAGA TCGCGAAGAT CCCCTCCTTC CAGAATAACG GCGCGTTCGA CTTCAACCTG TACCAGCAGA CCCTCAAGGC CAACCGGGTC ACCCCGAAGG AGTTCGAGGA AACCCAGGAA CAGGACATCC TGGTCCAAAA GGCGCGCAAC AAGGTGAAGG AGAAGGCGAC CGTCACCGAC GCCGACGTGA TGCAGGAATT CAAGAAGCAA AACGACAAGG TGAACCTGCA GTACGTCTCC TTCTCCCCCG CCGACGTGAA GGGAAGCATC AAGCTGACCG ACGCCGAGCT GAACGTCTAC CTCCAGGATC ACCAGGCGCA GTTCAAGACG CCGGAGCAGG TATCGATCGC CTACACGTTG GTGAGCCCGG CGGCTCTCGC CGCCAAGGTG AGCGTCACCC CTGAAGAAGC TCAGAACTAC TACCAGAAGA ACATCGACCG CTACCAGGGC AAAGGGGGGA TTCTCCCGTT CTCCGAAGTA AAGGATCAGG CAACCGCCGA CGCGCAGAAG GCTAAGGCCG CCAAGGAAGC CTACGAGAAG GCTGCCGAGA CCGCCAACAA GTTCCGTAGC CAGGGCAACC TCGATGCAGC CGCCCAAGCG CTCGGGGGCA AGGTCGAGAA GACCCCGCTC TTCACCGCGC AGGCGCCTGC CGCTGCCATC GCAGGGGAAA TCGAACTCGT CACCCGCGCC TTCGCGCTGA AGCAGGGCGA ATTGGGGGGA CCGGTCGAGA CCGCCAAGGG GATCTACCTG CTGCAGGTTC TCGACAAGAA GCCGTCCGTC GTGCCGCCGC TGGCGCAGGT AAGGGCGCAG GTCGAGCAGA AGCTTTTGGA AGTGAAAGGG GCCGAGGTGG CCAAGAAGAA GGCTGAAGAA GCGCTGCAGC AGCTCGCCAA AGGGGGCGCG GCAGCCAAGG AGACCGGCAA CTTCGGCTAC TCCCCGGCCG GTGCCATCCC CACCGTCGGA ACCTCCCCCG AACTCATGGA AGCCGCTTTC GCGCTTACCC CTGCCAGCCC GGTCGCCAAG CAGCCGGTGA AGGTGGGCGA GCGCTGGTAC GCGGTGAAAC TTAAGAACAG GGTGGAAGCC CCCACCACCG ACTTCGCCAA GGCCTCCGCT ACCATCAAAC AGGCCCTGCT CCCCAAAAAG CAGCAGGACG AGCTGGACAA GTGGTTGAAG GGGCTCAGGG ATAAGGCTAA AATCGAGATC AACCCGTCGA TCCAGGACTA A
|
Protein sequence | MLGIMRKYKQ SILIKIVFVV IVLSFVGTIF LVWGRGGEGS ADGPGGYAAT VDGTKISMDD FQKNYYRTRN LYEQIYGRSL TPEMEKQMGL KKTTIGSMVD NVLTLKEAKK MGIKVNKDEV AAEIAKIPSF QNNGAFDFNL YQQTLKANRV TPKEFEETQE QDILVQKARN KVKEKATVTD ADVMQEFKKQ NDKVNLQYVS FSPADVKGSI KLTDAELNVY LQDHQAQFKT PEQVSIAYTL VSPAALAAKV SVTPEEAQNY YQKNIDRYQG KGGILPFSEV KDQATADAQK AKAAKEAYEK AAETANKFRS QGNLDAAAQA LGGKVEKTPL FTAQAPAAAI AGEIELVTRA FALKQGELGG PVETAKGIYL LQVLDKKPSV VPPLAQVRAQ VEQKLLEVKG AEVAKKKAEE ALQQLAKGGA AAKETGNFGY SPAGAIPTVG TSPELMEAAF ALTPASPVAK QPVKVGERWY AVKLKNRVEA PTTDFAKASA TIKQALLPKK QQDELDKWLK GLRDKAKIEI NPSIQD
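
As a rough cross-check of the two sequence fields, the sketch below translates the first and last 30 bases of the gene sequence and compares them with the ends of the protein sequence. It is a minimal illustration, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene starts with ATG, that difference does not matter here. The gene sequence is listed in coding orientation even though the gene lies on the minus strand, so no reverse complement is taken.

```python
from itertools import product

# Codon-to-residue mapping, with bases ordered T, C, A, G and the
# first base varying slowest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces used to group bases in tens, and normalise case.
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 30 bases of the gene sequence in this record.
# The gene is 1581 bp long, so both fragments start on a codon boundary.
first_30 = "ATGCTAGGCA TAATGAGGAA GTACAAGCAA"
last_30 = "ATCGAGATC AACCCGTCGA TCCAGGACTA A"

print(translate(first_30))  # MLGIMRKYKQ
print(translate(last_30))   # IEINPSIQD
```

The two outputs match the first ten and last nine residues of the protein sequence. Passing the full gene sequence to the same function would be expected to give all 526 residues.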
|
| |