Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3102 |
Symbol | |
ID | 8138452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3592706 |
End bp | 3596104 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644870706 |
Product | PKD domain containing protein |
Protein accession | YP_003022888 |
Protein GI | 253701699 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 138 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGACA GCACCGGTAA CTACATCAAC TACGGCCCCT TCCAGTTGGG CGCGGACGGC ATGCCCACCG CGGAAACCAT GGGGAGCCTC ACCACCATGC TGATGGGTAG CGCAAGCTTC AGCACCGTAG CCGGCAGCAA TCCGCTGCTC ACCAAGGTCG ACTATTCCGC CTGGGACCTC TCCCAGGCTT GCGGCTCCTG CCACGTCGGC GGCGGCTTCG TCGAGAAGGA CCGCGAAGGG AAGCGCTTCA GCATGATGAG CCCCTTCGTC GACGGCATCA CCCCGTACAC CATGACCATC TTCGAGCGCT ACGACGCGAC GACCGGGCTG CCGGCTCACA CCGTCGAGCC CGCTCCCTGG TCCTATCCGA TCTGGAACGG CTCCACCCCC GTGACCGCCG ACGCCGGCTG GGGACAGCCG ATGACCATGA CGCTTCCGGA CGGCTCCGCC ATGCCGGTCG TGGACAAGCA GGTCATGATG CCCAACGTGA AGGAGATGGA CTGCCTCATG TGCCATTTCC AGGGGTACAA CAACCTCATG TCCTCGGTCA TGGCCTACTC CGGCGCGCAT AACGCGACGC CTAGCTTCGG CGCGGGCTTC ATGAACATGT TCACCCAGGC CTACGACTTC GCCACCGGCC TTTTGCAAAA AGACGGCAAC GGCGTGGTCT CGCTCTCAAG CCTCGGCCTC TCCAAGATGA AGTACGATCC CCCCAGCCAG AACTGCCGCA ACTGCCACAT GCCTTCGAAC CTGAAGGACC TGCCGGACAT GATGAGCGAC TTCCTCTCCT CCGCTCCCAT GATCTACAAC GGCAATTTCG ACGGCAACAA GCGGGGTTCC TTCACCGGCC TTACCATGCC GGCCTTCGAC TTCAACGCCC CGATAAGCTT CGACCGCGGC GCCACCGCCT ATACCTGGGA CTGGACCGCG GCCTATGCGA CCTCGCCGAC CCTGTACATG ACCATGACCG GCATCGGGAG CACGACCCCC GGCTTCACCG GCACGACTAT CCCCGCCGGG TGGCCCGCCG CCATGGGGAT GAGCGAGTTC AACAAGCCCG CCTTCGACGC CTTCTCCACC GCCATGCCGG GGATGAAGCA GTATTTCCTG GGCGGCGGCA ACCGTCCCGG CACCGGCCCG ATCTACTACC AGGCCACCCT GGCCGACGGC ATGCACCAGG ACCAGAACGT GCTCAAAAAG AGCACCGTTC CCTTCCCGCG CGCCGAGTGG TTCAAGCGCG GCGATCTCTG GGAGCCCGGC TACGACGTCC ACATGACGCT CGAGTGCGCC GGCTGCCACA TGAACACCGC GACCACGAAG ACCGACAAGT ACGACGCTAG CGGCAACCTG ATCTTCGACG GCAAGAGCGA TTGCGATCCG GGCCGCGGCT ACGACAGCGC CGGCGGCGTG GAAGCGAACC CCGCCTTCAA GACCACCGTC AACAGCCAGA ACACCGTCAA GAGCTGCGAG GCCTGCCACG TGACCGGCAA GAACCACGAC GGCGTGGTGG TCGACACCTA CGGCGCGCCC GACCCGGCTT CCGCTCACAA GGCCGCGGGC CTTCTGGCCA ACGTCACCAA TGCGGTGCGC CTGAACGCCG CCACCGGCGC CGAAGAAAAC TTCACCGGCA GCCACCTGGA TGTCGTCGAC TGCACCGTCT GCCACCTGAG CCGCGAGCAG ATGGTGGTCC GTCTCCTCGA CTGCACCTCC GGCAACCGCT ACCCCAACAT GCTCGGCTTC GACGAGAACC GCGGCATGAT GGGTATGTTC AGCGACCCGA TGGGCCAGCA GTGGCCGGTC GGCAACAACC TGAAGAAGTG GGATCCGCTC TACACCTGGC AGAAGGGGGG CAGCGACGCC AAGGGGAGCG CAGGCTCCTG GAACGCCGAA TGGCGCCGCA AGATCTACGC CGTCAACCTG ATCACCGCAG CCATCTGGAA CAACGTCGAC GCCAACGTCG ACGCCAACGG CGACGGCGTC CCGGGGCGTG CCCCGAGCAT CCACCCGGGC ACTCCGGAGG TCTCCCCCAG CACCAACTAC GACCCCTGGA TCTCCCGCGA CATGAAGGCC GGCATTAACT ACGGTCCTTC CGGCTTCGCC CCCATCCCGG TCGGCTTCGG CGACAACGAC GCCTTGAAGG GACCGTTCGC GGTCAACTTC CAAAGCGCCT ACAACGCCGA CGGCAGCTTC ACCGGCGCCC TTAAGTACGT CGGCGTCTAC GGCGGCAACG CCATGTTCTC CACCCCGCAG GAGATCTCGG GCTACAAGTC CTGGCGCAAC TCCATCAAGG CGGGCGTGGA CAACAAGGAC TGGACCGGAA CCCAACTCGC CTTGGTGGCC GGCCCCTACA AGCTGACCCA CGGCATCAGG GCGACCGAGA AGTTCGTCCT CGGCAAGAAG ACCGAGACCG GCTTCGGCTG CGCCGACTGC CATGCGCCGG CGGCAACCGC CAAGGTGGCG TTCTTCGACG GCACCATCAA CATGGTCGGC ACCGCGGTCA ACACCCATAA AGCGATCCAG GCAGGCAAGG GCTTCATGGA GGCCTCGGCC GAACTGATGG AGATCGTGGG TGCGAAGGAA GACATCGACA CCGCGACGGA AGTGGCAACC AAGGCCGGCG GAGCGGTGGA AGTGAAGTTC GAGGAACTCG GTGACTGGGA CGGGGCGGCG TTCAGCGTGA ACCCCGCCGG CGAGTACAAG CGCGTCACCG AGATGGACCG AAACGAAGCG CTCTACCCGG CGGTGTCCGG CGTGAGCTTC ACCGACATCA ACGGCAACAG CTACGCCGAC CGCGACGCCT GGAGGGGGTA CTTGACCGGC ATCACCCCGG CGCAGGCCGG TATCGGCGTG GCGCCGGTGG CAAGCATCGT CTCGACCGTC ACCGACCTCG ACCCCGCTGC CGCAGGCACC CAGGTCGCCG CCAACGCTGC GGTAACCCTG ACCGCAGGCG TCGCGCAAAC AGGCGGCGTG GTCAGCTACA GCTGGAGCTC AAGCGACGGC ACCGTGATCC CGGCGGGCAA GGAGAGCAGC GTCACCTTCA CCACCACCGG GAGCAAGACC GTCACCCTCA GCGTGACCGA CGAGGAAGGA AACAAGGCGT TCGCGACGCT GAACCTCCAG GTCGTGGCGG TTCCGGCGGA GATGATCTCC TGGAACGACG CAGCCGGCAG CCTGGGCGGC GTCATGACCG TGGCCGGCAT GCCCACCCCC AACGACAAGG TGAAAATCGT CTGGGGCGAC GGCAAGTACC AGTATGTCAC CCTCGCCAAC GCGGCGAGCA TCTCCAGGGC TCACGTCTAC CTCACCGCCG GCAACAAGCT GGTGCAGGTG TACATCTACA AGGCGGGGGT GCTGGTAGGG ACCTCCAAGA AGAGCATCAC CGTAAACGGC GGCAACTAA
|
Protein sequence | MKDSTGNYIN YGPFQLGADG MPTAETMGSL TTMLMGSASF STVAGSNPLL TKVDYSAWDL SQACGSCHVG GGFVEKDREG KRFSMMSPFV DGITPYTMTI FERYDATTGL PAHTVEPAPW SYPIWNGSTP VTADAGWGQP MTMTLPDGSA MPVVDKQVMM PNVKEMDCLM CHFQGYNNLM SSVMAYSGAH NATPSFGAGF MNMFTQAYDF ATGLLQKDGN GVVSLSSLGL SKMKYDPPSQ NCRNCHMPSN LKDLPDMMSD FLSSAPMIYN GNFDGNKRGS FTGLTMPAFD FNAPISFDRG ATAYTWDWTA AYATSPTLYM TMTGIGSTTP GFTGTTIPAG WPAAMGMSEF NKPAFDAFST AMPGMKQYFL GGGNRPGTGP IYYQATLADG MHQDQNVLKK STVPFPRAEW FKRGDLWEPG YDVHMTLECA GCHMNTATTK TDKYDASGNL IFDGKSDCDP GRGYDSAGGV EANPAFKTTV NSQNTVKSCE ACHVTGKNHD GVVVDTYGAP DPASAHKAAG LLANVTNAVR LNAATGAEEN FTGSHLDVVD CTVCHLSREQ MVVRLLDCTS GNRYPNMLGF DENRGMMGMF SDPMGQQWPV GNNLKKWDPL YTWQKGGSDA KGSAGSWNAE WRRKIYAVNL ITAAIWNNVD ANVDANGDGV PGRAPSIHPG TPEVSPSTNY DPWISRDMKA GINYGPSGFA PIPVGFGDND ALKGPFAVNF QSAYNADGSF TGALKYVGVY GGNAMFSTPQ EISGYKSWRN SIKAGVDNKD WTGTQLALVA GPYKLTHGIR ATEKFVLGKK TETGFGCADC HAPAATAKVA FFDGTINMVG TAVNTHKAIQ AGKGFMEASA ELMEIVGAKE DIDTATEVAT KAGGAVEVKF EELGDWDGAA FSVNPAGEYK RVTEMDRNEA LYPAVSGVSF TDINGNSYAD RDAWRGYLTG ITPAQAGIGV APVASIVSTV TDLDPAAAGT QVAANAAVTL TAGVAQTGGV VSYSWSSSDG TVIPAGKESS VTFTTTGSKT VTLSVTDEEG NKAFATLNLQ VVAVPAEMIS WNDAAGSLGG VMTVAGMPTP NDKVKIVWGD GKYQYVTLAN AASISRAHVY LTAGNKLVQV YIYKAGVLVG TSKKSITVNG GN
|
| |