Gene GM21_3102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3102 
Symbol 
ID8138452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3592706 
End bp3596104 
Gene Length3399 bp 
Protein Length1132 aa 
Translation table11 
GC content65% 
IMG OID644870706 
ProductPKD domain containing protein 
Protein accessionYP_003022888 
Protein GI253701699 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones138 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGACA GCACCGGTAA CTACATCAAC TACGGCCCCT TCCAGTTGGG CGCGGACGGC 
ATGCCCACCG CGGAAACCAT GGGGAGCCTC ACCACCATGC TGATGGGTAG CGCAAGCTTC
AGCACCGTAG CCGGCAGCAA TCCGCTGCTC ACCAAGGTCG ACTATTCCGC CTGGGACCTC
TCCCAGGCTT GCGGCTCCTG CCACGTCGGC GGCGGCTTCG TCGAGAAGGA CCGCGAAGGG
AAGCGCTTCA GCATGATGAG CCCCTTCGTC GACGGCATCA CCCCGTACAC CATGACCATC
TTCGAGCGCT ACGACGCGAC GACCGGGCTG CCGGCTCACA CCGTCGAGCC CGCTCCCTGG
TCCTATCCGA TCTGGAACGG CTCCACCCCC GTGACCGCCG ACGCCGGCTG GGGACAGCCG
ATGACCATGA CGCTTCCGGA CGGCTCCGCC ATGCCGGTCG TGGACAAGCA GGTCATGATG
CCCAACGTGA AGGAGATGGA CTGCCTCATG TGCCATTTCC AGGGGTACAA CAACCTCATG
TCCTCGGTCA TGGCCTACTC CGGCGCGCAT AACGCGACGC CTAGCTTCGG CGCGGGCTTC
ATGAACATGT TCACCCAGGC CTACGACTTC GCCACCGGCC TTTTGCAAAA AGACGGCAAC
GGCGTGGTCT CGCTCTCAAG CCTCGGCCTC TCCAAGATGA AGTACGATCC CCCCAGCCAG
AACTGCCGCA ACTGCCACAT GCCTTCGAAC CTGAAGGACC TGCCGGACAT GATGAGCGAC
TTCCTCTCCT CCGCTCCCAT GATCTACAAC GGCAATTTCG ACGGCAACAA GCGGGGTTCC
TTCACCGGCC TTACCATGCC GGCCTTCGAC TTCAACGCCC CGATAAGCTT CGACCGCGGC
GCCACCGCCT ATACCTGGGA CTGGACCGCG GCCTATGCGA CCTCGCCGAC CCTGTACATG
ACCATGACCG GCATCGGGAG CACGACCCCC GGCTTCACCG GCACGACTAT CCCCGCCGGG
TGGCCCGCCG CCATGGGGAT GAGCGAGTTC AACAAGCCCG CCTTCGACGC CTTCTCCACC
GCCATGCCGG GGATGAAGCA GTATTTCCTG GGCGGCGGCA ACCGTCCCGG CACCGGCCCG
ATCTACTACC AGGCCACCCT GGCCGACGGC ATGCACCAGG ACCAGAACGT GCTCAAAAAG
AGCACCGTTC CCTTCCCGCG CGCCGAGTGG TTCAAGCGCG GCGATCTCTG GGAGCCCGGC
TACGACGTCC ACATGACGCT CGAGTGCGCC GGCTGCCACA TGAACACCGC GACCACGAAG
ACCGACAAGT ACGACGCTAG CGGCAACCTG ATCTTCGACG GCAAGAGCGA TTGCGATCCG
GGCCGCGGCT ACGACAGCGC CGGCGGCGTG GAAGCGAACC CCGCCTTCAA GACCACCGTC
AACAGCCAGA ACACCGTCAA GAGCTGCGAG GCCTGCCACG TGACCGGCAA GAACCACGAC
GGCGTGGTGG TCGACACCTA CGGCGCGCCC GACCCGGCTT CCGCTCACAA GGCCGCGGGC
CTTCTGGCCA ACGTCACCAA TGCGGTGCGC CTGAACGCCG CCACCGGCGC CGAAGAAAAC
TTCACCGGCA GCCACCTGGA TGTCGTCGAC TGCACCGTCT GCCACCTGAG CCGCGAGCAG
ATGGTGGTCC GTCTCCTCGA CTGCACCTCC GGCAACCGCT ACCCCAACAT GCTCGGCTTC
GACGAGAACC GCGGCATGAT GGGTATGTTC AGCGACCCGA TGGGCCAGCA GTGGCCGGTC
GGCAACAACC TGAAGAAGTG GGATCCGCTC TACACCTGGC AGAAGGGGGG CAGCGACGCC
AAGGGGAGCG CAGGCTCCTG GAACGCCGAA TGGCGCCGCA AGATCTACGC CGTCAACCTG
ATCACCGCAG CCATCTGGAA CAACGTCGAC GCCAACGTCG ACGCCAACGG CGACGGCGTC
CCGGGGCGTG CCCCGAGCAT CCACCCGGGC ACTCCGGAGG TCTCCCCCAG CACCAACTAC
GACCCCTGGA TCTCCCGCGA CATGAAGGCC GGCATTAACT ACGGTCCTTC CGGCTTCGCC
CCCATCCCGG TCGGCTTCGG CGACAACGAC GCCTTGAAGG GACCGTTCGC GGTCAACTTC
CAAAGCGCCT ACAACGCCGA CGGCAGCTTC ACCGGCGCCC TTAAGTACGT CGGCGTCTAC
GGCGGCAACG CCATGTTCTC CACCCCGCAG GAGATCTCGG GCTACAAGTC CTGGCGCAAC
TCCATCAAGG CGGGCGTGGA CAACAAGGAC TGGACCGGAA CCCAACTCGC CTTGGTGGCC
GGCCCCTACA AGCTGACCCA CGGCATCAGG GCGACCGAGA AGTTCGTCCT CGGCAAGAAG
ACCGAGACCG GCTTCGGCTG CGCCGACTGC CATGCGCCGG CGGCAACCGC CAAGGTGGCG
TTCTTCGACG GCACCATCAA CATGGTCGGC ACCGCGGTCA ACACCCATAA AGCGATCCAG
GCAGGCAAGG GCTTCATGGA GGCCTCGGCC GAACTGATGG AGATCGTGGG TGCGAAGGAA
GACATCGACA CCGCGACGGA AGTGGCAACC AAGGCCGGCG GAGCGGTGGA AGTGAAGTTC
GAGGAACTCG GTGACTGGGA CGGGGCGGCG TTCAGCGTGA ACCCCGCCGG CGAGTACAAG
CGCGTCACCG AGATGGACCG AAACGAAGCG CTCTACCCGG CGGTGTCCGG CGTGAGCTTC
ACCGACATCA ACGGCAACAG CTACGCCGAC CGCGACGCCT GGAGGGGGTA CTTGACCGGC
ATCACCCCGG CGCAGGCCGG TATCGGCGTG GCGCCGGTGG CAAGCATCGT CTCGACCGTC
ACCGACCTCG ACCCCGCTGC CGCAGGCACC CAGGTCGCCG CCAACGCTGC GGTAACCCTG
ACCGCAGGCG TCGCGCAAAC AGGCGGCGTG GTCAGCTACA GCTGGAGCTC AAGCGACGGC
ACCGTGATCC CGGCGGGCAA GGAGAGCAGC GTCACCTTCA CCACCACCGG GAGCAAGACC
GTCACCCTCA GCGTGACCGA CGAGGAAGGA AACAAGGCGT TCGCGACGCT GAACCTCCAG
GTCGTGGCGG TTCCGGCGGA GATGATCTCC TGGAACGACG CAGCCGGCAG CCTGGGCGGC
GTCATGACCG TGGCCGGCAT GCCCACCCCC AACGACAAGG TGAAAATCGT CTGGGGCGAC
GGCAAGTACC AGTATGTCAC CCTCGCCAAC GCGGCGAGCA TCTCCAGGGC TCACGTCTAC
CTCACCGCCG GCAACAAGCT GGTGCAGGTG TACATCTACA AGGCGGGGGT GCTGGTAGGG
ACCTCCAAGA AGAGCATCAC CGTAAACGGC GGCAACTAA
 
Protein sequence
MKDSTGNYIN YGPFQLGADG MPTAETMGSL TTMLMGSASF STVAGSNPLL TKVDYSAWDL 
SQACGSCHVG GGFVEKDREG KRFSMMSPFV DGITPYTMTI FERYDATTGL PAHTVEPAPW
SYPIWNGSTP VTADAGWGQP MTMTLPDGSA MPVVDKQVMM PNVKEMDCLM CHFQGYNNLM
SSVMAYSGAH NATPSFGAGF MNMFTQAYDF ATGLLQKDGN GVVSLSSLGL SKMKYDPPSQ
NCRNCHMPSN LKDLPDMMSD FLSSAPMIYN GNFDGNKRGS FTGLTMPAFD FNAPISFDRG
ATAYTWDWTA AYATSPTLYM TMTGIGSTTP GFTGTTIPAG WPAAMGMSEF NKPAFDAFST
AMPGMKQYFL GGGNRPGTGP IYYQATLADG MHQDQNVLKK STVPFPRAEW FKRGDLWEPG
YDVHMTLECA GCHMNTATTK TDKYDASGNL IFDGKSDCDP GRGYDSAGGV EANPAFKTTV
NSQNTVKSCE ACHVTGKNHD GVVVDTYGAP DPASAHKAAG LLANVTNAVR LNAATGAEEN
FTGSHLDVVD CTVCHLSREQ MVVRLLDCTS GNRYPNMLGF DENRGMMGMF SDPMGQQWPV
GNNLKKWDPL YTWQKGGSDA KGSAGSWNAE WRRKIYAVNL ITAAIWNNVD ANVDANGDGV
PGRAPSIHPG TPEVSPSTNY DPWISRDMKA GINYGPSGFA PIPVGFGDND ALKGPFAVNF
QSAYNADGSF TGALKYVGVY GGNAMFSTPQ EISGYKSWRN SIKAGVDNKD WTGTQLALVA
GPYKLTHGIR ATEKFVLGKK TETGFGCADC HAPAATAKVA FFDGTINMVG TAVNTHKAIQ
AGKGFMEASA ELMEIVGAKE DIDTATEVAT KAGGAVEVKF EELGDWDGAA FSVNPAGEYK
RVTEMDRNEA LYPAVSGVSF TDINGNSYAD RDAWRGYLTG ITPAQAGIGV APVASIVSTV
TDLDPAAAGT QVAANAAVTL TAGVAQTGGV VSYSWSSSDG TVIPAGKESS VTFTTTGSKT
VTLSVTDEEG NKAFATLNLQ VVAVPAEMIS WNDAAGSLGG VMTVAGMPTP NDKVKIVWGD
GKYQYVTLAN AASISRAHVY LTAGNKLVQV YIYKAGVLVG TSKKSITVNG GN