Gene Information |
Locus tag | GM21_2999 |
Symbol | |
ID | 8138342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3484966 |
End bp | 3487578 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644870597 |
Product | type II and III secretion system protein |
Protein accession | YP_003022786 |
Protein GI | 253701597 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | |
|
|
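The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function name `feature_lengths` is hypothetical and is not part of the source record.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span on the replicon
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp rows of this record.
print(feature_lengths(3484966, 3487578))  # (2613, 870)
```

The result matches the Gene Length (2613 bp) and Protein Length (870 aa) rows.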
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0000000000114171 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATAGAC CAAGACCGAT CTTAACCCTG ATGCTGGTGG CACTGGCTCT TTCGGGCTGC ACGTCCGGCC GCACCGCGTT CAGCAAGGCG GAGAAGCTGG AGCGAGAGGG GAACCTGGAC GCCGCGCTGG TGAAATATGC CGAGGTGTCC GCTGCCAACC CCGATATCGG CGAGTACCGG GTGAAGCTTT TGAACATCAC CGAGACGGCG GCTCGCGTGC ATTTCAAGAA GGGTGAGGAG TTCTTCGCCA AGAATAACTA CGACGAAGCG CTCAGGGAGT TTCAAAGCGC CTACGCCATG GACCCCACCA ACGTCCTGGC CAAGAACCAG GCCGACCATG TGCTGAAGCT GAGGAACGCC CAGACCTACC TGCTGGAGGG GCTCGACTTC GAGAAAAACC GCAAACCGCG CGAGGCGATG ATCGCCTTCA AGCACGCCCT CGAATTCCAC CCGAGCAACA AGGAGGCGAA GGAAGGGCTG GATCGCATCA TCGCCAACAA GCGCCAGAAG CTGGACGGTT TCGAGCTGAA CCTGAAGTCG AACAAGCCGA TCACCCTGAA GTTCCGCGAT GCGAAGCTCA AGGAGATCTT CACCATCCTT TCCCAGCTCT CAGGCATCAA CTTCGTCTTC GACGAGGCGG TGAAGGACGT GAACGTCACG CTGCACTTGG AGAACGGCTC GTTCCAGCAG GCGATGGAAC TGATCACCGG GATGCATAAG CTGGACAAGA AGATCCTGAA CGAGAGCACC ATCATCATCT ATCCGAAGAC CCCGGACAAG GTCAAGCAGT ACGAAGAACT CTTTCTGCAG ACCTTCTACC TGAACAAGCT GGACGCGAAG AAGGCGGTCA ACCTGGTGCG CACCATGCTC CAGGTGAAGA AGATCTATGT GAACGAGGAG GCGAACGCAC TGGTCCTCCG CGACAAACCT GAAGTCATCG AGGTGGCGCG CAAGATTCTG GAAGCCAACG ACGTTCCCGA CGCCGAGGTG CTCCTCGAGG TGGAGGTGTT CGAACTCTCC AAGCAGAACG CCGAGACCTT CGGACTGCTC CTCTCCAGGT ATGCCACCTC CATGGGGGTG ACCGGTCCGG GAAGCACCTC CGGCGGCAGC CCGTTCCTGG CTGACACGCT GGGAGCAGTC ACCACCACCA CGACCGGGAC AGGTACGGCT ACCACCTCTG CTCAGCCGTC CAACCTGTTG AACGTGTTCA ACCTGCGGGG GTATAACGGC TACCTGACCG TTCCGAACGC CACCTTCAAC TTCGGCAAGA CCCTCTCCAA CGGCGAGACC CTCTCCAACC CGAAGATCAG GGTGAAGAAC CGGGAGAAGG CGAAGTTCAA CGTCGGCACC AGGGTCCCGA TCACCACAAC CTCTTCGCCT TCCGGCGGCG GCGTCAGCGT CAACGTCCAG TACGTCGACG TCGGGGTCAA GGTGAACGCG GAGCCGACCA TCCAGCTCAA CAACGAGGTC GCCATCAAGC TGGGTCTGGA GGTCAGCTCC ATCCTCAACG AGAAGACCAT CGGCACCGAC CAGGCCACCA CCGTGGTCAC CATCGGCACC AGGAACCTCG ACACGGTGCT GTCTCTCAAA GACGGCGAGA CCAGCATCAT CGGGGGGCTC ATTCAGAAGA CCCAGACCGA CAGCAAGAAC AAGGTCTTCC TTTTGGGCGA CATCCCGATC ATCGGACCTC TGTTCAGCAA CACAAGCGAC AAGAAGGACA AGACCGAACT GCTGCTCGCC ATCACCCCCA GGATCGTGCG CGGCGTGACC GTTCCCGACA ACGACGTCGC CGCCTTCTGG 
TCCGGCCGCG AGGACGAGCC CTCGTCGCAC CAGCCGTATT CCTCCTTCAT GGAGCCCGAT TTCGTCAACC CCGAGGCTGC CGCCGAGGGT GCCCCGGCTC CTGCTGCAGC CAAGCCCGCT CCCAGGGTGC TGCCTAACCT GGTGCCGGTC CAGAAGAGCG TGCCGGCTCC TGTTCCTGCC CCTGCTCCTG CCGCCGCCCC TGCTCCTGCT CCTGCCGGTG CTCCGGCTGG TGCCGCTCCC GCTCCAGAAG GAGCTGCCGC GGCCGTTCCG GCACCTGCCC CGGGAGCCGT TCCGGCCGCG GGGGATGTAC CGGTACCTGT GCCGGTGCCC GCGCCTGTAC CTGTGCCTGT ACCTGCTCCG GTTCCCGCTC CCGCCGCACC CGCGACCGCA AAGCAGCCGC TTCTGACCGA TTCGCTGATC GCGCTGAGCC TTCCGGCCAA GGTGAAGCTG AACGACCAGT TCAACGTGCA GGTGAACGGG TCCGGGATGG GGGATATGTA CAAGGCGGTC TTCGTGCTCT CTTACGATCC CAAGCTCCTG GACGCCGTCT CGCAGTCCGA GGGGAACCTC TTGAAGCAAC CCGGAAAGCC TTCCGCCTTC CAGGCCTTCG CGGACAAGAA AAAGGGTGAG ATCTGGATGT CCGGCATGCG CGAGGAGCCG ACCGGTACGG CCAACGGCAT CCTGGCCAAT GTCAGCTTCA AGGCGATAGG GCCGGGATCG GCCGCCGTTT CCGTGATCAA CACCAACTTC AGCAAGAAGA CGGGCGAGCA GATTCCGGTC ACCGCCTTCA AATCCGTAGT GGAGGTACAG TAG
|
Protein sequence | MHRPRPILTL MLVALALSGC TSGRTAFSKA EKLEREGNLD AALVKYAEVS AANPDIGEYR VKLLNITETA ARVHFKKGEE FFAKNNYDEA LREFQSAYAM DPTNVLAKNQ ADHVLKLRNA QTYLLEGLDF EKNRKPREAM IAFKHALEFH PSNKEAKEGL DRIIANKRQK LDGFELNLKS NKPITLKFRD AKLKEIFTIL SQLSGINFVF DEAVKDVNVT LHLENGSFQQ AMELITGMHK LDKKILNEST IIIYPKTPDK VKQYEELFLQ TFYLNKLDAK KAVNLVRTML QVKKIYVNEE ANALVLRDKP EVIEVARKIL EANDVPDAEV LLEVEVFELS KQNAETFGLL LSRYATSMGV TGPGSTSGGS PFLADTLGAV TTTTTGTGTA TTSAQPSNLL NVFNLRGYNG YLTVPNATFN FGKTLSNGET LSNPKIRVKN REKAKFNVGT RVPITTTSSP SGGGVSVNVQ YVDVGVKVNA EPTIQLNNEV AIKLGLEVSS ILNEKTIGTD QATTVVTIGT RNLDTVLSLK DGETSIIGGL IQKTQTDSKN KVFLLGDIPI IGPLFSNTSD KKDKTELLLA ITPRIVRGVT VPDNDVAAFW SGREDEPSSH QPYSSFMEPD FVNPEAAAEG APAPAAAKPA PRVLPNLVPV QKSVPAPVPA PAPAAAPAPA PAGAPAGAAP APEGAAAAVP APAPGAVPAA GDVPVPVPVP APVPVPVPAP VPAPAAPATA KQPLLTDSLI ALSLPAKVKL NDQFNVQVNG SGMGDMYKAV FVLSYDPKLL DAVSQSEGNL LKQPGKPSAF QAFADKKKGE IWMSGMREEP TGTANGILAN VSFKAIGPGS AAVSVINTNF SKKTGEQIPV TAFKSVVEVQ
|
| |
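The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for amino acids. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers, not part of the source record.

```python
from itertools import product

# Standard genetic code in NCBI "TCAG" order. Table 11 uses the same
# amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence row.
print(translate("ATGCATAGAC CAAGACCGAT CTTAACCCTG"))  # MHRPRPILTL
```

The output matches the first block of the Protein sequence row. Passing the full gene sequence to `translate` in the same way would allow the whole record to be compared against the listed protein.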