Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0609 |
Symbol | |
ID | 8135924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 739107 |
End bp | 741014 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868226 |
Product | general secretion pathway protein D |
Protein accession | YP_003020441 |
Protein GI | 253699252 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02517] general secretion pathway protein D |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 9.73081e-17 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGATTTG CTCGCATCAC AGCCATGCTC ATGTTCCTGC TCGCCGCACC GACGCTGGTC TTCGCCAAGG GTGTGGTGCT TAACTTCACC GACGTGGATA TCGCCACCAT GGTGAAATTC GTCAGCGACC TGACCGGGAA GAACTTCATC ATGGACGACC GGGTGAAGGG AAAGATCTCG GTGTTCTCCC CGGCCAAACT CTCCAACGAC GAGGCGTACA ACGTCTTCAC CTCGGTCCTG GAACTCAAGG GGTTCACCGT GGTCCCGGCG GGAAAGGTGC TGAAGATCGT TCCCACGGCG AGCGCCAGGC AGTCGGGGAT GAAGGTCCTC TCCGAGGGTG AGCGGGGGGT AGTGAACGAC AGCTATCAGG CCCGCGTGAT CCAGCTGGAG CACGTGGCAC CTCAAGAAGC CGTCGCTTTC CTGCAGCCGC TTGTCTCCAG AGACGGCCAG ATCTCACCTT TCGGCGCGGC GAACATGATC CTCGTGGTCG ACTCCGCATT CAACATCCAG AAGGTATTGG GGATCCTCAA GCACATCGAC ACGGACCAGG TGCGCGAGGG GGCCGAACTG GTCTTTCTTA AGAACGCCGC CGCCGATAGC GTGGCGACGC TGGTTAAGGA CTGGATGGGC GGCAAGTCAT CTAAGCTGCC CGGCGCGGCA GCCACAAACG CAAGCTCCAC CGTCGTCGCC GACAACAGGC TGAACGCTCT GATCATCTTC GGCAGCGACA AGGACAAGGC CGACGTGAAG AAGTTGATCG CGCTGGTAGA CGTGGTCCCC CCCACCACCA GCAGCAAGGT CAACGTCTAC TACCTTGAAA ACGCCGAGGC CGCCGAGGTC GCCAAGGTGC TGGACGGCCT TTTGAAGGGT ACGGCGGCCA CGCCGGCGCC CGTAGCCGGC GCTGCCGCGA CGGCTCCGCA ACAGGCCATC TTCGAGGGGG GGAAGATCAC CATCACCCCG GACAAGTCGA CCAACTCGCT GGTCATCATG GCCTCCCCCA CCGATTACCA GAACCTCTTG CAGGTGATCC AGAAGCTTGA CCGCCGCAGC CGCCAGGTCT TCGTGCAGGC GATGATCGCC GAGGTCTCCG CCAACAAGGC GAAGGAACTG GGCGTGCAGT GGGGCGTGAT CGCCGGAGCC TCCAACGGCA CGCTCTCGAC GGTCGGCACC TTCGATCCCT TCGGCGCCGT GGCCGGCCTG AGCGGCGCCT TGCAACTCGC CGACACATTG GGAATCACAC CACCGGACGG CGGAGTGGCC CTGTTCCCCG CAACGCTCAA GGCGCTGCAC AGTAACGGTG CGCTGAACGT CCTGTCCACC CCGAACATCA TGACCAGCGA CAACAAGGAA GCCGAGATCT TCGTGGGGGA GAACGTCCCC TTCCTCTCCG GTACCAACCT CACCTCCACG GGGCTCTCCC AGCAGTCGAT CGAAAGGAAA GACACCGGTA TCATCCTGAA GATCAAGCCT CAGATCAGCG AGGGCGAATA CATAAAGCTC GACATCTACC AGGAGATCTC GGCGGTGAAG GACTTCGGCA CCGCCACGAA CCCGAACCTC GGTAGCACCA AGCGCTCGGC CAAGACCTCG GTGGTGGTGA AGAACACCGA CACGGTCATC ATCGGCGGAC TGATTCAGGA CACCGACCAG GTGACGGAGA GCAAGATCCC GCTTCTGGGC GACATCCCGC TCCTGGGGTG GCTTTTCAAG ACCAAGCGGA CAACGCGGGA CAAGACCAAC CTCCTGATCA TGCTCACCCC GCGCATCATC AAGGACGCGC GCGACATGGC CGAGGTTTCC ATCAACCAGC GAAACAGCTT CAGCGACGCG GTGAAGACCA GCGAGCCGAT CAACATGGAG CAGGCTCTCA AGGAAAAGCC AAAGTCGGTG ACCGAGGACA AGCCCTAA
|
Protein sequence | MGFARITAML MFLLAAPTLV FAKGVVLNFT DVDIATMVKF VSDLTGKNFI MDDRVKGKIS VFSPAKLSND EAYNVFTSVL ELKGFTVVPA GKVLKIVPTA SARQSGMKVL SEGERGVVND SYQARVIQLE HVAPQEAVAF LQPLVSRDGQ ISPFGAANMI LVVDSAFNIQ KVLGILKHID TDQVREGAEL VFLKNAAADS VATLVKDWMG GKSSKLPGAA ATNASSTVVA DNRLNALIIF GSDKDKADVK KLIALVDVVP PTTSSKVNVY YLENAEAAEV AKVLDGLLKG TAATPAPVAG AAATAPQQAI FEGGKITITP DKSTNSLVIM ASPTDYQNLL QVIQKLDRRS RQVFVQAMIA EVSANKAKEL GVQWGVIAGA SNGTLSTVGT FDPFGAVAGL SGALQLADTL GITPPDGGVA LFPATLKALH SNGALNVLST PNIMTSDNKE AEIFVGENVP FLSGTNLTST GLSQQSIERK DTGIILKIKP QISEGEYIKL DIYQEISAVK DFGTATNPNL GSTKRSAKTS VVVKNTDTVI IGGLIQDTDQ VTESKIPLLG DIPLLGWLFK TKRTTRDKTN LLIMLTPRII KDARDMAEVS INQRNSFSDA VKTSEPINME QALKEKPKSV TEDKP
|
| |