Gene GM21_0609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0609 
Symbol 
ID8135924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp739107 
End bp741014 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content61% 
IMG OID644868226 
Productgeneral secretion pathway protein D 
Protein accessionYP_003020441 
Protein GI253699252 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02517] general secretion pathway protein D 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value9.73081e-17 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGATTTG CTCGCATCAC AGCCATGCTC ATGTTCCTGC TCGCCGCACC GACGCTGGTC 
TTCGCCAAGG GTGTGGTGCT TAACTTCACC GACGTGGATA TCGCCACCAT GGTGAAATTC
GTCAGCGACC TGACCGGGAA GAACTTCATC ATGGACGACC GGGTGAAGGG AAAGATCTCG
GTGTTCTCCC CGGCCAAACT CTCCAACGAC GAGGCGTACA ACGTCTTCAC CTCGGTCCTG
GAACTCAAGG GGTTCACCGT GGTCCCGGCG GGAAAGGTGC TGAAGATCGT TCCCACGGCG
AGCGCCAGGC AGTCGGGGAT GAAGGTCCTC TCCGAGGGTG AGCGGGGGGT AGTGAACGAC
AGCTATCAGG CCCGCGTGAT CCAGCTGGAG CACGTGGCAC CTCAAGAAGC CGTCGCTTTC
CTGCAGCCGC TTGTCTCCAG AGACGGCCAG ATCTCACCTT TCGGCGCGGC GAACATGATC
CTCGTGGTCG ACTCCGCATT CAACATCCAG AAGGTATTGG GGATCCTCAA GCACATCGAC
ACGGACCAGG TGCGCGAGGG GGCCGAACTG GTCTTTCTTA AGAACGCCGC CGCCGATAGC
GTGGCGACGC TGGTTAAGGA CTGGATGGGC GGCAAGTCAT CTAAGCTGCC CGGCGCGGCA
GCCACAAACG CAAGCTCCAC CGTCGTCGCC GACAACAGGC TGAACGCTCT GATCATCTTC
GGCAGCGACA AGGACAAGGC CGACGTGAAG AAGTTGATCG CGCTGGTAGA CGTGGTCCCC
CCCACCACCA GCAGCAAGGT CAACGTCTAC TACCTTGAAA ACGCCGAGGC CGCCGAGGTC
GCCAAGGTGC TGGACGGCCT TTTGAAGGGT ACGGCGGCCA CGCCGGCGCC CGTAGCCGGC
GCTGCCGCGA CGGCTCCGCA ACAGGCCATC TTCGAGGGGG GGAAGATCAC CATCACCCCG
GACAAGTCGA CCAACTCGCT GGTCATCATG GCCTCCCCCA CCGATTACCA GAACCTCTTG
CAGGTGATCC AGAAGCTTGA CCGCCGCAGC CGCCAGGTCT TCGTGCAGGC GATGATCGCC
GAGGTCTCCG CCAACAAGGC GAAGGAACTG GGCGTGCAGT GGGGCGTGAT CGCCGGAGCC
TCCAACGGCA CGCTCTCGAC GGTCGGCACC TTCGATCCCT TCGGCGCCGT GGCCGGCCTG
AGCGGCGCCT TGCAACTCGC CGACACATTG GGAATCACAC CACCGGACGG CGGAGTGGCC
CTGTTCCCCG CAACGCTCAA GGCGCTGCAC AGTAACGGTG CGCTGAACGT CCTGTCCACC
CCGAACATCA TGACCAGCGA CAACAAGGAA GCCGAGATCT TCGTGGGGGA GAACGTCCCC
TTCCTCTCCG GTACCAACCT CACCTCCACG GGGCTCTCCC AGCAGTCGAT CGAAAGGAAA
GACACCGGTA TCATCCTGAA GATCAAGCCT CAGATCAGCG AGGGCGAATA CATAAAGCTC
GACATCTACC AGGAGATCTC GGCGGTGAAG GACTTCGGCA CCGCCACGAA CCCGAACCTC
GGTAGCACCA AGCGCTCGGC CAAGACCTCG GTGGTGGTGA AGAACACCGA CACGGTCATC
ATCGGCGGAC TGATTCAGGA CACCGACCAG GTGACGGAGA GCAAGATCCC GCTTCTGGGC
GACATCCCGC TCCTGGGGTG GCTTTTCAAG ACCAAGCGGA CAACGCGGGA CAAGACCAAC
CTCCTGATCA TGCTCACCCC GCGCATCATC AAGGACGCGC GCGACATGGC CGAGGTTTCC
ATCAACCAGC GAAACAGCTT CAGCGACGCG GTGAAGACCA GCGAGCCGAT CAACATGGAG
CAGGCTCTCA AGGAAAAGCC AAAGTCGGTG ACCGAGGACA AGCCCTAA
 
Protein sequence
MGFARITAML MFLLAAPTLV FAKGVVLNFT DVDIATMVKF VSDLTGKNFI MDDRVKGKIS 
VFSPAKLSND EAYNVFTSVL ELKGFTVVPA GKVLKIVPTA SARQSGMKVL SEGERGVVND
SYQARVIQLE HVAPQEAVAF LQPLVSRDGQ ISPFGAANMI LVVDSAFNIQ KVLGILKHID
TDQVREGAEL VFLKNAAADS VATLVKDWMG GKSSKLPGAA ATNASSTVVA DNRLNALIIF
GSDKDKADVK KLIALVDVVP PTTSSKVNVY YLENAEAAEV AKVLDGLLKG TAATPAPVAG
AAATAPQQAI FEGGKITITP DKSTNSLVIM ASPTDYQNLL QVIQKLDRRS RQVFVQAMIA
EVSANKAKEL GVQWGVIAGA SNGTLSTVGT FDPFGAVAGL SGALQLADTL GITPPDGGVA
LFPATLKALH SNGALNVLST PNIMTSDNKE AEIFVGENVP FLSGTNLTST GLSQQSIERK
DTGIILKIKP QISEGEYIKL DIYQEISAVK DFGTATNPNL GSTKRSAKTS VVVKNTDTVI
IGGLIQDTDQ VTESKIPLLG DIPLLGWLFK TKRTTRDKTN LLIMLTPRII KDARDMAEVS
INQRNSFSDA VKTSEPINME QALKEKPKSV TEDKP