Gene GM21_0608 details


Gene Information

Locus tag: GM21_0608
Symbol:
ID: 8135923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 737478
End bp: 739040
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 64%
IMG OID: 644868225
Product: general secretory pathway protein E
Protein accession: YP_003020440
Protein GI: 253699251
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
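
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both included."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


# Values copied from the record above.
print(inclusive_span(737478, 739040))   # 1563
print(expected_protein_length(1563))    # 520
```

Under these assumptions, both results agree with the Gene Length (1563 bp) and Protein Length (520 aa) fields.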


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0000000000000611826
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCAGAAA GCTTAGACAT AGAAGAGATA GCGACACGGC TCGGCCTTCC CTACCAGGCC 
GAGATCGACG ACTCCAAGGT AGACGGGGCG CTGGTGAACC GGGTCCCGCT CAACTTCGCC
CGCAACAACC TCCTGCTCCC GCTGCACGAG GAGGACGGCT CCATCGTCGC CGCCAGCGCC
GACCCGACGA ACCTCCTGGC GCTGGACGAG ATGGCCGGAC TCTTCTCGCA ACCGGTCTCC
GTGGTCGCGG TCCCGCGCCC GGCTCTCCTG GACGCGGTGA ACCGCCTCTA CTCCCGCCTC
TCCGGTTCCG CCCAGGAGGT GGTCGAGGAA CTCGAAGGGG AAGAGCTCTC AACACTCGCC
ACCACCTTCA ACGAGCCGCG CGACCTTATG GAGCTGACCG ACGAGGCGCC GGTGATCCGG
CTCTTGAACT CCATCCTGTT TCAGGCCGTG AAGGAGCGCG CGAGCGACAT CCACATCGAG
CCGTTCGAGC GCGAGCTCGA GGTCCGCTTC CGGATCGACG GCCTTTTGTA CAAGATGCTC
TCCCCCCCCA AGGTGATCCA GGAGGCGCTC ACCTCCCGCG TCAAGATCAT GTCCGGGCTC
AACATCGCCG AGAAGAGGCT GCCGCAGGAC GGGCGCATCA GGGTCAAGGT GGCCGGACGC
GACGTCGACA TCCGCGTGTC GCTCATCCCC ACCTTCTTCG GGGAGCGCGT GGTATTGAGG
CTTTTGGACA AGCAGCGCGG CGTCCTCTCC CTCAAAGAGA TCGGGCTCTC CGACGGCAAC
GACCGGCTGA TGGACCGGCT TTTGTCGAGG ACGAGCGGCA TCATCCTGGT CACGGGACCT
ACCGGCAGCG GCAAGACCAC CACGCTCTAC GCGGCACTTT CCCAGATCAA CTCGCCGGAG
AAGAACATCA TCACCGTCGA GGACCCGATC GAGTACCAGT TGAAGGGTGT GGGGCAGATA
CAGGTCAACC CGAAGATCGA CCTCACCTTC GCCGCGGGGC TTCGCTCCAT CCTGAGGCAG
GACCCGGACA TCGTCATGAT CGGTGAGATC CGCGACGCCG AGACCGCCGA GATCGCCATG
CAGGCCTCGC TCACCGGTCA CCTGGTCCTC TCGACGCTGC ACACAAACGA CGCCGCCACC
GCGGTCACCC GCCTGATCGA CATGGGGATC GAGCCGTTCA TGGTCGCCTC TTCGCTCTCC
GCGGTACTAG CACAGCGGCT GGTCCGCGTC ATCTGCCCGC ACTGCAAGGA ATCCTACGTC
CCGGACCGCA GCTACCCCGG CGTCGAACTC CCGCCGCTTC TGTATCGGGG CCGCGGCTGC
GAGAAGTGCT TCAACCTGGG GACCATCGGG CGGGTCGGCA TCTACGAGCT TCTTCCCATC
GACGCAGAAC TCTGCTCCAT GATCATCCGA CAGGCCTCCT CCGGCGCCAT CAAGGAGTAC
GCCGTCTCCA AAGGGATGCG CACCCTGCGC GAGGACGGCC TCGCCAAGGC GGCCCAGGGG
ATCACCACCA TCGAGGAGGT CCTGAGGGTA ACCCAGGACG ATTATGCCGA CCTTTCGCTA
TAG
 
Protein sequence
MAESLDIEEI ATRLGLPYQA EIDDSKVDGA LVNRVPLNFA RNNLLLPLHE EDGSIVAASA 
DPTNLLALDE MAGLFSQPVS VVAVPRPALL DAVNRLYSRL SGSAQEVVEE LEGEELSTLA
TTFNEPRDLM ELTDEAPVIR LLNSILFQAV KERASDIHIE PFERELEVRF RIDGLLYKML
SPPKVIQEAL TSRVKIMSGL NIAEKRLPQD GRIRVKVAGR DVDIRVSLIP TFFGERVVLR
LLDKQRGVLS LKEIGLSDGN DRLMDRLLSR TSGIILVTGP TGSGKTTTLY AALSQINSPE
KNIITVEDPI EYQLKGVGQI QVNPKIDLTF AAGLRSILRQ DPDIVMIGEI RDAETAEIAM
QASLTGHLVL STLHTNDAAT AVTRLIDMGI EPFMVASSLS AVLAQRLVRV ICPHCKESYV
PDRSYPGVEL PPLLYRGRGC EKCFNLGTIG RVGIYELLPI DAELCSMIIR QASSGAIKEY
AVSKGMRTLR EDGLAKAAQG ITTIEEVLRV TQDDYADLSL
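
Both sequences above are laid out in blocks of 10 characters, six blocks per line. As a hedged sketch (the helper names are hypothetical and not part of the original record), the blocks can be joined into one continuous string before computing a length or GC fraction:

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a nucleotide sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGGCAGAAA GCTTAGACAT AGAAGAGATA GCGACACGGC TCGGCCTTCC CTACCAGGCC"
seq = join_blocks(first_line)
print(len(seq))              # 60
print(seq.startswith("ATG")) # True
print(gc_fraction(seq))
```

Applying the same two functions to the full gene sequence would allow the Gene Length and GC content fields to be compared against the sequence itself.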