Gene GM21_0349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0349 
Symbol 
ID8135656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp426202 
End bp428118 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content63% 
IMG OID644867966 
Producttype II secretion system protein E 
Protein accessionYP_003020188 
Protein GI253698999 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.000000000000754733 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGAGTC ACGGCAACTA CCAAAAAGAG GCGAACCGCA AGGAAGGGGG GGACGACTCC 
GGCCTCGAGA TCGCCGCCCT GCTAATGAAG TCGGGTTACC TCGCGGACAC CCAGCTCAGT
TACGCGCAGC GGGTCAAGTC GAAACTCCTC TCCCCGAGAA CCCTGGTCGG CGTACTGCTG
GAGCTCGGCT TCTTCACCCG GGAGCAGCTC AGGGAGACGC TCCGGAGCAA CATGGTCTCG
GTGAAGCTCG GGGCCCTGCT GGTGGAGCTT GGCTACCTGA AACCGGTAGA GTTGCAGGCC
GCCCTCGGGA TACAGCGGGA CGGGGAAAAC TGCAAGATGC TGGGCGAGAT ACTGGTGGAA
CAGCGCTTTA TCGAGGAGTA CACCCTGGCG GAAGTGCTCG CCTTCCAGCT GGGGTTCCCC
TTCATAGACC TGGACGCGGC CAGCATCGAC CGGGCGCTTT TGGCCCGGGT CCCCCAGCAC
TGGCTGGCGC AGCACAGCTT CGTTCCCGTC AAGGAGGAGG AGGGGAAGGT CCTGGTGGCG
ATCGCGGACC CCCTGAACCT GGAGGGGAGA AAGACGGCGG AGAGGCTCTT CGGCCAGAGC
ACCACCTCCT TCGCCATCTG CACCATGAAG GCGATCCGCG ACGCGCTCAG CCTGCTGAAG
CGGGGGATGG TACAGGGGGA CGGCACGGCG ACCGACGAGC ACACGGTCAC CGGCATCGTG
AACTCCATCT TCGAGGAGGC CCTCAAGGAG GGGGCCAGCG ACATACATAT AGAGCCGATG
CGCAGCGCGC TCAGGGTACG CTTCAGGCGG GACGGGATGC TGGTCCTTTA CAAGGACTTC
GCCAGGGAGC TGGCGCTTCC CGTAAGCAGC AGGATCAAGG TGATGGCCGA GGCGGACATC
GCCGAGAGGA GGAGGCACCA GGACGGCCGG ATCTGCTACG AAAGCGCGAA GAGCGGCAAC
ACCCTGGACA TGAGGGTCTC CTTCTACATC ACCATCCACG GCGAAAAGAT CGTGCTCCGC
CTATTAAGCA TGAAGGGGGA GCTGCTCGAC CTCAAGGAGA TCGGCATGCC CGGGCGCATG
CTGGAGCGCT TCATCGACGA CGCGCTGGAC ACCCCAAGCG GCGTCCTGAT CGTCACCGGG
CCTACCGGCA GCGGCAAGAC CTCGACGCTC TACAGCTGCG TGCACCACCT AAACGACCTG
AACACCTCCA TCGTCACCGC CGAGGAGCCG GTTGAATACG TGATAGAGGG GATCGCCCAG
TGCTCCATCA ACCAGAAGAT CGGGGTGACC TTCGAGGAAA CCCTCAGGCA CATCGTGCGC
CAGGACCCCG ACGTGATCGT CCTGGGGGAG ATCCGGGACA GCTTCTCCGC CGAGACCGCG
ATCCAGGCGG CGCTCACCGG CCACAAGGTC CTCACCACCT TTCACACCGA GGACAGCGTA
GGGGGGCTCT TGCGCCTGAT GAACATGAAC ATCGAGGCGT TCCTGATCGC CTCGACCGTG
GTCTGCGTCC TGGCGCAGCG GCTTTTGCGC CGCATCTGCC CGGAGTGCAG CGAGAGCTAT
CTCCCCACCC CGACCGAATT GAGGAGAATC GGCTACAGCA ACGCCGACCT CAGGGGGGCC
GAGTTCAGGA TCGGCCGGGG ATGCGCCAAG TGCAGGTACA GCGGCTACCG CGGCAGGGTC
GGCGTATTCG AGATGCTGAT ACTGAACGAA CTGGTGAAGG ACGCCATCCT CAGCAAGAAG
ACCTCCTACG AGATCAGGAA GATCTCCACG GAGAGCACCG GAATGGTCAC GCTCCTGGAA
TCGGGGCTCT GTAAGGCGAC CTCTGGGCTG GTATCGCTGC ACGACACGGT GCGGCTCCTC
CCCAGGATCG GCAAACCGCG CCCGATCGCA GAGATCAGGC GCCTTTTGGG GGATTGA
 
Protein sequence
MESHGNYQKE ANRKEGGDDS GLEIAALLMK SGYLADTQLS YAQRVKSKLL SPRTLVGVLL 
ELGFFTREQL RETLRSNMVS VKLGALLVEL GYLKPVELQA ALGIQRDGEN CKMLGEILVE
QRFIEEYTLA EVLAFQLGFP FIDLDAASID RALLARVPQH WLAQHSFVPV KEEEGKVLVA
IADPLNLEGR KTAERLFGQS TTSFAICTMK AIRDALSLLK RGMVQGDGTA TDEHTVTGIV
NSIFEEALKE GASDIHIEPM RSALRVRFRR DGMLVLYKDF ARELALPVSS RIKVMAEADI
AERRRHQDGR ICYESAKSGN TLDMRVSFYI TIHGEKIVLR LLSMKGELLD LKEIGMPGRM
LERFIDDALD TPSGVLIVTG PTGSGKTSTL YSCVHHLNDL NTSIVTAEEP VEYVIEGIAQ
CSINQKIGVT FEETLRHIVR QDPDVIVLGE IRDSFSAETA IQAALTGHKV LTTFHTEDSV
GGLLRLMNMN IEAFLIASTV VCVLAQRLLR RICPECSESY LPTPTELRRI GYSNADLRGA
EFRIGRGCAK CRYSGYRGRV GVFEMLILNE LVKDAILSKK TSYEIRKIST ESTGMVTLLE
SGLCKATSGL VSLHDTVRLL PRIGKPRPIA EIRRLLGD