Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0349 |
Symbol | |
ID | 8135656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 426202 |
End bp | 428118 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644867966 |
Product | type II secretion system protein E |
Protein accession | YP_003020188 |
Protein GI | 253698999 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.000000000000754733 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGAGTC ACGGCAACTA CCAAAAAGAG GCGAACCGCA AGGAAGGGGG GGACGACTCC GGCCTCGAGA TCGCCGCCCT GCTAATGAAG TCGGGTTACC TCGCGGACAC CCAGCTCAGT TACGCGCAGC GGGTCAAGTC GAAACTCCTC TCCCCGAGAA CCCTGGTCGG CGTACTGCTG GAGCTCGGCT TCTTCACCCG GGAGCAGCTC AGGGAGACGC TCCGGAGCAA CATGGTCTCG GTGAAGCTCG GGGCCCTGCT GGTGGAGCTT GGCTACCTGA AACCGGTAGA GTTGCAGGCC GCCCTCGGGA TACAGCGGGA CGGGGAAAAC TGCAAGATGC TGGGCGAGAT ACTGGTGGAA CAGCGCTTTA TCGAGGAGTA CACCCTGGCG GAAGTGCTCG CCTTCCAGCT GGGGTTCCCC TTCATAGACC TGGACGCGGC CAGCATCGAC CGGGCGCTTT TGGCCCGGGT CCCCCAGCAC TGGCTGGCGC AGCACAGCTT CGTTCCCGTC AAGGAGGAGG AGGGGAAGGT CCTGGTGGCG ATCGCGGACC CCCTGAACCT GGAGGGGAGA AAGACGGCGG AGAGGCTCTT CGGCCAGAGC ACCACCTCCT TCGCCATCTG CACCATGAAG GCGATCCGCG ACGCGCTCAG CCTGCTGAAG CGGGGGATGG TACAGGGGGA CGGCACGGCG ACCGACGAGC ACACGGTCAC CGGCATCGTG AACTCCATCT TCGAGGAGGC CCTCAAGGAG GGGGCCAGCG ACATACATAT AGAGCCGATG CGCAGCGCGC TCAGGGTACG CTTCAGGCGG GACGGGATGC TGGTCCTTTA CAAGGACTTC GCCAGGGAGC TGGCGCTTCC CGTAAGCAGC AGGATCAAGG TGATGGCCGA GGCGGACATC GCCGAGAGGA GGAGGCACCA GGACGGCCGG ATCTGCTACG AAAGCGCGAA GAGCGGCAAC ACCCTGGACA TGAGGGTCTC CTTCTACATC ACCATCCACG GCGAAAAGAT CGTGCTCCGC CTATTAAGCA TGAAGGGGGA GCTGCTCGAC CTCAAGGAGA TCGGCATGCC CGGGCGCATG CTGGAGCGCT TCATCGACGA CGCGCTGGAC ACCCCAAGCG GCGTCCTGAT CGTCACCGGG CCTACCGGCA GCGGCAAGAC CTCGACGCTC TACAGCTGCG TGCACCACCT AAACGACCTG AACACCTCCA TCGTCACCGC CGAGGAGCCG GTTGAATACG TGATAGAGGG GATCGCCCAG TGCTCCATCA ACCAGAAGAT CGGGGTGACC TTCGAGGAAA CCCTCAGGCA CATCGTGCGC CAGGACCCCG ACGTGATCGT CCTGGGGGAG ATCCGGGACA GCTTCTCCGC CGAGACCGCG ATCCAGGCGG CGCTCACCGG CCACAAGGTC CTCACCACCT TTCACACCGA GGACAGCGTA GGGGGGCTCT TGCGCCTGAT GAACATGAAC ATCGAGGCGT TCCTGATCGC CTCGACCGTG GTCTGCGTCC TGGCGCAGCG GCTTTTGCGC CGCATCTGCC CGGAGTGCAG CGAGAGCTAT CTCCCCACCC CGACCGAATT GAGGAGAATC GGCTACAGCA ACGCCGACCT CAGGGGGGCC GAGTTCAGGA TCGGCCGGGG ATGCGCCAAG TGCAGGTACA GCGGCTACCG CGGCAGGGTC GGCGTATTCG AGATGCTGAT ACTGAACGAA CTGGTGAAGG ACGCCATCCT CAGCAAGAAG ACCTCCTACG AGATCAGGAA GATCTCCACG GAGAGCACCG GAATGGTCAC GCTCCTGGAA TCGGGGCTCT GTAAGGCGAC CTCTGGGCTG GTATCGCTGC ACGACACGGT GCGGCTCCTC CCCAGGATCG GCAAACCGCG CCCGATCGCA GAGATCAGGC GCCTTTTGGG GGATTGA
|
Protein sequence | MESHGNYQKE ANRKEGGDDS GLEIAALLMK SGYLADTQLS YAQRVKSKLL SPRTLVGVLL ELGFFTREQL RETLRSNMVS VKLGALLVEL GYLKPVELQA ALGIQRDGEN CKMLGEILVE QRFIEEYTLA EVLAFQLGFP FIDLDAASID RALLARVPQH WLAQHSFVPV KEEEGKVLVA IADPLNLEGR KTAERLFGQS TTSFAICTMK AIRDALSLLK RGMVQGDGTA TDEHTVTGIV NSIFEEALKE GASDIHIEPM RSALRVRFRR DGMLVLYKDF ARELALPVSS RIKVMAEADI AERRRHQDGR ICYESAKSGN TLDMRVSFYI TIHGEKIVLR LLSMKGELLD LKEIGMPGRM LERFIDDALD TPSGVLIVTG PTGSGKTSTL YSCVHHLNDL NTSIVTAEEP VEYVIEGIAQ CSINQKIGVT FEETLRHIVR QDPDVIVLGE IRDSFSAETA IQAALTGHKV LTTFHTEDSV GGLLRLMNMN IEAFLIASTV VCVLAQRLLR RICPECSESY LPTPTELRRI GYSNADLRGA EFRIGRGCAK CRYSGYRGRV GVFEMLILNE LVKDAILSKK TSYEIRKIST ESTGMVTLLE SGLCKATSGL VSLHDTVRLL PRIGKPRPIA EIRRLLGD
|
| |