Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0066 |
Symbol | |
ID | 8135365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 83210 |
End bp | 84868 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644867683 |
Product | General secretory system II protein E domain protein |
Protein accession | YP_003019911 |
Protein GI | 253698722 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 7.364280000000001e-24 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGGGC TTGTCAAGGA AGGATCCATC GGCGAGATCC TTTTCAAATC GCAGATCATC ACGGAGCACG AACTGAGGGC GGCGCTCGAA GCGCAGAAGG TCTCGGGATG CCGGGTGGGC GAGGCGCTGG TCCGCCTGGG GGTGGTCACC CAGGAGGATA TCGACTGGGC GCTCGCCAAC CAGCTGAACA TCCCCTACGT GCGGCTCAAG AAGGAAAACA TCGATCCCGC CGCGGTGGCG AAGGTCCCGG GACAACTGGC CCGACGCTAC AGCCTCTGCC CCATCTTTCT CTCCGGCAAC GAACTCTCCG TCGCCATGGC GGACCCCCTG AACAAGGAGG CGGTCGAGGA GATCACCCGG GCGACAGGCT GCCAGATCAG CATCTCGGTG GGGCTCATCC GCGAGATCCG CGAGATGCAC GACGCCATGT ACGGCCCGGA CCAGAACCTC CCGGAACTGG GGTTCAGCTC GGGGCACTTC CCCGCCAAGG TCCTATCCGC CATCAACGCC GACCTCTCCG GCGCCATGCT CCTCAACCAC CTCCTCTTGC GCGCAGTACA GCAGAAGTTC GTCTCGGTAG CGCTGCAGCC GCTGGGGGAC CAGGTGCGGG TGCTGGCGCG CGGCGAAGGG AGGACGGCTG AGTTCGGCAA GCTCTCCGCG ACCCACTACG GACGGCTCAC CGAGCGTATC CGCCGCCTCT CAGGCATCGA CGGCGCCGAG GAGAACCCCT CCAGCGGCGT TTTGACCTTC ATCTGGCAGG GGAAAAGGAT CCCGTTCCAG ACCCTGGCGA TGCCCGGCAA CGGGGGGGAT TACCTCACCT TGAAGCTGCA CGTCGGCGCG CCGAAAATCT CCGAGCTGGA CGACCTCGGC GTCTCCGCCG CGAAACGCGC GGACCTGAAG GCGCTCGCTT CCGAAAAGGA GGGGCTGATC CTCTTCACCG GGCGCGACCC GGAGGAGCGG AGCCGGCTCA TGGACCTGTT CCTGGATGCC TGCGATCACG CCGACCGCAC CGTCATTTTG GTGGGGGAAA GGCTTGGGCG CGGCAGGGAC CGGTGGCCGC GGCTTCCGGC AGGGAGATGC GGCGCGGACG ATACCGCGAA GGTGGTGTCG GCGGCCTTGG AGCATGACCC GGACACGCTG GTCATCGAGG ACGTCACCGA ACTGGCCTCC TTCATAGCGG CGAGCAAGGC GGTGATGCGG GGGAAGCTCG TGGTGGCGGG GATGTCCCAG GGGAACAAGG GGGCGGTGTT GAAGCAGCTT TTGTACCTCT CCCAGAAGAA CTTCCTGATA CCGACCCACC TGAAGGGGGT GGTTTCCTGC AAGAGCGTGC TCCTCCTTTG CCCGGACTGC AAGAAGCGTT TCGCGCCTGC CGCCGACGAG CTGGCGGCCT TGCGGCTTAG GGCGACGGCG CCCGAGTATT TCCGCCCGAC CGGCTGCCCC TCCTGCGACC AGACAGGCTA TAGCGGCAAG AAATACCTCC TGGACGTGAT CCGATTCGAT CAGGGGCTCC TGGAGGCGTT CGAGGTGATC CGCGATTCCG ACGAGATCAT CCGCCACCTC AAAGACAACG GCTACCGCGG CATCGGCGAG GAAGGGGCCG AGCTGCTGGA GCGGGGAGAA ATATCCCCGG GCGAGTACGT CGCTTCCATA CTACTGTAA
|
Protein sequence | MNGLVKEGSI GEILFKSQII TEHELRAALE AQKVSGCRVG EALVRLGVVT QEDIDWALAN QLNIPYVRLK KENIDPAAVA KVPGQLARRY SLCPIFLSGN ELSVAMADPL NKEAVEEITR ATGCQISISV GLIREIREMH DAMYGPDQNL PELGFSSGHF PAKVLSAINA DLSGAMLLNH LLLRAVQQKF VSVALQPLGD QVRVLARGEG RTAEFGKLSA THYGRLTERI RRLSGIDGAE ENPSSGVLTF IWQGKRIPFQ TLAMPGNGGD YLTLKLHVGA PKISELDDLG VSAAKRADLK ALASEKEGLI LFTGRDPEER SRLMDLFLDA CDHADRTVIL VGERLGRGRD RWPRLPAGRC GADDTAKVVS AALEHDPDTL VIEDVTELAS FIAASKAVMR GKLVVAGMSQ GNKGAVLKQL LYLSQKNFLI PTHLKGVVSC KSVLLLCPDC KKRFAPAADE LAALRLRATA PEYFRPTGCP SCDQTGYSGK KYLLDVIRFD QGLLEAFEVI RDSDEIIRHL KDNGYRGIGE EGAELLERGE ISPGEYVASI LL
|
| |