Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2606 |
Symbol | |
ID | 8137948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3039092 |
End bp | 3041902 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644870212 |
Product | polysaccharide export protein |
Protein accession | YP_003022402 |
Protein GI | 253701213 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 126 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAA TCCCGCTGTT GGCTGTTGCA TTATACTTGT TGACTGCTGT CACCTCATAT TCCGCAAGTC TGTATGACGG CTCCTCCTCG ACGTCCACTT CTTACGCCGG CACCTCCGCC ACTACCACGC AAGTGACAGC TCCGACCTCC TCCAGTTACT TCACCATCCC CGCACCGACT GGGCAGGGTT CAGGTTCGTC ATACTACCAA TATCCCAATC AGCCGCAGAC CGCCGGGCAG ACGCAACCTC TGGGGGCGCC GCAACAGACC CAGGTGCTTC AGCAAGGGCA GGTGCCTATG CAGGGCCAAG TGCTGATGCA AGGTCAGGTG CCGATGCAGG GTCAGGTGCC GATGCAGGGT CAGGTGCCGA TGCAGGGGCA GGTACCGCAG CAGGGACAAG TGCCGCAGCA GGGACAGGCC GAGGCGCAAC GGTTCGGTCA GCAGGGGCAA AACAGCAATC AAACGGCGTC AACTCCATAC CAGCAGTCCT CTCAAGTGCC TCAACCGGGT CAGGAGTGGG GTAAACAGCC GTTGCCCCAA GCGTGGCAGC AGTGGGTCCC TCCGGCAGAG CCTTCAGCGC TGGAAAAGGC CATGGGAGCA GATCCTGTAA CCGCTGAGAG CTCCCTCTCC CAAGCCTACG GAGTGGGACC TCTCACCCAG TTCGGTTACA GCTTCTTTCA TCCGGAAGTG TCCGACCCCT TATCAACGCA GATGGACATC CCCGTAGGGC CCGATTACAT ACTCGGCCCG GGCGATAGGC TGCTGGTGAC CCTTTGGGGC AGCATCGACG GGACCTGGGA GCTCGAAGTC AGCCGTAGCG GCGAAATCGT CCTTCCCAAA GTCGGGTCCG TCAAGGTCGC AGGCCAGAGC TTCGGCCAGG TGCCGTCGCT GCTCAAGAAC ACCCTGGCGC GCATCTTCAA GGACTTCCAG CTCAACGTCA ACATGGGCAG GCTCCGGGTG ATCAAGGTGT ACGTGGTGGG AGAGGTGACC TCTCCCGGTG ACTACAACAT AAGCTCCCTT TCCACTTTGG TTAACGCCCT TGCCGCAGCA GGCGGCCCCA CCAAAAGCGG CAGCCTCAGG AATATCAAGA TCAACCGGAA CGGCAAGGTC GTCGAAACTG TCGACCTTTA CGACTTCTTC CTCAACGGTG ACAAAGGGAA GGACATTCGC CTGCAGCCCG GGGACACCGT GCTGGTCCCG GTGATTGGGC AGGTGGCCGG AATAGCAGGC AACGTCCGTC GCCCCGGGAT CTATGAGCTG AAAGGTGAAA GCACCCTGAA GGACCTTCTG GCCCTCTGCG GAGGGATCGA TTCCACGGCC TATCTGCAAA GGCTGCAGAT GTACCGTGTC CAGGCGCACG ACAAGAAGAT GGTGACCGAG ATTGCCTTTG AGCCCGGCAA GAATGTAGAA GACGCGACTG CCGGCATCAA ACTGCAGGAT CTCGATCTGG TGAAGGTGCT CTCCATCGAT ACCGTACTGC GCGGGTACGT GCGTCTGGAG GGACACGTGC TGCGTCCCGG TGACTATGCA CTAAAGCCCG GGATGCGCGT GAGCGCCCTC TTGCGGGGAG ACAACCTGCT GCCGGAATAC CACGGCGTAG CCGGGCAGAT CACGCGGCTC TTCCAACCCG ACCTGCACCC GGAGATCGTT TACTTCAACG TGAATGGCGC GCTGAACGGC GAGCCGGAAC AAGACCTGGA GTTGAAGGAA TTCGACCGGG TGAGAATCTT CTCCAAGCTG GAAATGGAGG AAATTCCACT GGTTCAGGTG AGCGGAGAGG TGCTGAGGCC CGGACAGTAC CGCTATCTCG AGAACATGAC CGTGCGCGAC CTGCTCATGC AGGCAGGGAA CGTGAGGCTC AGTGCCTACA TGAAAAATGC CGAGGTAGTG AGGCTTAGAC GTTTAGATTC GGGCGTAGTC ACCTATGTTA TTCCGGTGGA CCTGGAAGCG GCCCTCAAAG GTGGAGAAGA AAACATCAAG CTGCAACCGT TTGACGAGCT CACGGTCCGG CGCATCCCCA ACTGGGCCGA GGCCACAGAC CGCTACGTCA CCTTGAACGG CGAGTTCGTC TTTCCCGGCA CCTACCCCAT CCATAAGGGA GAGCGGTTAA GCTCCGTCAT CGTCCGCGCC GGCGGATTCT CCCCGCTCGC CTATATCAAG GGGGCTAGGT TCACCAGGGA AAGCGTCCGT AACCTGCAGC AACAGCGCAT GGATGAGGCG CTCCAGAAGG CGCAGGAGGA CATCATCAAC CTGCAGACCA ACATGGCGCA GACCGCAGCC TCGGCCGAGG AGGTCGCAGC AGCCAAAACA ACACTGGACG GGCTGTCGCG CAGCGTGGAG ATCCTGAAGA CCAAGAAGGC CGAAGGGCGC ATGTTGATGG AGATCGCATC GCTGGATGAA CTGAAGGGAG GCGCTTACGA CCTCGAACTG CAGGGAGGGG ATCAATTGAT TGTTCCCAGC GACCCCGGCG GCGTCAACGT GATCGGCAAC GTCTACAACC AGAGTTCCGT CGTGAGCCAG CAGGGCCGCG ACGTCGACTG GTACCTTCAG CAGGTCGGTG GACCGACGGG CGAAGCCGAC ACGAGTGAAA TCTACGTGGT CAAGGTGAAC GGCTCCGTCG TCAGCCAGGC CAATTCGGCA AACTTCATGT TCTACAACTC CTTCTGGGGC AAAGAACTCA ACTCCGGCGA TACCATCATC GTCCCGCGCC AGTACGAGAA GACCGCATGG CTCAGGTACA TCAAGGAGAT CACCCAGATC CTCGGTAACG TCGCCATGAC TGCAGGCGTT GTAATCGCTG CGGGGAAGTA A
|
Protein sequence | MKRIPLLAVA LYLLTAVTSY SASLYDGSSS TSTSYAGTSA TTTQVTAPTS SSYFTIPAPT GQGSGSSYYQ YPNQPQTAGQ TQPLGAPQQT QVLQQGQVPM QGQVLMQGQV PMQGQVPMQG QVPMQGQVPQ QGQVPQQGQA EAQRFGQQGQ NSNQTASTPY QQSSQVPQPG QEWGKQPLPQ AWQQWVPPAE PSALEKAMGA DPVTAESSLS QAYGVGPLTQ FGYSFFHPEV SDPLSTQMDI PVGPDYILGP GDRLLVTLWG SIDGTWELEV SRSGEIVLPK VGSVKVAGQS FGQVPSLLKN TLARIFKDFQ LNVNMGRLRV IKVYVVGEVT SPGDYNISSL STLVNALAAA GGPTKSGSLR NIKINRNGKV VETVDLYDFF LNGDKGKDIR LQPGDTVLVP VIGQVAGIAG NVRRPGIYEL KGESTLKDLL ALCGGIDSTA YLQRLQMYRV QAHDKKMVTE IAFEPGKNVE DATAGIKLQD LDLVKVLSID TVLRGYVRLE GHVLRPGDYA LKPGMRVSAL LRGDNLLPEY HGVAGQITRL FQPDLHPEIV YFNVNGALNG EPEQDLELKE FDRVRIFSKL EMEEIPLVQV SGEVLRPGQY RYLENMTVRD LLMQAGNVRL SAYMKNAEVV RLRRLDSGVV TYVIPVDLEA ALKGGEENIK LQPFDELTVR RIPNWAEATD RYVTLNGEFV FPGTYPIHKG ERLSSVIVRA GGFSPLAYIK GARFTRESVR NLQQQRMDEA LQKAQEDIIN LQTNMAQTAA SAEEVAAAKT TLDGLSRSVE ILKTKKAEGR MLMEIASLDE LKGGAYDLEL QGGDQLIVPS DPGGVNVIGN VYNQSSVVSQ QGRDVDWYLQ QVGGPTGEAD TSEIYVVKVN GSVVSQANSA NFMFYNSFWG KELNSGDTII VPRQYEKTAW LRYIKEITQI LGNVAMTAGV VIAAGK
|
| |