Gene Information |
Locus tag | GM21_2477 |
Symbol | |
ID | 8137818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2896671 |
End bp | 2898170 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644870087 |
Product | polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
Protein accession | YP_003022278 |
Protein GI | 253701089 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
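The coordinate and length fields above agree with each other under two conventions that the record does not state and that are assumed here: coordinates are 1-based and inclusive, and the gene length counts one terminal stop codon that the protein length does not. A minimal Python sketch of that consistency check follows; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2896671, 2898170)
print(length, protein_length(length))  # 1500 499
```

Both results match the Gene Length (1500 bp) and Protein Length (499 aa) fields.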
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 136 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGTCGC CAGAGACCGA GTACAAAAAG TATCTGCAAC TGTTGTTCAG CAACAAGGAG CGATTTGTCG TCATCGCCCT GCTGCTGATG ACCGTGGCCT TCGTGGTCAG TTTCGTGCTT CCGCGCAAGT ACCAGGCCAC GAGCACCGTA TTTATCGAGA AGAACGTGAT CAGCGAGCTG GTCAAGGGGA TCACGGTGAC CCCCTCCATG GAAGACACCA TCAACGTCCT CACCTACGAG ATCACCAGCC GCACCTTGCT CGCCAAGGTC GTGGACAACC TCGATCTGGA TCTGGGCAAG AACGACAGCG AGACCGAGGA GCTGATCAAG CAGCTGCAGC TGAACACCAA GGTGAAGGTG AAGGACAAGA ATCTCTTCAC CATCTCCTTC ACCCACACCA ATCCCAGCAT AGCCAGGGAC TACGTCAACA CGCTCGTGCG CCTTTACATC GAAGGGAACA TCTCCTCCAA ACGCGGCGAG TCCTATGACG CCACCAAATT CCTCTCCGAA CAGATCGACA CCTTCAACGA GAAGCTGCAA AAGGCCGAGA ACGAGGTCAA CGCCTACAAG CGCGATAAGG GGGGGATCAT CGCCATCGAC GAGGGGAAGC TCTTCGAGGA GATCAACATC GCGCAGCAGA AGCTCTACGA CCTCGAACTC AGGCGCCGCC AGTTGGAAGG GATGCGCCAG ATCACCAAGA GGACCGGTGA CCCCCTGCAG AACAGGCTCG CCGGGCTGCA GAAAAGGCTC GACGAACTGC TGGTCGAGTA CACCGAAAAT TTCCCCGAGG TGGTGAAGGT CAAGGGGGAC ATCGAGACGG TGAAGGCTCA GCTGTCGGCG CGCCGGGGTC AGCAGTCCCA GTCGCTCGAT CCCGCGGAGC TGGCCAAGAT CGAATCCGAA ATCTCCGCGA TCAAAATCAC TGAAAGCGGC CTGAGGCGCT ACATAGACAC CAACCGTTCC CTCTTGCAGA CCATCCCTTC GGCTAAGGCG GGACTCGAGA AGCTGGAGTT GGAGAAGAAA AACCAGAAGA ACATCTACGA CCAGCTCTTT GCCCGTCACG GGCAGTCCGA GGTCTCCAAG CAGATGGAGG TGCAGGACAA ATCCACCACC TTCCGCATCG TCGACCCGGC CCTGCTCCCG GTCAAGCCTT CCAGTCCGGA TCGGCTGAAG CTGATGCTGC TGGGGATGGT GGGGGGGGTG GCGGGAAGCT TCGCGCTGCT TTTCCTGATC GACCAGATGA ACAACACGGT GAAAGAGGTG GAGTTCGTAA AGGGGCTGGG GGTGCCGGTC CTGGCGGTCA TTCCGAGGCT GCAGGATCCG GAGGTCGAGG CGAAGAGGCG CAGGCGCTCG CGGCTGATCC TTGGGGGGGC GCTTATGTAC TTCCTGGTGT TGATGGTTTT CCCCGGCATG GAACTCCTGG GGCTCCCTTA TATGGACAAG GTGTTGGACT TATTGTCCGG GCCGGAAGCC CGGTTGCGGA TCAAGGGGCT TTTGCAGTGA
Protein sequence | MQSPETEYKK YLQLLFSNKE RFVVIALLLM TVAFVVSFVL PRKYQATSTV FIEKNVISEL VKGITVTPSM EDTINVLTYE ITSRTLLAKV VDNLDLDLGK NDSETEELIK QLQLNTKVKV KDKNLFTISF THTNPSIARD YVNTLVRLYI EGNISSKRGE SYDATKFLSE QIDTFNEKLQ KAENEVNAYK RDKGGIIAID EGKLFEEINI AQQKLYDLEL RRRQLEGMRQ ITKRTGDPLQ NRLAGLQKRL DELLVEYTEN FPEVVKVKGD IETVKAQLSA RRGQQSQSLD PAELAKIESE ISAIKITESG LRRYIDTNRS LLQTIPSAKA GLEKLELEKK NQKNIYDQLF ARHGQSEVSK QMEVQDKSTT FRIVDPALLP VKPSSPDRLK LMLLGMVGGV AGSFALLFLI DQMNNTVKEV EFVKGLGVPV LAVIPRLQDP EVEAKRRRRS RLILGGALMY FLVLMVFPGM ELLGLPYMDK VLDLLSGPEA RLRIKGLLQ
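The relationship between the two sequences can be sketched in Python. The record lists translation table 11, whose amino-acid assignments are the same as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model). The sketch also assumes the gene sequence is printed 5'→3' on the coding strand, which is consistent with its ATG start and TGA end even though the Strand field is "-". All names are illustrative, and only the first and last few codons are used to keep the example short.

```python
# Standard amino-acid assignments in NCBI codon order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_codons = "ATGCAGTCGC CAGAGACCGA GTACAAAAAG"  # first 30 bases of the gene
last_codons = "TTGCAGTGA"                          # last 9 bases of the gene
print(translate(first_codons))  # MQSPETEYKK
print(translate(last_codons))   # LQ
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the Protein sequence and GC content fields to be checked against the record.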