Gene Information |
Locus tag | GM21_0779 |
Symbol | |
ID | 8136094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 927761 |
End bp | 928831 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868396 |
Product | peptidase M42 family protein |
Protein accession | YP_003020611 |
Protein GI | 253699422 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
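The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 927761
end_bp = 928831

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1071 356
```

Under these assumptions the computed values agree with the listed Gene Length (1071 bp) and Protein Length (356 aa).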
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 109 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGACG CATCTTTCGA ATTCCTGCAG CAATTGCTGG CGGCGCCCAG CCCCTCGGGG TACGAGCAGC CTGCCCAGCG GGTCTTCCGC TCCTACATAG AGCCGTTTTG CCAGGTGGCG ACCGACGTCA TGGGTAACGT CTTCGGCATG ATTCAGGGCG CCGGAAATAA CCGCCCCCGC GTCATGGTCG TGGGGCACTC CGACGAGATC GGGCTCCAGG TCCGCTACCT GGACGACAAC GGTTTCATCT ATTTCTCCGC CATCGGCGGG GTCGATCCCC ACATAACGCC CGGGATGCGG GTCCACGTCC ACACCGCGAA GGGGAAGCTG AACGGCGTCA TCGGCAAGCG CCCCATCCAC CTGATCGAGC CCAAAGAGCG CGACACCGTC ATTAAGCTGG ACGCCCAGTA CATAGACATC GGCGCCGCCA ACAAGAAAGA GGCCCTGGAG TGGGTGCGGG TAGGCGATCC CATCACCTTC GACAGCAATC TGGAGCGGCT CTTCGGGGAC CGCGTCAGTT CGCGTGGTCT CGACGACAAG GCAGGCAGCT TCGTTGTCGC CGAGGTGCTC CGTCGCGTCT CCGAGCTTCC GGACCAGCTC CCCATCGACC TCTACGGCGT ATCCTCCGTC CAGGAGGAGG TTGGGCTTCG CGGCGGCACC ACCAGCAGCT ACTCGGTGAA CCCCGACGTC GGCATCTGCG TCGAGGTGGA TTTCGCCACC GACCAGCCCG ACGTGGACAA AAAGCACAAC GGCGAAGTAG GTCTAGGCAA AGGGCCGATC CTTCCCCGCG GCGCCAACAT CAACCCCGTC CTCTTCGACC TCCTCTCCGA CACCGCGACC GGCAACGGCA TCGCCGTGCA GTACACCGGC ATCGCCCGGG CGACCGGCAC CGACGCGAAC GTAATGCAGA TTTCGCGCGG GGGCGTCGCC ACCGCTTTGG TGAAGATCCC GCTGCGCTAC ATGCACACCC CGGTGGAGAC CCTGTCGCTT GCCGACCTGG ACGGGGCGGT CGAGCTGATT GTCGCCTCGC TCTCCAAGAT GGGGCACAAG GACGCGTTCA TTCCGATGTG A
|
Protein sequence | MRDASFEFLQ QLLAAPSPSG YEQPAQRVFR SYIEPFCQVA TDVMGNVFGM IQGAGNNRPR VMVVGHSDEI GLQVRYLDDN GFIYFSAIGG VDPHITPGMR VHVHTAKGKL NGVIGKRPIH LIEPKERDTV IKLDAQYIDI GAANKKEALE WVRVGDPITF DSNLERLFGD RVSSRGLDDK AGSFVVAEVL RRVSELPDQL PIDLYGVSSV QEEVGLRGGT TSSYSVNPDV GICVEVDFAT DQPDVDKKHN GEVGLGKGPI LPRGANINPV LFDLLSDTAT GNGIAVQYTG IARATGTDAN VMQISRGGVA TALVKIPLRY MHTPVETLSL ADLDGAVELI VASLSKMGHK DAFIPM
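The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, its GC fraction computed, and its coding region translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares; table 11's alternative start codons are not handled here, which is an assumption that holds for this gene because it begins with ATG.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in the record above.
gene_start = "ATGCGCGACG CATCTTTCGA ATTCCTGCAG"
print(translate(gene_start))  # MRDASFEFLQ
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and protein sequence in the record.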