Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1793 |
Symbol | |
ID | 8137124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2084434 |
End bp | 2085609 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869405 |
Product | hypothetical protein |
Protein accession | YP_003021605 |
Protein GI | 253700416 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3408] Glycogen debranching enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.0265229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCATG AACTGCTGAC GGAGTGCTAC CGCGAGGCGC TGGCGCTTTT GCGGGAGAAC TCGACGCCGG GCGGCATCCT TGCCTCGGGG AGGAACCAGA GGTCGGAGGG GCGCAACTAT ACCAGCATCT TCGGCCGCGA CGCCTCCATC TGCGCTTTGG GGATGGCGGT GTCGGGAGAC GCCGAATTGC GGCGCATAGC GGCGGAGGGC CTTCTGACCC TGGCCCGGTA CCAGGCCGGC AACGGCCAGA TACCGAAGTA TGTGAAGCCT GAGCTGGGGG AGGCGGACTT CTGGTACTCC GGCTGCATCG ACGCCACCTT GTGGTGGCTC ATCGCCATTG CCTTCATGGA CCGCGTTCTT CCCGAAGGGG AACTGGGAGA GCGGCTCGCC CCGCAGACCG GCCTCGCCCT TTCCTGGCTC CAGTGCCAGG AACACCAGGT CTGGCGCCTG CTCCAGCAAC TGGACGCAAG CGACTGGGCC GACATCATGC CGCGGGCGGG GTACGTCCTT TACACCAACG TCCTTTGGTA CTGGACCAAG ACCCTCTACG GCCTCCCCAG CGCGGCGGAG ACCAAAGAGT ACCTGAACAC GCTCCTCTCC CCCTTCGGCA GGGTGGTTCC CGCGCAAAAG CGCGCCCGCC TGCTGGTCCA TTACGTCCGA AACCGCTGCA AGCCGTCCCC CTTTTATCTC AGTTTCGTCA ACTTCTCCGA CTGGGGCGAG GAGATCGACA TCTTCGGCAA CATCATGGCG CACCTGGTCG GGGTGAGTCC CCCCTCCACC GGCGACAAGG CCGTTCAGGC ACTTTTGGCC CTGAAGGCGA ACGACCCCCA CCCCATCAGG GTGGTGGGAG ACCCGATCCG GCCCGGTTCC AGGCTTTGGC GCCCCTACAT GCAACGCCAC CGGCAGAACC TCGCCTGGCA GTACCATAAC GGCGGCGCCT GGCCCTTCGT CGGGGGATTC TGGGTGCTGC TACTGGCTCG CCTGGGGCGG ACCGCGCAGG CGTGGTCGGA GTTGGAGAAG CTGGCCCGGT CCAACCGGGT GAACGGTTGG GAGTTCAACG AATGGTTCCA GGGGGTGACA GGCGAGCCTA TGGGGATGCC CCGGCAGTCC TGGAACGCGG CTCTCTACGT CCTCGCCTAC CGCGCCTTAG CCGACGGCAC GCGCTACCTT CCCTGA
|
Protein sequence | MDHELLTECY REALALLREN STPGGILASG RNQRSEGRNY TSIFGRDASI CALGMAVSGD AELRRIAAEG LLTLARYQAG NGQIPKYVKP ELGEADFWYS GCIDATLWWL IAIAFMDRVL PEGELGERLA PQTGLALSWL QCQEHQVWRL LQQLDASDWA DIMPRAGYVL YTNVLWYWTK TLYGLPSAAE TKEYLNTLLS PFGRVVPAQK RARLLVHYVR NRCKPSPFYL SFVNFSDWGE EIDIFGNIMA HLVGVSPPST GDKAVQALLA LKANDPHPIR VVGDPIRPGS RLWRPYMQRH RQNLAWQYHN GGAWPFVGGF WVLLLARLGR TAQAWSELEK LARSNRVNGW EFNEWFQGVT GEPMGMPRQS WNAALYVLAY RALADGTRYL P
|
| |