Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1589 |
Symbol | |
ID | 8136920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1853087 |
End bp | 1854805 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869202 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003021402 |
Protein GI | 253700213 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 0.499739 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGA AGTGTCTAGC GCTCCTTTCC CTCACTTTGT TAATTAATGC CTGCGCCAGC GAGCACGCCA CCGGCACGAT CCCTTCGGCC GGACTTGAGG CTGTTGCGGC CCCGTCGGCC GGTCCCGGAG AGGGGCGCAC CATGTATCTC TTCGCCTTGG CTCGCCTGCG TGCCGGCGAG GGGGACCAGG ATGCGGCGCT CGCGCTTTTG CGCCAGGCGA TGGCATCCGA CCCGGGGGCG GCCTACCTGC ACACGGCGGC GGCCCAGTAC CTGCTGCAGC AACATAAACC CGAAGAGGCG CTGGCCGAAA GCCAGGCCGC GATCAAGATC GACCCCACTT TCCTTCAGGC GCAGCTTCTG TCGGGGAACA TCCTGATGAC CATGCAGCGC GAGAAGGAGG CCATCCCCTA TTACAAGAAG GTGATGGAGC TCGACCCGAC CAAGGAAGAG GTCTACCTCC ACGTCGCCAT CTACTACCTG AAGAGTTTCG AGTACGAGCA GGCGGTCGAC ACCCTGAAGG GGTTGGTCAA GGCTGCGCCC GACTCGGCGC TTGGCTATTA CTACCTCGCC AAGACCTACG AGCAGATGCG TCTGCCGCGC GAGGCGCTCG GCTACTACAA GAAGGCCCTC GACTTGAAGC CGGACTTCGA GCAGGCGCTG ATCGAGATGG GGATCTCGCA GGAGACCCAG GGGCTCATCC CCGACGCCAT CGAGAGCTAC AAGGGGCTTC TCGATATCAA CCCCGCCAAC GCCAACGTCG TGCAGCACCT GGCGCAGCTC TACATCCAGC AGAAGCGGCT TAGCGAGGCG CTCGCCCTGT TGCAGGAAAA GGGGGGGAAG ACGCTTGAGA ACTCCCGCAA GATCGGGCTC TTGTTCCTGG AGCTTGAGCG CTACGACGAT GCGGTGAAGA CCTTTCAGGA GATCCTCGAC GTAGAGCCTG CCGCCCAGCA GGTCCGCTTC TACCTCGCCA CCGCCTACGA GGAGAAGGAG GACGCCGACC GGGCTATCGC CGAATTCCTG AAGATCCCCA AGGAGTCTCC CTACTACCCC GACGCCGTAG GTCACTTGGC CTACCTGTAC AAGGAGAAGG GGACCCCGGA GAAGGGGATC GCCCTTTTGA AGGAAGAGAT CAAGGATCAA CCGGCGCGGA TCGAGCCTTA CCTGCATCTT GCCGGCCTCT ACGAATCGAT GGAGCGCTAC AAGGAAGGGG TCGACACGCT GAACTCGATG GACGACAAGC TCAAGAACGA CCCCCGCGTC CTGTTCCGCC TCGGCATCCT GTACGACAAA GTCGGGCAGA AAGAGCAGTC GGTCGCCATG ATGAAGCGCG TCATCGCCGT GAACCCGAAC GACGCCAACG CCCTGAATTA CCTTGGGTAC ACCTACGCGG AGATGGGGGT GAACCTGGAG GAGGCGCTTT CCTACCTGAA GAAGGCGGTC GAGCTGAAGC CGGACGACGG CTTCATCCTG GACAGCCTCG GCTGGGCCTA TTACAAGCTG AAGCGCTACA ACGAGGCGGT CGCCCAGCTG GAGCGGGCAG CGGAGCTCTC CGACCAGGAC GCAACGGTGC TCGGCCATCT CGCCGACGCC TACTGCGCCG CGCGCGCCTA TAAGAAGGCG CTCCAGCTGT ACCGGAAGCT GCAGAAGCTG GAGCCCGAGC AAAATGCCGA GCTCGCCGAG AAGATCAGGC ACTGCCGCCA GGAGAGCGGG GAGAAATGA
|
Protein sequence | MTKKCLALLS LTLLINACAS EHATGTIPSA GLEAVAAPSA GPGEGRTMYL FALARLRAGE GDQDAALALL RQAMASDPGA AYLHTAAAQY LLQQHKPEEA LAESQAAIKI DPTFLQAQLL SGNILMTMQR EKEAIPYYKK VMELDPTKEE VYLHVAIYYL KSFEYEQAVD TLKGLVKAAP DSALGYYYLA KTYEQMRLPR EALGYYKKAL DLKPDFEQAL IEMGISQETQ GLIPDAIESY KGLLDINPAN ANVVQHLAQL YIQQKRLSEA LALLQEKGGK TLENSRKIGL LFLELERYDD AVKTFQEILD VEPAAQQVRF YLATAYEEKE DADRAIAEFL KIPKESPYYP DAVGHLAYLY KEKGTPEKGI ALLKEEIKDQ PARIEPYLHL AGLYESMERY KEGVDTLNSM DDKLKNDPRV LFRLGILYDK VGQKEQSVAM MKRVIAVNPN DANALNYLGY TYAEMGVNLE EALSYLKKAV ELKPDDGFIL DSLGWAYYKL KRYNEAVAQL ERAAELSDQD ATVLGHLADA YCAARAYKKA LQLYRKLQKL EPEQNAELAE KIRHCRQESG EK
|
| |