Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0616 |
Symbol | |
ID | 8135931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 747631 |
End bp | 749574 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644868233 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003020448 |
Protein GI | 253699259 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 6.1691e-34 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAAAGAT ATGCAGCGCT ATCCTCGCTT CTTGCGGTGA CCCTGTTCTC CACCGGGTTC AACTGGCCGT TTCCTACGGG CAACGCCTGC CGCGACGCCA AGCGCATCAT CCTGGAACTC CCCCCGCAGG CGGGGGAGCA GAAGAGGAAG GAAGCCGAAA AGCGGGTGGC CGAACTCTGC CCGACCGGCC CGGCCGGACA TTACCTGAAG GGGCTCACTT TCGAGAGAAG CGGCAATGTC GACGCCGCCA TAAGCGAGTA CCGGGAGACC CTCTCCCTCG ACCCCGAGTT CTACCCCGCC AGCGGAAACC TGGGCCTTTT GCACCTGCAG AAGGGGGGCG GCGAGGAAGC TGCCGTGGAG CTTGCCGCGG GACTTAAGGC CGGGGACCCC CGCTACCACG CAGGGCTTGC CCGGGTCATG GCGGACAAGC AGATGCACCT GCTCGCCATC TTCCACTACA ACGAGGCGAT TGCCGCTTTC CCCGACGACG CTGCGCTTTA CACCGGCGTC GCGGCATCCT ACAACGCGGC GGGGCAGAAA CAGAAGGCCG AGGACGCCTA CCGCAGGGCC ATGGTCTTGC AGCCCGACAA CGCGCAGGCC CGCTTTGGCC TTGGCGCCCT TCTCCTGGAG CGTGGCGAGG TCGACAAGGC GGTCGGCGAG TTGAAGCTCG CGGCCATCGC CCAGCCGGCC AACAGGGAGA CACACCGGCT CTTAGCCGAG GCATACGCCC GCAAGGGTGA CGCGAAGAGC GCCCACTACG AGCGCGGGCT CGCCGGCATC GGCACGAAGC TGAAGGAGCT CCCCAAGGTC GACCACATGG CGCTGGCTGA AAAACACCGC CTGGCCAAAG ACCACGAGAT GGCGATCAGC GAGTACCGGA TGCGGCTGGC CGAGGAGCCC GACGACGCGC TGGCCCAGCA GCGCCTGGGG GACACCCTGC TCGCGGTGGG ACGCGAGGAC GAGGCGATGT CGTACTACCG CGACGCGCTG AGAAACAAGG CGGAAACCCC CGAGCTCCAT TTCAACCTGG CCGGGATCTA CGAACGCAAG GCGCTTCTGG ACGAGGCGGT GGTCGAGTAC CGTCAGGTAC TGGCATCGAA CCCCGACAAC CAGCATGCGC GGCAGCGCCT GGCCGATATC TACACGCTGC GCGGCAGCTT CAATCAGGCC CTCGAACAGT ACCAGGCGCT CATCAAGACG AACCCCGCCG ACCCGGCGCT GCAGCTGAAA CTCGCGCGCG CCTACGTCAA CAGCAAGGAG CTTGACGCCG CGGCCGAGGC CTACCAGGCA GCCCTCAAGC TGGACGGCGA GTCGGTGGAC GCCCACCGCG AGCTCGCCAA CCTGCAGAGA AAAAGAAACC TGATGGACGA GGCGGCCGCC GAATACCAGG AAGTGCTCAG GCTGAAAAAG GACGACCAGG AAGTCCGCAC CGCCCTCACC GCCATCTACG TGAAGAACAA GAACTACGAC GCCCTGGCTC AGCTCCTGAA GGACGGAGTG GAGCTCTCCC CGAACGACCC CAACGCGCAC TACAAGCTGG GACTGGTCTA CGAATTCCAG AAGGATTACA CCGCGGCCAC CGCCCAGTAC AAGGAAGCGG TGACCCTGAA GCCCGACCAT GCCAAGGCCT TGAACGCCAT GGGACGGGTC CAGATGAAGG ACGGCCACCT CGCCGAGGCA AAGGAGTCGC TCGAAGCGGC GAGGAAGGCG GACCCCGACC TGGAAGAAGC CCAGGTCCTT TTGAGCAACA TCAAGGACGA GTTCACGCCC GAGCCCAGGA GTTACCGAAA GCACAAGTCC TCCAACGGGA GCAAGGCCAA GAAAGGGAAG AAGGGGAAAA AAGGGAAGGA AGCGAAGAAA TCCAAGAAGA AGAACAGTGA CGACAAGCCT GCCAAGAAGT CGAAGAAGAA GAAAAAGTCC AAGAAGAAGA GTAAGGAAGA CTAA
|
Protein sequence | MKRYAALSSL LAVTLFSTGF NWPFPTGNAC RDAKRIILEL PPQAGEQKRK EAEKRVAELC PTGPAGHYLK GLTFERSGNV DAAISEYRET LSLDPEFYPA SGNLGLLHLQ KGGGEEAAVE LAAGLKAGDP RYHAGLARVM ADKQMHLLAI FHYNEAIAAF PDDAALYTGV AASYNAAGQK QKAEDAYRRA MVLQPDNAQA RFGLGALLLE RGEVDKAVGE LKLAAIAQPA NRETHRLLAE AYARKGDAKS AHYERGLAGI GTKLKELPKV DHMALAEKHR LAKDHEMAIS EYRMRLAEEP DDALAQQRLG DTLLAVGRED EAMSYYRDAL RNKAETPELH FNLAGIYERK ALLDEAVVEY RQVLASNPDN QHARQRLADI YTLRGSFNQA LEQYQALIKT NPADPALQLK LARAYVNSKE LDAAAEAYQA ALKLDGESVD AHRELANLQR KRNLMDEAAA EYQEVLRLKK DDQEVRTALT AIYVKNKNYD ALAQLLKDGV ELSPNDPNAH YKLGLVYEFQ KDYTAATAQY KEAVTLKPDH AKALNAMGRV QMKDGHLAEA KESLEAARKA DPDLEEAQVL LSNIKDEFTP EPRSYRKHKS SNGSKAKKGK KGKKGKEAKK SKKKNSDDKP AKKSKKKKKS KKKSKED
|
| |