Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3233 |
Symbol | |
ID | 8138585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3751792 |
End bp | 3753009 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644870837 |
Product | peptidase U32 |
Protein accession | YP_003023017 |
Protein GI | 253701828 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTT CCGTCGCCAC AAACTTCCAG CCCGACCTGA TTCCCGCCAT CAAGGGATAT CCGGTCGCCG AACTTTTCGG CAAGCTACCC TCCGACAGCG TGGGGGGCGG CCGCGCCTCG TTCATGCTGG CCCCCCTGGG ACAGGAGCAG TTCCGGGCCC ACGTAAGGGA AGCGGGGAAG AACGGGGTCG GCTTCAACTA CCTGATCAAC CCAGCCTGCA TGGATAACCG CGAGTTCACG CGCCAAGGGC AGGCGGCTCT CGACCAGCTC CTCGATTTCG TGGACGGCTG CGGCGTCACC GCGGTCACCG TCTCGCTTCC TTTCCTGCTG CCGATCATCA AGAAACGGCA TCCGCGGCTC AAGGTGCGGG TCGGTGTCTA TGCCCGCGTC GACTGCGTGG CGAAGGCCCG CTTTTGGGAG GATCTCGGGG CGGACTGCGT CACGCTGGAA TCAATCGCCA TCAATCGCGA CTTCGGCATG CTGCAGGCGA TACGTCAGGC GGTGCAACTG GAGCTGCAAC TCATCGCCAA CTCCAACTGC ATGATCTTCT GCCCGCTCTC GGGGCAGCAC ATGGTGAACC TCTCGCACGC CTCGCAAAAG GGGCACGCCA GCCGCGGCTT CATGATCGAT TACTGCGCGC TCAGGTGCTC TGCGCAGAAA CTGGCCGACC CTTCCCTGTA CCTTCGTTCC GAGTTCATCC GGCCGGAGGA TCTGGGGAGC TACACCGAAC TTGGCTTTAA CTCCTTCAAG ATACTTGAGC GCGGCGCACC GACCCCGGTC CTCGTCGAGC GGGTCCGCGC CTACAGCGAA GGAAGGTTCT CGGGAAACCT GCTGGACCTG ATCCAGCCCT ACGGCTACAA GCGCACCCCG GGCAAGGTGA AGGGGCGGTT AAGCGGCCTG CGCAGGTTTG CCCGGTACTT CCTGCGCCCC GGCGTGATCA ATCTCGCCGG GTTGGTCCGG CTGAAACGGC TGGCGGAGAA GCGGGGGCTC ATCGAGGAGC TGGAAGGAGC CCCGGTCTAC CTGGACAACG CCAAACTGGA CGGGTTCCTG GCGGGCTTCC GGGCGATCGA CTGCCGGAAG ACGGACTGCG CACGCTGCGG CTACTGCGCC GCCTGGACCG AGAAGGCGCT GCGGCTGGAC CCTGTCTACC GCGACGAGAT GCTGCGACTG TACCGCGACG CCTTCGATGA GATGTATTCC GGCGAATTAT GGGACTGA
|
Protein sequence | MKFSVATNFQ PDLIPAIKGY PVAELFGKLP SDSVGGGRAS FMLAPLGQEQ FRAHVREAGK NGVGFNYLIN PACMDNREFT RQGQAALDQL LDFVDGCGVT AVTVSLPFLL PIIKKRHPRL KVRVGVYARV DCVAKARFWE DLGADCVTLE SIAINRDFGM LQAIRQAVQL ELQLIANSNC MIFCPLSGQH MVNLSHASQK GHASRGFMID YCALRCSAQK LADPSLYLRS EFIRPEDLGS YTELGFNSFK ILERGAPTPV LVERVRAYSE GRFSGNLLDL IQPYGYKRTP GKVKGRLSGL RRFARYFLRP GVINLAGLVR LKRLAEKRGL IEELEGAPVY LDNAKLDGFL AGFRAIDCRK TDCARCGYCA AWTEKALRLD PVYRDEMLRL YRDAFDEMYS GELWD
|
| |