Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0698 |
Symbol | |
ID | 8136013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 836273 |
End bp | 837760 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868315 |
Product | peptidase M16 domain protein |
Protein accession | YP_003020530 |
Protein GI | 253699341 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.000000000147325 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTTCAT TCGGAAACAG TTTGTTTAAC AAGAAAAGCC AAGCGAGGTT GACGCTCTTT GCAATACTGC TGGCCTTTAC CGCCGGCTGC GGCACCATGC ATGGAGGCGC TGCGAAAAGC GGCGCCCCGC AGGCACAACA GCTGGCGCAG CCCCGCAACA TGAGCTTCCC GCCGCTTAAC TTCAAGCTCC CCAAGAGCGA CCGGGTCCAG CTCAAAAACG GCATGATCGT CTACCTTTTG CAGGACCGCG AACTACCCAT CGTGAACCTG ACCGCGTACC TGAACGCCGG GAGCATCTTT GAGCCGAAGG AGAAGGTGGG GCTTGCCGCC CTGACCGGCG CGGTGCTGAG AAGCGGCGGG ACGCTGAAGA CCCCGCCCGA GCAGTTGGAC CGCGAGCTGG AGTTCATGGC CTCCTCGATC GAATCCGCCA TCAACTCCGA CCACGCTGGG GTTTCCTTCT CGACCCTGAG CGTCAACTTG GACAAGACCC TGTCGCTTTT CGCCGAGATC CTCAAGGAGC CGGCGTTCGA TCCGGCGCGG GTCGAGATTG CCAAGAGCCA TGCCCTTGAG GGGATCCGCC GACAAAACGA CGACCCCAAA CAGATCGCCG GCCGCGAGTT GGCGCGCGCC ATCTATGAGA ATCATCCGCT GGGGCGCATA CCGACCATCG CAACGGTGAA GGCCGTCACC CGCGAGGACA TGGTCGAGTT CCAGAAGCGC TATTTCTACC CCGCCAACAT GGTCCTGGCC GTCTCCGGCG ACTTCGACCG AAAGAAGCTT TTGCAAAGCC TCGAAAAGCT CTTCGCCGAC TGGCCCAACC GGACCGCCTC TCTCCCCCCG GTCCCGAAAC CAAGCGAGGA GCTGACCCCG GCTGTGCTGC ACGTGCAAAA GGACGTGAAC CAGTCGGTGA TCCGGATGGG GCACCTGGGT ATCGAAAAGA ACAACCCCGA CCTCTACGCG ATCAAGGTCA TGGACTATAT CCTGGGGGGC GGCTTCACTT CCAGGCTCAC CCAGGAGATC CGCTCGAACC AGGGGCTTGC CTACAACGTG GACAGCTACT TCGAGGTCGG GCGGCGCTTC AAGGGGTCGT TTGTGGCCGA GACCGAGACC AAGTCTGAAT CGACGGCCAA GGCGATCACG CTGCTCAGCT CCATCATCAC CGGCATGACC CAAGCGGAGG TCTCGGACGA GGAGCTGAAG CTCGCCAAGG ACTCCATCAT CAACTCCTTC ATCTTCGGGT TCGAGCGGAG CAGCGCGGTG GTGAACCAGC AGGCGAGGCT CGAGTTCTAC GGCTATCCGG ATGGGTACCT GGAGAACTAC CGCGACAACA TCGCCCGCGT CACCCGCGCC GACGTACTGA GGGTGGCCAG GCAGTACCTG CGCCCGGAAG CCATGAAACT GGTGGTGGTA GGAAACGAGA AGAAATTCGA CCGGCCACTC TCCCTGTTCG GGAAGGTGCA GGAAATAAAG CTGAACAACA ACAAATAG
|
Protein sequence | MISFGNSLFN KKSQARLTLF AILLAFTAGC GTMHGGAAKS GAPQAQQLAQ PRNMSFPPLN FKLPKSDRVQ LKNGMIVYLL QDRELPIVNL TAYLNAGSIF EPKEKVGLAA LTGAVLRSGG TLKTPPEQLD RELEFMASSI ESAINSDHAG VSFSTLSVNL DKTLSLFAEI LKEPAFDPAR VEIAKSHALE GIRRQNDDPK QIAGRELARA IYENHPLGRI PTIATVKAVT REDMVEFQKR YFYPANMVLA VSGDFDRKKL LQSLEKLFAD WPNRTASLPP VPKPSEELTP AVLHVQKDVN QSVIRMGHLG IEKNNPDLYA IKVMDYILGG GFTSRLTQEI RSNQGLAYNV DSYFEVGRRF KGSFVAETET KSESTAKAIT LLSSIITGMT QAEVSDEELK LAKDSIINSF IFGFERSSAV VNQQARLEFY GYPDGYLENY RDNIARVTRA DVLRVARQYL RPEAMKLVVV GNEKKFDRPL SLFGKVQEIK LNNNK
|
| |