Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1999 |
Symbol | |
ID | 8137333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2316840 |
End bp | 2318132 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644869612 |
Product | amidohydrolase |
Protein accession | YP_003021809 |
Protein GI | 253700620 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 1.38809e-26 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGACC AAGTAACCAG CGGCACTCTC ACCGGCATGG TTGAAGCGGA ACTCCCCTCC CTGTCCGCCA TCTACCGGCA GCTGCACGCT AAGCCCGAAC TGTCGGGGCA GGAGGAGCAG ACCGCGGCAC TTGTCGCCGC CGAACTGCGC GCTCTGGGCT ACGGCGTGAC CGAAGGGGTG GGAAGGTACC GGAATTACGA CTGGCCTGGC TACGGCGTCG TCGCCGTTCT TTCCAACGGC GTAGGGCCGA CCGTGCTGCT TCGGGCCGAC ATGGACGCGC TCCCCGTTCA GGAAAAGACC GGCCTTCCCT ACGCCAGCCG GGTCAAGGGG ACCTACCGCG ACGGCAACGA GGTGCCGGTC ATGCACGCCT GCGGGCACGA TATCCATGTC ACGGTCTTAT TGGGCGTGGC GCGGGTGATG GCGCGGCTCA GGGAGAGTTG GCAGGGATCC CTGGTTTTGG TCGCTCAACC CGGCGAAGAA GGGGGGGGCG GGGCGGACGC CATGTTGGAC GACGGCGCCT ATGGCCACTG CCCAAAGCCC GACTTCGCCC TGGCGCTGCA CAGCACGCTG CACCTTAAGG CCGGCAGCGC CGGGTACGCC CCGGGGAACT TCATGGCGAG CTTATGCGAA CTCGAGGTAG TGGTCCGCGG CGTCGGCTCC CATGGCTCGG CCCCCGAGTG CGGCAGGGAT CCGGTCGTCA TGGCCGCCCA ACTGGTGCTC GCCCTGCAGA CCGTCGTCAG CCGGGAGAAA AACCCCAGCG AACCGGCAGT CTTGAGCGTC GGCTCCATTC ACGGCGGGGC AGCCAGCAAC GTGATACCGG ACGAGGTCGT CCTGCAACTC AGCGTCAGGA CCTATGACGA CGGCGTTCGC GACAGTATCG TCGAATCGGT GCGCCGGATG GCCGCTGGCG TGGCGCTGGT CGCCGGGGTG CCCGAGGACC GGTCTCCGCT CGTGAAGGTG AAGGCCTCCC ATCCTGCGAT CTACAACGAC CCGGAACTGG CCGAGCGGGT GGCGGCCTCG CTGCGCCTGG CCTTAGGCCC CGGCAACGTC TACCGCAGCG AACCGAAGAT GGTGAGCGAG GATTTCGGAT CCTGGAGCCT GGAAGGGGAG ATTCCCATCT GCATGTTCTG GCTGGGCGCA GCCGATCCCG AAAAATTCGA GGCAAGCCGC AAATGCGGAG TGCCGCTTCC TTCGCACCAT TCTCCTCTGT TCGCCCCACT CCCGGAGCCG ACCATTCGGG CGGGTGTTGC CGGCCTGGCC ACGTCGGCAC TTGACCTCTT CAAGAGCAGG TGA
|
Protein sequence | MNDQVTSGTL TGMVEAELPS LSAIYRQLHA KPELSGQEEQ TAALVAAELR ALGYGVTEGV GRYRNYDWPG YGVVAVLSNG VGPTVLLRAD MDALPVQEKT GLPYASRVKG TYRDGNEVPV MHACGHDIHV TVLLGVARVM ARLRESWQGS LVLVAQPGEE GGGGADAMLD DGAYGHCPKP DFALALHSTL HLKAGSAGYA PGNFMASLCE LEVVVRGVGS HGSAPECGRD PVVMAAQLVL ALQTVVSREK NPSEPAVLSV GSIHGGAASN VIPDEVVLQL SVRTYDDGVR DSIVESVRRM AAGVALVAGV PEDRSPLVKV KASHPAIYND PELAERVAAS LRLALGPGNV YRSEPKMVSE DFGSWSLEGE IPICMFWLGA ADPEKFEASR KCGVPLPSHH SPLFAPLPEP TIRAGVAGLA TSALDLFKSR
|
| |