Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0168 |
Symbol | |
ID | 8135471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 200093 |
End bp | 201280 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644867787 |
Product | beta-lactamase domain protein |
Protein accession | YP_003020011 |
Protein GI | 253698822 |
COG category | [C] Energy production and conversion |
COG ID | [COG0426] Uncharacterized flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.000343113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTGGAG CTGTTGAGTT AGGTAAGGGC CTTCATTGGA TCGGGGTCAA GGACCCCAGC CTCACCGTCT TTGACGACCT CTTTCCCACA GAGTACGGCA CTACCTACAA CTCGTACCTG GTGCAGGGGG ATACCCACAC CGCCATCATC GACACGGTGA AGAGGAAGCG CTTCGACGAG TTCCTCGCGA ACATCCGCTC CATCACCGAT CTCTCCAAGA TCGACTACAT CGTGGTGAAC CATTCCGAGC CGGACCACTC CGGTTCCCTC GCGCTCTTAC TCGAGCAGTG CCCCCAGGCC ACCGTGGTCT CCAGCCAGGC CGCCCGCACC TTTCTCGGCA ACCAGATCCA CACCCAGTTC GACTCCCGCA TCGTCAAGGA CAACGACACG CTGGAATTGG GGGGGCGTAC GCTGCGCTTC ATCGCCGCGC CGTTTTTGCA CTGGCCCGAC ACCATGTTCA CCTTGCTCCA GGAGGAGGGG GTGCTCTTTC CCTGCGATGC CTTCGGTTCT CACTACTCCG GCGAGGGGAT CTTCGCCGAC GAGATGCCCG ACTTCTCCGG GGAGACCCGT TTCTACTTCG ACTGCATCAT GCGTCCCTTC AAGGAGCGCA TCCTGCAGGC GGTCGCGAAG CTCGACGACG TCGAGCTGAA GATGCTCTGC CCGAGCCACG GCCCCATCTA CCGCAGCGAC GCCAGGAAAC CCGTGGAGCT GTACCGGAAG TGGTCCATGC CCAAGGCAGC GGGTCGCCGC ATCGCCATTT TCTACATCTC CCCGCACGGC AACACCGAGC AGATGGCCGA GGCGGTCGCC AAGGGTGCCG GCGCGGCTGG CGTCCACGTC ACCCTGTGCC ACATAAACCA CGCTTCCGTC GCCGACATCC GGGATCTGAT GGAGGAGTGC GACGGCCTCA TCTTCGGCAC CCCCACCATC AACCGCGATA TCCCGAAGCC GATGTGGGAC GTCCTCGCCT ACCTCTCCAC GGTAAGCCTG AAAGGGAACA TCGGCGGCAT CTTCGGCAGC TTCGGCTGGA GCGGCGAGGC GTGTCGGATG CTCGAAGAGA GGCTCAAGAG CCTGAACTTC AAGCTCCCCG CCCCCTTCGT CCGTGCCCCC TTCATGCCCA AGGCCGAGGC GCTCGCCGAA TGCGAGGCGC TGGGGCGCGC GGTGGCCGAA GAGGTCCTCA AGAAATAG
|
Protein sequence | MSGAVELGKG LHWIGVKDPS LTVFDDLFPT EYGTTYNSYL VQGDTHTAII DTVKRKRFDE FLANIRSITD LSKIDYIVVN HSEPDHSGSL ALLLEQCPQA TVVSSQAART FLGNQIHTQF DSRIVKDNDT LELGGRTLRF IAAPFLHWPD TMFTLLQEEG VLFPCDAFGS HYSGEGIFAD EMPDFSGETR FYFDCIMRPF KERILQAVAK LDDVELKMLC PSHGPIYRSD ARKPVELYRK WSMPKAAGRR IAIFYISPHG NTEQMAEAVA KGAGAAGVHV TLCHINHASV ADIRDLMEEC DGLIFGTPTI NRDIPKPMWD VLAYLSTVSL KGNIGGIFGS FGWSGEACRM LEERLKSLNF KLPAPFVRAP FMPKAEALAE CEALGRAVAE EVLKK
|
| |