Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1590 |
Symbol | |
ID | 8136921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1854802 |
End bp | 1856406 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869203 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003021403 |
Protein GI | 253700214 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTTT CCCGCTGTCT CCCCCTCCTT CTTGCCCTCC TAGTCCTTGC CGGTTGCGCC AAGGGTGCGC CTGAGGACGC GGCGCGCGCG CAGGGTGGGG GCGCTTCCCG CGGCAGTTCC TTTGTCACCG GGAGCATCGG GGAGCCTTCC ACGCTGATAC CGATACTCGC CAGCGATTCC GCCTCCTTCC AGGTGGCCGG TCTGGTCTAC AACGGGTTGG TGCGTTACGA CAAGAACCTG AAGCTGGAGG GGGATCTGGC CCAATCGTGG GAGGTTTCTC CCGACGGTTT GGGCATCACC TTCCACCTGC GTCGCGGCGT CAAGTGGCAC GACGGCCACG ACTTCACCTC CCGTGACGTC CTCTACACCT ACAAGGTGAC CATAGACCCC AAGACGCCGA CCGCGTATGC GGAGGACTTC AAGCAGGTGC TGTCCGCACA GGCGGTGGAT ACTTACACCT TCAAGGTGCG CTACGCCAAG CCCTTCGCGC CGGCGCTGGC CTCGTGGGCG TCCATGTCGG TCCTGCCTGC ACATCTTTTG GAAGGGAAGG ACATCACCAA GAGCCCGCTC TCCAGAAAGC CGGTCGGCAC CGGTCCCTAC ATCTTCAAGG AGTGGGTAGC GGGACAGCGG GTGATCCTGG AGGCAAACCC GCATTATTAC GAGGGTGCGC CGCACCTCTC CCCCTACGTC TATCGCATCA TCCCGGACAA CTCCACCATG TACATGGAGC TCAAGGCGGG CGGCATCGAC ATGATGGGGC TTTCCGCGGT GCAGTACCAG CGCCAGACCG GAAGCCGAGA GTTCCTGTCC CGCTTCAACA AGTACCGCTA CCCGGCCTCC GCCTACACCT ATCTCGGCTA CAACCTCAGG CTCCCGATGT TCCAGGACGT CAGGGTGCGC CGCGCGCTCA CCTGCGCCAT CAACAAGGAA GAGATCATTC AGGGGGTCCT GCTCGGGATG GGGCAGGTAG CACACGGCCC CTACAAGCCG GGCACCTGGG CCTGGAAGCC CACGATCGGG GCGGACCCCG GCTACGACCC GGCCCGCGCC GCTGCACTCT TGAAAGAAGC GGGGTACGTC ATGGGGCAAG ACGGCATCCT GGTCAAGGAC GGCAAGCCGC TGAGCTTCAC CATCATGACC AACCAGGGAA ACGACGAGAG GCTCAAATGC GCCCAGATCA TACAAAGGCG CCTGAAGCGG GTCGGCATCG ACGTGAAGAT CCGCGTCATG GAATGGGCCT CCTTCCTCAC CAACTTCATC GACAAGGGGA GGTTCGAGGC GGTGCTGCTC GGCTGGACCA TCTCGCAGGA CCCCGACCTG TACGACGTCT GGCACTCTTC CAAGACCGGT CCCAAGGAGC TCAATTTCGT CGGCTACAAG AACCCGGAGC TGGACCGCCT GATCGTCGAG GGGCGCGGCA CCTTCGATAT GGCCAAACGG CGCGAGAGCT ACTACCGGCT TCAGGAGATA CTGGCGCAGG ACCAGCCCTA CACCTTCCTT TACGTCCCGG ACGCGCTGCC GGTGGTCGCC TCCAGGATCA AGGGGATTGA GCCGGCGCCT GCGGGGATCA GCTACAACTT GATCAAGTGG TATGTAGAAC AATGA
|
Protein sequence | MTFSRCLPLL LALLVLAGCA KGAPEDAARA QGGGASRGSS FVTGSIGEPS TLIPILASDS ASFQVAGLVY NGLVRYDKNL KLEGDLAQSW EVSPDGLGIT FHLRRGVKWH DGHDFTSRDV LYTYKVTIDP KTPTAYAEDF KQVLSAQAVD TYTFKVRYAK PFAPALASWA SMSVLPAHLL EGKDITKSPL SRKPVGTGPY IFKEWVAGQR VILEANPHYY EGAPHLSPYV YRIIPDNSTM YMELKAGGID MMGLSAVQYQ RQTGSREFLS RFNKYRYPAS AYTYLGYNLR LPMFQDVRVR RALTCAINKE EIIQGVLLGM GQVAHGPYKP GTWAWKPTIG ADPGYDPARA AALLKEAGYV MGQDGILVKD GKPLSFTIMT NQGNDERLKC AQIIQRRLKR VGIDVKIRVM EWASFLTNFI DKGRFEAVLL GWTISQDPDL YDVWHSSKTG PKELNFVGYK NPELDRLIVE GRGTFDMAKR RESYYRLQEI LAQDQPYTFL YVPDALPVVA SRIKGIEPAP AGISYNLIKW YVEQ
|
| |