Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1093 |
Symbol | |
ID | 8136415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1284524 |
End bp | 1285744 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644868704 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003020912 |
Protein GI | 253699723 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 0.0159699 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACT CCGCCACAGG CAACCAGCCA GACGAACAAC ACCAAAACCC GGGAGAGCGC CAGGCGGCAC AGGTACTTAT CTGCTGCGCG ATCGGCTTCG TCTGTTTCCT GGGCGCCTAC ATGCGGATTC CCGTCTTGCC GCTTCTGGCC AGCGGCATCG GCGCCAGCAC CACGCAGATC GGCCTGATCA ACGCCGCCTT CATGCTTTCC GCCGGCCTCT TGGCCGTACC CTCCGGGTTG ATATCGGACA AGGTCGGCAT CACCCCCGTC CTCCTTGCCG GGCTGGCACT GATGAGCGGG GCGTCGCTGC TCATTCCCCT AAGCGGCAAC TGGGTCGCCC TGGGCGCCAT CTACCTCGTC TTCGGCGTCG GGCTCGCCGC CTTCACCCCG ACCATGATGT CGTCGGTCGC GAGGATCGTC CCGCGCTCGC ACGTCGGCCG CGCCTACAGT TGGTACACCA CCGCCGTCTA CCTCGCCATG ACCATCGGCC CGGCCGCCGG CGGCTGGTTC GGGCAGAGGC TAGGGTTCAG GCAGGTCTTC CTGGTTTCCG GCGTGCTGAT CGCCCTGGCG TTCATAGCGG TCCTCTGCTT TCTCCCCAAG GAGCCCCACG CTCCCCACCG CCCGCACGCC GGCGACCCGC ATCCGCCACC CGGCATTTTC TCCAACGGGC GGCTCATGGC CGCGCTCGCC GGAACCTTGG GCGGCTGTTT CGGCTTCGGC ATGTACCTCT CATTTCTCCC CCTGCATGCG CGCGCCGCCG GCCTGAGCGT CGACCAGATA GGAGTCGTCT TCGCGGCCCA AGCCTTGGTC AACGTCCTCT TGCGCATCCC GTTCGGGCAT TTGAGCGACC GGATCGACCG CGGCACCATG TCCGGCATCG GCCTCGTCGT CTGCGCCGTT GCCCTCGCTC TGACCGGCGC CAGCCACACC CTCGCAGCCA TGATCCTCAG CGCCTGCCTT CTCGGTGCCG GGATGGGGAC AAGTTTCACT GCCCTGAGTT CGCTGGTGGC GATCGTGATG CCTGCCGGTA GGCGCGGCCT GGGGATGGGG CTTTACAACA GCTGCATCTA TCTGGGGATG ATGCTCAGTT CCGCCACCAT GGGGGTGGTG ATCAAAAGGA CCAGCTTTGC CACCGGGTTC CTGGCCGCCG GTTGCATCAC CTTTGCGGCG ACCTTGCTGT TTCTACTCCT GTACCGCGGC AGCACCTCGG CCGAAAAATG A
|
Protein sequence | MSDSATGNQP DEQHQNPGER QAAQVLICCA IGFVCFLGAY MRIPVLPLLA SGIGASTTQI GLINAAFMLS AGLLAVPSGL ISDKVGITPV LLAGLALMSG ASLLIPLSGN WVALGAIYLV FGVGLAAFTP TMMSSVARIV PRSHVGRAYS WYTTAVYLAM TIGPAAGGWF GQRLGFRQVF LVSGVLIALA FIAVLCFLPK EPHAPHRPHA GDPHPPPGIF SNGRLMAALA GTLGGCFGFG MYLSFLPLHA RAAGLSVDQI GVVFAAQALV NVLLRIPFGH LSDRIDRGTM SGIGLVVCAV ALALTGASHT LAAMILSACL LGAGMGTSFT ALSSLVAIVM PAGRRGLGMG LYNSCIYLGM MLSSATMGVV IKRTSFATGF LAAGCITFAA TLLFLLLYRG STSAEK
|
| |