Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2012 |
Symbol | |
ID | 8137346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2334936 |
End bp | 2336177 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869625 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003021822 |
Protein GI | 253700633 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00000000000000197881 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGCAG AACAGATCTC TCTTTCCTAC CGAAGATACG CGCTGGGACT GCTCCTGGCG GTCAACCTGC TCAACTACAT CGACCGGCAG GTTCTCTTCG CGGTATTCCC CCTGATCAAG GCCGACCTCA ATATAAGCGA CACCGAACTG GGGCTTCTCG GCAGCGCTTT CATGCTGAGC TACATGGTCA TCGCTCCGGT CTTCGGCTGG CTTGGGGATC ACTGGGACCG GGTCAAGCTC GCCTCGTCCG GCGTCGTGGT CTGGAGCCTG GCGACCGTCC TCGCCGGTTT CGCGCCCGGC TACCGGACGC TCCTTGCGGC GCGGGCCACC GTCGGGGTAG GGGAGGCGAG CTTCGGGACC GTCTCCCCAG GGCTCATCGC CGACTTCTTC CCGAAGGATC AGCGCGGCCG CATCCTTTCC TGGTTCTACG TGGCGATTCC CGTGGGGAGC GCCATGGGTT ACCTCCTGGG GGGTGTCTTG GGGCACCGTT TCGGGTGGCA CGCCGCCTTC CTCATGGTGG GCCTGCCGGG AATCCTCCTC GCGCTGCCGC TTTGGTTTTT GCGCCCCCCG GTGCGCGGGG GTAAAAGGGC CACCGAGCAG GTCGCCGGGG AGAAGGGGAT GGCTGCCTAC CTGCAGCTCT TCAGAAACCG CGCCTTCGTC ACCAACACCC TCGCCATGGC GGCCATGACC TTCGCCATAG GAGGACTGGC GCAGTGGATC CCGACCTTCC TGTTCAGGGC CCACGCCCTC GACGTCGAGA AGGCCAACAC CCTCTTCGGG GCCACCACGG TGCTGGCGGG GATAATGGGG ACCCTGGCCG GTGGGTGGCT CGGTGACCGC TGGCAGAAAA AGAGCAGCAA GGGATACCTG CTCGTTTCGG GATGGGGGTT CTTCATCGGC GCCCCCTTCG CCGCCTGGGC CATCATGGCG CCGGCGCTAC CGGTCTGCAT GGCCGCCATC TTCGTGGCCG AGTTCTTCCT CTTCCTTAAC ACCGGCCCGC TCAACACGGT GATCATCAAC GTGACGCGCC CCGCCGTGCG CGCCATGGCC TTCGCGGTGA ACATCTTCTT CATCCACGCC CTGGGCGACG CCGTCTCGCC CTCTATGCTG GGGTGGCTTT CCGATCAGTG GGGGCTGAGG CTCGCGCTTC TCTCCACCCC GCTGGTGATG GCGCTGGCAG GTGTGTTCTG CTTTGTCTGC GGCAGGTACG TGGCGCACGA CATGGCGCAG GCCGAGCCCT GA
|
Protein sequence | MQAEQISLSY RRYALGLLLA VNLLNYIDRQ VLFAVFPLIK ADLNISDTEL GLLGSAFMLS YMVIAPVFGW LGDHWDRVKL ASSGVVVWSL ATVLAGFAPG YRTLLAARAT VGVGEASFGT VSPGLIADFF PKDQRGRILS WFYVAIPVGS AMGYLLGGVL GHRFGWHAAF LMVGLPGILL ALPLWFLRPP VRGGKRATEQ VAGEKGMAAY LQLFRNRAFV TNTLAMAAMT FAIGGLAQWI PTFLFRAHAL DVEKANTLFG ATTVLAGIMG TLAGGWLGDR WQKKSSKGYL LVSGWGFFIG APFAAWAIMA PALPVCMAAI FVAEFFLFLN TGPLNTVIIN VTRPAVRAMA FAVNIFFIHA LGDAVSPSML GWLSDQWGLR LALLSTPLVM ALAGVFCFVC GRYVAHDMAQ AEP
|
| |