Gene Information |
Locus tag | GM21_3226 |
Symbol | |
ID | 8138578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3741648 |
End bp | 3742880 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870830 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003023010 |
Protein GI | 253701821 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
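The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3741648
END_BP = 3742880
GENE_LENGTH_BP = 1233
PROTEIN_LENGTH_AA = 410


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1233
print(expected_protein_length(GENE_LENGTH_BP))  # 410
```

Both results agree with the record: 3742880 - 3741648 + 1 = 1233 bp, and 1233 / 3 = 411 codons, of which 410 encode amino acids.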
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 0.361793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGATCGA TAAAAAGCGG AACCAAGAGA TACCGGCAGA TCAACCTGGC TTTCTTCATG GCCGGCTTCG TCACCTTCAT CACGCTCTAC GACGTGCAGC CGCTTCTGCC CGAGTTCTCC AGGGAATTCG GGGTCGCCGC CGCCTGGGGG AGCCTGCCGC TCTCGATAAC CACCTGCGCC CTCGCCGTCG CCATGCTCTT CGCCGGTACC GTCTCGGAAA CGATGGGAAG GAAGAAGGTA ATGGTGGCCT CCCTTGTCGC GACCTCCCTC CTGGCGTTTC TCACCTCGTG GACCTCCACC CTCCCCGAAC TGGTCGCGGT GAGGCTGGTC CAGGGGATGG TGCTCGCGGG GCTACCGGCG GTGGCGATGG CCTACTTAAG CGAGGAGATC GCCCCCGCGT CCCTTACCTC CGCCATCGGC CTCTACATAA GCGGCAACGC CATCGGCGGC ATGACCGGAA GGATCTTCAC CGCCACCATG ACCGAATTCG TCTCCTGGCG CTTTGCCCTG GCCCTGATCG GGGTAGCGTG CCTGCTGCTG AGCCTGTACT TCGCCAAGTC GCTCCCCGCT TCGGAGAACT TCAAGCAAAG GCCCTTCGCC GCCCGTTACT TCTTCACCTC GCTCTTCAAG CAGTTGCAGG ACCCGGGGCT TCTCTGCCTC TACGGCATCT CGTTTCTCAT CATGGGAAGC TTCGTCACCC TCTACAACTA CATCACCTTC AGGCTCCTCG GTTCCCCCTA TTACCTGAGC CCGTCGCTGG TGAGCCTCGT CTTCCTGGTC TACATGCTCG GCTCCTTCAG TTCGTCCATG GTGGGGGGGC AGGTGGAGCG CTTCGGGAGA GGGCGCATGC TCTTTCTCAC CATCGCCACC ATGGTGGCCG GGGCGCTGAT CACGCTGGCC CGTGACCTCC CCACCGTCGT CGCCGGAATC GGCGTTTTCA CCTGCGGCTT TTTCGGCGCC CACACCATCG CTTCGGCCTG GGTCGGAAGC AGGGCGAAAA GCGCCCGGGC GCAGGCGGCG TCGCTCTACC TTTTCTTCTA TTACCTGGGG TCAAGCGTCT CCGGCACCGT CGGCGGGCTC TTCTGGGCCC GGCACGGCTG GGTGGGGGTG GTGCTGTTGA TCATGGGGCT TCTAAGCCTG GGGCTGTTCC TTTTGAAGGC GCTCACGCTC TGTTCGGAGG GAAGCTGCAA CGCCAAGAAC GCCGTCGCGA ATCTGGACGC GCTGCGCAGT TGA
Protein sequence | MGSIKSGTKR YRQINLAFFM AGFVTFITLY DVQPLLPEFS REFGVAAAWG SLPLSITTCA LAVAMLFAGT VSETMGRKKV MVASLVATSL LAFLTSWTST LPELVAVRLV QGMVLAGLPA VAMAYLSEEI APASLTSAIG LYISGNAIGG MTGRIFTATM TEFVSWRFAL ALIGVACLLL SLYFAKSLPA SENFKQRPFA ARYFFTSLFK QLQDPGLLCL YGISFLIMGS FVTLYNYITF RLLGSPYYLS PSLVSLVFLV YMLGSFSSSM VGGQVERFGR GRMLFLTIAT MVAGALITLA RDLPTVVAGI GVFTCGFFGA HTIASAWVGS RAKSARAQAA SLYLFFYYLG SSVSGTVGGL FWARHGWVGV VLLIMGLLSL GLFLLKALTL CSEGSCNAKN AVANLDALRS
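The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is read in frame from the first base and uses the codon assignments shared by the standard code and translation table 11 (the two tables differ in permitted start codons, not in elongation codons). The helper names `clean` and `translate` are hypothetical, and only the first three 10-base blocks of the gene sequence are used here.

```python
from itertools import product

# Compact codon table in TCAG order; identical amino-acid assignments
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block-separating whitespace and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGGGATCGA TAAAAAGCGG AACCAAGAGA")
print(translate(first_blocks))  # MGSIKSGTKR
```

The output matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `translate` should, under the same assumptions, reproduce the full protein sequence.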