Gene GM21_3226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3226 
Symbol 
ID8138578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3741648 
End bp3742880 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content63% 
IMG OID644870830 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003023010 
Protein GI253701821 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value0.361793 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCGA TAAAAAGCGG AACCAAGAGA TACCGGCAGA TCAACCTGGC TTTCTTCATG 
GCCGGCTTCG TCACCTTCAT CACGCTCTAC GACGTGCAGC CGCTTCTGCC CGAGTTCTCC
AGGGAATTCG GGGTCGCCGC CGCCTGGGGG AGCCTGCCGC TCTCGATAAC CACCTGCGCC
CTCGCCGTCG CCATGCTCTT CGCCGGTACC GTCTCGGAAA CGATGGGAAG GAAGAAGGTA
ATGGTGGCCT CCCTTGTCGC GACCTCCCTC CTGGCGTTTC TCACCTCGTG GACCTCCACC
CTCCCCGAAC TGGTCGCGGT GAGGCTGGTC CAGGGGATGG TGCTCGCGGG GCTACCGGCG
GTGGCGATGG CCTACTTAAG CGAGGAGATC GCCCCCGCGT CCCTTACCTC CGCCATCGGC
CTCTACATAA GCGGCAACGC CATCGGCGGC ATGACCGGAA GGATCTTCAC CGCCACCATG
ACCGAATTCG TCTCCTGGCG CTTTGCCCTG GCCCTGATCG GGGTAGCGTG CCTGCTGCTG
AGCCTGTACT TCGCCAAGTC GCTCCCCGCT TCGGAGAACT TCAAGCAAAG GCCCTTCGCC
GCCCGTTACT TCTTCACCTC GCTCTTCAAG CAGTTGCAGG ACCCGGGGCT TCTCTGCCTC
TACGGCATCT CGTTTCTCAT CATGGGAAGC TTCGTCACCC TCTACAACTA CATCACCTTC
AGGCTCCTCG GTTCCCCCTA TTACCTGAGC CCGTCGCTGG TGAGCCTCGT CTTCCTGGTC
TACATGCTCG GCTCCTTCAG TTCGTCCATG GTGGGGGGGC AGGTGGAGCG CTTCGGGAGA
GGGCGCATGC TCTTTCTCAC CATCGCCACC ATGGTGGCCG GGGCGCTGAT CACGCTGGCC
CGTGACCTCC CCACCGTCGT CGCCGGAATC GGCGTTTTCA CCTGCGGCTT TTTCGGCGCC
CACACCATCG CTTCGGCCTG GGTCGGAAGC AGGGCGAAAA GCGCCCGGGC GCAGGCGGCG
TCGCTCTACC TTTTCTTCTA TTACCTGGGG TCAAGCGTCT CCGGCACCGT CGGCGGGCTC
TTCTGGGCCC GGCACGGCTG GGTGGGGGTG GTGCTGTTGA TCATGGGGCT TCTAAGCCTG
GGGCTGTTCC TTTTGAAGGC GCTCACGCTC TGTTCGGAGG GAAGCTGCAA CGCCAAGAAC
GCCGTCGCGA ATCTGGACGC GCTGCGCAGT TGA
 
Protein sequence
MGSIKSGTKR YRQINLAFFM AGFVTFITLY DVQPLLPEFS REFGVAAAWG SLPLSITTCA 
LAVAMLFAGT VSETMGRKKV MVASLVATSL LAFLTSWTST LPELVAVRLV QGMVLAGLPA
VAMAYLSEEI APASLTSAIG LYISGNAIGG MTGRIFTATM TEFVSWRFAL ALIGVACLLL
SLYFAKSLPA SENFKQRPFA ARYFFTSLFK QLQDPGLLCL YGISFLIMGS FVTLYNYITF
RLLGSPYYLS PSLVSLVFLV YMLGSFSSSM VGGQVERFGR GRMLFLTIAT MVAGALITLA
RDLPTVVAGI GVFTCGFFGA HTIASAWVGS RAKSARAQAA SLYLFFYYLG SSVSGTVGGL
FWARHGWVGV VLLIMGLLSL GLFLLKALTL CSEGSCNAKN AVANLDALRS