Gene GM21_1093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1093 
Symbol 
ID8136415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1284524 
End bp1285744 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content66% 
IMG OID644868704 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003020912 
Protein GI253699723 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.0159699 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT CCGCCACAGG CAACCAGCCA GACGAACAAC ACCAAAACCC GGGAGAGCGC 
CAGGCGGCAC AGGTACTTAT CTGCTGCGCG ATCGGCTTCG TCTGTTTCCT GGGCGCCTAC
ATGCGGATTC CCGTCTTGCC GCTTCTGGCC AGCGGCATCG GCGCCAGCAC CACGCAGATC
GGCCTGATCA ACGCCGCCTT CATGCTTTCC GCCGGCCTCT TGGCCGTACC CTCCGGGTTG
ATATCGGACA AGGTCGGCAT CACCCCCGTC CTCCTTGCCG GGCTGGCACT GATGAGCGGG
GCGTCGCTGC TCATTCCCCT AAGCGGCAAC TGGGTCGCCC TGGGCGCCAT CTACCTCGTC
TTCGGCGTCG GGCTCGCCGC CTTCACCCCG ACCATGATGT CGTCGGTCGC GAGGATCGTC
CCGCGCTCGC ACGTCGGCCG CGCCTACAGT TGGTACACCA CCGCCGTCTA CCTCGCCATG
ACCATCGGCC CGGCCGCCGG CGGCTGGTTC GGGCAGAGGC TAGGGTTCAG GCAGGTCTTC
CTGGTTTCCG GCGTGCTGAT CGCCCTGGCG TTCATAGCGG TCCTCTGCTT TCTCCCCAAG
GAGCCCCACG CTCCCCACCG CCCGCACGCC GGCGACCCGC ATCCGCCACC CGGCATTTTC
TCCAACGGGC GGCTCATGGC CGCGCTCGCC GGAACCTTGG GCGGCTGTTT CGGCTTCGGC
ATGTACCTCT CATTTCTCCC CCTGCATGCG CGCGCCGCCG GCCTGAGCGT CGACCAGATA
GGAGTCGTCT TCGCGGCCCA AGCCTTGGTC AACGTCCTCT TGCGCATCCC GTTCGGGCAT
TTGAGCGACC GGATCGACCG CGGCACCATG TCCGGCATCG GCCTCGTCGT CTGCGCCGTT
GCCCTCGCTC TGACCGGCGC CAGCCACACC CTCGCAGCCA TGATCCTCAG CGCCTGCCTT
CTCGGTGCCG GGATGGGGAC AAGTTTCACT GCCCTGAGTT CGCTGGTGGC GATCGTGATG
CCTGCCGGTA GGCGCGGCCT GGGGATGGGG CTTTACAACA GCTGCATCTA TCTGGGGATG
ATGCTCAGTT CCGCCACCAT GGGGGTGGTG ATCAAAAGGA CCAGCTTTGC CACCGGGTTC
CTGGCCGCCG GTTGCATCAC CTTTGCGGCG ACCTTGCTGT TTCTACTCCT GTACCGCGGC
AGCACCTCGG CCGAAAAATG A
 
Protein sequence
MSDSATGNQP DEQHQNPGER QAAQVLICCA IGFVCFLGAY MRIPVLPLLA SGIGASTTQI 
GLINAAFMLS AGLLAVPSGL ISDKVGITPV LLAGLALMSG ASLLIPLSGN WVALGAIYLV
FGVGLAAFTP TMMSSVARIV PRSHVGRAYS WYTTAVYLAM TIGPAAGGWF GQRLGFRQVF
LVSGVLIALA FIAVLCFLPK EPHAPHRPHA GDPHPPPGIF SNGRLMAALA GTLGGCFGFG
MYLSFLPLHA RAAGLSVDQI GVVFAAQALV NVLLRIPFGH LSDRIDRGTM SGIGLVVCAV
ALALTGASHT LAAMILSACL LGAGMGTSFT ALSSLVAIVM PAGRRGLGMG LYNSCIYLGM
MLSSATMGVV IKRTSFATGF LAAGCITFAA TLLFLLLYRG STSAEK