Gene GM21_4126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4126 
Symbol 
ID8139500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4713666 
End bp4714859 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content64% 
IMG OID644871741 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003023899 
Protein GI253702710 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value0.641769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACG GAAATCTGTT CAAGTCGCTG TTTCTGATAA ATTTCGCCAC CAGCCTGGGT 
TTCGGCATAG CGGACGCATT CTTCTCCACC TACCTCTTCA GCCTGGGGGG ACGCGGGATC
CTGCTGGGGC TGCCGCTGTT TCTTTTCTCC CTTTCCAAGA TCCTCTTCGG TCCGGTGATG
GGCGCCTGCG TGGACCGCTT CGGCCCGAGG GGCGCCGTCA CGCTGAGCCT CACCCTCTAC
CTGATTGTTT CGCTTGGCTA CCTCTTCAGT TCGGACCTGG CGCTCATCAC GCTGCTGCGC
CTGGTTCAGG GGGTGGCCTG CGCCATGTTC CGACCGGTGA TGCTCTCCTT GGTCGGGGCC
GCGAGCGGCA CGAAAGGGGA GGGGCGAGCC GCGGGGACTT TCGACATCTC CTTCTACCTG
GCCGTTGGGG TGGGGCCGCT TCTGGGCGGG GTGCTGCATG ACCGGTGGGG CTTCTACGGC
ATCTTTTCCT GTCTGGCGCT ACTGTGTCTG CTGGCGTTGA CGGTTGCGCT GCGGGGCATC
CCTCGCCGTT GCGGTACGGC AATTCCCTCC GGGAGGCAAA CCGTCCGCGC GGCACTGCCG
GCCGCCCTTG AGGCGGCGCG CCACCGCCCC ATGAGAGGGC TGTTGGTCTT CATCTTCGGC
AGGGGGTGCG GCATATCGCT TCTGGCGGGG TTCCTTCCCA TCCTGCTGAA CGCGCGGCTT
GGTCTCAACG GCACCCAGAC CGGCATGGTG CTCGCCTCCA GTACCCTGGT GATCACCTCG
CTGCTGCGTC CGGTGGGCCG GCTCTCGGAC CGCCTCCCCC GCAAATCCCT GGTGCTCATG
GGAGGGGTCT CCGTGTCGCT CCTTTACTTC CTGATCCCTG TGGCGCAGGG GTTCCACCAG
GTGCTTATGC TGGGGGGAGG GATCGGGCTT TGCAGCGTGC TCTCCCAGCC TGCCAGTACC
GCGCTCCTTT TGGAGCAGGG GGAACGTCAC GGGACGGGTC TGGCGGTCGG GATCTTCAAC
ACGTCGCTTA ACCTTGGATT CGTGGCTGGA CCGCTTTTGG GGGGATGGCT GCAAAACCGT
TTGGGGCTAA CTGCCGTCTT CTATGCCGCC GGCTGGATCG GCCTGGCGGC AGTGGGGCTC
TTCGCTGCCA GTACCGTGGC GTGGGGGAAA AAGAGCTGCA TCGGGTCGGC TTGA
 
Protein sequence
MKNGNLFKSL FLINFATSLG FGIADAFFST YLFSLGGRGI LLGLPLFLFS LSKILFGPVM 
GACVDRFGPR GAVTLSLTLY LIVSLGYLFS SDLALITLLR LVQGVACAMF RPVMLSLVGA
ASGTKGEGRA AGTFDISFYL AVGVGPLLGG VLHDRWGFYG IFSCLALLCL LALTVALRGI
PRRCGTAIPS GRQTVRAALP AALEAARHRP MRGLLVFIFG RGCGISLLAG FLPILLNARL
GLNGTQTGMV LASSTLVITS LLRPVGRLSD RLPRKSLVLM GGVSVSLLYF LIPVAQGFHQ
VLMLGGGIGL CSVLSQPAST ALLLEQGERH GTGLAVGIFN TSLNLGFVAG PLLGGWLQNR
LGLTAVFYAA GWIGLAAVGL FAASTVAWGK KSCIGSA