Gene Gdia_1705 details


Gene Information

Locus tag: Gdia_1705
Symbol:
ID: 6975120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1886349
End bp: 1888010
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 64%
IMG OID: 643391232
Product: major facilitator superfamily MFS_1
Protein accession: YP_002276089
Protein GI: 209543860
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
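
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; both are common conventions for records like this one but are not stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1886349, 1888010)
print(length, protein_length(length))  # prints: 1662 553
```

Under those assumptions the computed values match the Gene Length (1662 bp) and Protein Length (553 aa) fields.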


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCACG CTGATGCGCT GACACGTCCG CGCCGCGAAG TGCTGCCCGC GCCCGTGTCT 
ATGCAACCGG AACCACCTCC GGCGCCGCCG AAGGCGGCGG TGTTCGGTCC GCGCCTGGCG
ACGGGGCTGG CGGGCGTGCT GCTGGCAGTC CTGCTGGCGG GGTTCAACGA GCACACGACC
GAGGCTGGGC TCGCCGATAT CCGGGGCGCA TTCCATCTCG GGCATGACGA GGGGACCTGG
ATCACCGCCC TGTTCGAGGC GTTCAACATC GCTGCGATGG CGTTCGCTCC CTGGTGTTCG
GTGACGTTCT CCATCCGACG CCTGACGATC GCGATGACGG GGCTAGTGGG CGTGCTGGGC
ACGGCGGCCC CCTTCATGCC GAACCTGTCG TCTCTTCTTC TGCTGCGCTG CGTGCAGGGA
TTGGCTTGCG GTAGCCTGCC GCCAATGCTG ATGACGGTCG CGCTGCGCTT CCTGCCGCCC
AATATCAAGA TCTACGGTCT GGGCGCCTAT GCTCTGACCG CGACTTTCGG GCCGAATATT
GGCGGCGCGC CGCTGGCGGG ATTCTGGTTC GAGTATGTCG GCTGGCCGTT CCTGTTCTGG
CAGATCGTGC CGCTGTGCCT GATCTCGATG GTGTGCGTGG CATGGGGCTT GCCACAGGAT
CCGATCCGGT TGGAGCGTTT CCGTCAGTTC GACTGGCGCG GCCTGCTGAC CGGGTTGCCC
GCGATCTGCA TGCTGGTGAT TGCCCTCGAA CAGGGCGACC GGCTGGACTG GTTTCGTTCG
CCGTTGATTA CCCACCTCTT CTTCGGCGGC GGGTTCCTGT TCGTGCTCTT CATCGTCAAT
GAATGGTTCC ATCCGGTGCC GTTCTTCCGC ATCCAGTTGC TCAAGCAGCG CAACGTGACG
GATGCACTGC TGACGCTGGC AGCGGTTCTG GTATTGGGGG CGGTCACGGC GGAAATCCCT
GCGGTCTATC TGGAGGAGGT GCGTGGCTAT CGCCCGATCC AGATCGCGCC GGTCATGCTG
ATCCTCGCGG TACCACAAGT GCTCGCGCTG CCGCTGGTCT CGGCCTTGTG CAATATCCGC
CGGGTGGATT GCCGTCATGT GCTGATGGCA GGCCTTGCGA TTCAGGGGCT GTCCTATTTC
CTAGGCACCT GGATCGATGC TGACTGGGTG CGCGAGAACT TCTATGTCAT GCAGCTCCTG
CAGGTGGCCA GCGAACCGAT GATGGTCATC GCGATCCTCA TGCTGGCGAC GATGGGCCTG
GGCCCGGCCG ATGGACCGTT CATCTCGGGC ATGTTCAACA TGACCAAGGG AGTGGCCAAT
GCCATCGCGG CGGGCGTCAT TTCGGCCCTG CTGCGTCGGC GCGAGCAATT CCATTCCACC
ATGCTGCTCG ACACCTATGG CACCAACCAT ACCGCGTTGC AGTCCTGGGG CGACCCGGCG
CAGGCGCTGC TGGCGCCACA CATCGCGGAT CCGGCCCGGT TGGACTGGAA CATGGCCGTC
CTGCTGCACG GCGAGGCGTC TGAGCAGGCG CTCATTCTGG CCTGTGCCGA TATCTACACG
ATCATGATCG GCGTCTGCGC CGCCCTGTTC ATGCTCAATC TGGTTCTGCC GCAACGGGTC
TATCCGCCCC GCGCGCCGTC CACCTCCGCT CCTGCACGCT GA
 
Protein sequence
MDHADALTRP RREVLPAPVS MQPEPPPAPP KAAVFGPRLA TGLAGVLLAV LLAGFNEHTT 
EAGLADIRGA FHLGHDEGTW ITALFEAFNI AAMAFAPWCS VTFSIRRLTI AMTGLVGVLG
TAAPFMPNLS SLLLLRCVQG LACGSLPPML MTVALRFLPP NIKIYGLGAY ALTATFGPNI
GGAPLAGFWF EYVGWPFLFW QIVPLCLISM VCVAWGLPQD PIRLERFRQF DWRGLLTGLP
AICMLVIALE QGDRLDWFRS PLITHLFFGG GFLFVLFIVN EWFHPVPFFR IQLLKQRNVT
DALLTLAAVL VLGAVTAEIP AVYLEEVRGY RPIQIAPVML ILAVPQVLAL PLVSALCNIR
RVDCRHVLMA GLAIQGLSYF LGTWIDADWV RENFYVMQLL QVASEPMMVI AILMLATMGL
GPADGPFISG MFNMTKGVAN AIAAGVISAL LRRREQFHST MLLDTYGTNH TALQSWGDPA
QALLAPHIAD PARLDWNMAV LLHGEASEQA LILACADIYT IMIGVCAALF MLNLVLPQRV
YPPRAPSTSA PAR
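
The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch of how one might strip that layout, translate the gene, and compute its GC content is shown below. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; table 11 differs only in its permitted start codons, and this sketch does not handle alternative starts. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI table order).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Raises ValueError on any base other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGGACCACG CTGATGCGCT GACACGTCCG"))  # prints: MDHADALTRP
```

The translated prefix agrees with the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.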