Gene Gdia_2303 details


Gene Information

Locus tag: Gdia_2303
Symbol:
ID: 6975733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2551847
End bp: 2553484
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 66%
IMG OID: 643391831
Product: major facilitator superfamily MFS_1
Protein accession: YP_002276673
Protein GI: 209544444
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
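
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2551847
end_bp = 2553484

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, "bp")      # 1638 bp
print(protein_length_aa, "aa")   # 545 aa
```

Both results agree with the Gene Length and Protein Length fields listed in the record.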


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.292812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0663224
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAACG TCGTTCCTGT GCCATCGGCT GGCGAGCCGG TCATCCACGC CTCGCCCCCC 
CGGCCGTCCT ATATCCCACC CTTCGGGGTG CGCACGGTGA TCGGCTGCCT GGGCATGCTG
CTGGCCGTGC ATGTGGCGGG CTTCAACGAA CACGTGACCG AGATCGGCCT GACCGATATC
CGCGGCGCCA TGCACATCGG CTACGATGAG GGAACGTGGC TGATCGCCAT CTACGAATCC
TTCAACATCG CGGCCATGGC CTTCACGCCG TGGTTCTACA TGACGTTTTC CATCTACCGC
TTCTCGATCT TCGTGACGGC CGTCATGGCG CTGCTGGCGA TCCCGGCGCC GTTCATGCCC
GATGTCACGT CGCTGTGCAT CCTGCGGGCC TTCCAGGGGC TGATGGCCGG GTGCCTGCCG
CCGGTGCTGA TGACGGTGAT GCTGAAATAC CTGCCGCCGG AAATCCGCGT GTTCGGCATC
GGCGGCTACG CGATGAGCGC GACCTTCGGC CCCAACCTGG GCCTGCCGCT GGAGGCCTTC
TGGTTCGAAC GTGTCGGCTG GCACTGGCTC TATTGGGAAA TCATCCCGCT TGCCGCGCTG
TCGATCGCCA TGATCGCGTA CGGGCTGCCG CGCGATCCCA TGCATTTCGA ACGGTTCCAG
AAGTTCAACT GGCTCGGCCT GCTGGTCGGC CTGCCCGCCA TCTGCGCGCT GGTCATCGTG
CTCTACCAGG GGGACCGGCT GGACTGGTTC CGCTCGCCCG TCATCACCAA CCTGTCCTTC
TGGGGCGGGG CGGCGTTCAT CGTCTTCGTC ATCAACGAGG CGTACCATCC CAGCCCGTAT
TTCCGCGTGC AGTACTGGCG GTCGCGCAAC ATCCAGGCCT CGCTGCTGTC GCTGGTCGGC
ATCCTGGCCA TCTGCGCCAT GATGGGCGAA ATCCCCGGCA TCTACCTGGA GGCCGTGCGC
GGCTATCGCC CGATCCAGGC GGCCCCCGTC TCACTGGTCG TGGCGCTGCC GCAGCTCCTG
ATGCTTCCGC TGATCGCGGC CATCTGCAAC AGCCGCAGGG TGGATTGCCG CTACGTGCTC
TCGGGCGGAA TGCTCTGCCT GGCCGGCGCG GCCTGGCTGG GCACGTGGCT GACCGTGGAC
TGGGTGCGGG ACAATTTCTA CGCGCTGCAG GTCCTGCAGA TCTTCGGCCA GCCCATGACC
GTCATTCCCA CGCTGATGCT CGCCACCCTC GCGATGGGGC CCGCCGACGG TCCGTTCATC
TCGGGCATGG TGAACATGCT CAAGGGCCTG GCCAACGCGG TGGCATACGC CGTGTTCGCG
GCCCTGACCC GGCGGCGGGA GCAATATCAT TCCACCATGC TGCTGGACCA TCACGGCACG
CACGGCCTGG CGCTGCAGGG GATGGGCGAT CCGGTCAACC GGCAGCTTGC CGCCACGTCG
CCCGACAGCG CGCATGTCGC GCGCAACACG CTCCAGGTCT TTCATACCTT CGTGCACGAG
CAGTCGCTCG TCCTCGCGCT GGCCGACATC TACTTCGTGC TGATCTGGGT CTGCCTCGGC
TACGCGGTCA TGAACCTCAT CCTGCCGCGC CGGGTCTATC CGCCGCGCGC GCCGTCGCCG
AACACTCCCG CCCGCTAA
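
The GC content field can be recomputed from a gene sequence printed in this layout. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name, and the record does not state how its own figure was rounded.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    The sequence is printed in space-separated blocks of ten bases,
    so all whitespace is removed before counting.
    """
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = dna.count("G") + dna.count("C")
    return gc_count / len(dna)


# Usage: paste the full gene sequence in place of this opening fragment.
fragment = "ATGAATAACG TCGTTCCTGT GCCATCGGCT"
print(f"GC content: {gc_fraction(fragment):.0%}")
```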
 
Protein sequence
MNNVVPVPSA GEPVIHASPP RPSYIPPFGV RTVIGCLGML LAVHVAGFNE HVTEIGLTDI 
RGAMHIGYDE GTWLIAIYES FNIAAMAFTP WFYMTFSIYR FSIFVTAVMA LLAIPAPFMP
DVTSLCILRA FQGLMAGCLP PVLMTVMLKY LPPEIRVFGI GGYAMSATFG PNLGLPLEAF
WFERVGWHWL YWEIIPLAAL SIAMIAYGLP RDPMHFERFQ KFNWLGLLVG LPAICALVIV
LYQGDRLDWF RSPVITNLSF WGGAAFIVFV INEAYHPSPY FRVQYWRSRN IQASLLSLVG
ILAICAMMGE IPGIYLEAVR GYRPIQAAPV SLVVALPQLL MLPLIAAICN SRRVDCRYVL
SGGMLCLAGA AWLGTWLTVD WVRDNFYALQ VLQIFGQPMT VIPTLMLATL AMGPADGPFI
SGMVNMLKGL ANAVAYAVFA ALTRRREQYH STMLLDHHGT HGLALQGMGD PVNRQLAATS
PDSAHVARNT LQVFHTFVHE QSLVLALADI YFVLIWVCLG YAVMNLILPR RVYPPRAPSP
NTPAR
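
The protein sequence can be derived from the gene sequence with the listed translation table. The sketch below assumes NCBI translation table 11, whose amino-acid assignments are the same as the standard code, and it stops at the first stop codon. `translate_cds` is a hypothetical helper name. It does not handle ambiguous bases or the special case of alternative start codons, which does not arise here because the gene starts with ATG.

```python
# NCBI translation table 11 amino acids, with codons ordered by the
# bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the printed layout.
    dna = "".join(sequence.split()).upper()
    protein = []
    # Walk over complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final line of the gene sequence.
print(translate_cds("ATGAATAACG TCGTTCCTGT GCCATCGGCT"))  # MNNVVPVPSA
print(translate_cds("AACACTCCCG CCCGCTAA"))               # NTPAR
```

The two fragments reproduce the first ten and the last five residues of the protein sequence shown above.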