Gene Gdia_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0801 
Symbol 
ID6974198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp910847 
End bp912175 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content67% 
IMG OID643390330 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002275206 
Protein GI209542977 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.469185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCTT CCACGGAGAT GCATGTGACG CAGCAGCCCG GTCCGACCGA TGCCGATCGT 
GGCCGGCGCT TTTGGCAGAC ACGCAATTTC ATCCTGTTTT TCGTTGCCAC CAGTGCCTCG
ACGCTGGGAT CGGCCATGGT ATCGGTCGCG TTGACGTTCG CGGTGCTGTC GCACGGGCAT
TCGGCATCGA TCCTGGGCCT GGTCCTGGCC GCGCAGGCGG CCCCGGTCGT GGTGCTGATG
GTTCCGGCGG GGGCGATCGC GGACCGGTGG GTGCGGCGGT CCCTGATGGT GGGCGCGGAC
CTGCTGCGCT GTGCCAGCCA GGGTCTGACC GCGATCCTGA TGGCGGGTGC CCATCCCTCG
GTCGCCGTGC TGATCGGCCT GGTCACGCTG GTCGGGGTCG GCAACGCATT CTACGGCCCT
GCGGAAAGCG GCCTGATTCC GGTTCTGGCG CGGCCCGAGG ATCTGCGCCG CGTCAACAGC
CTGCTCAGTC TTTCCGGCTC GATCACCGCG ATACTGGGGC CGTCGCTGGG CGGCATGCTG
GTTGCGATCG GCAGCGCACC GATCGCGATC GGCTGCGACG CGGTGACCTA TGCCATCAGC
GCCATCTGCC TGACGGCGAT AAGTACCCTG CGCCCGGCGC GCCGGGCGAC CGCACCGTTC
CAGACCCAGT TGCTGGCCGG CCTGCGCGAG TTCCATCAGC GGCGATGGCT GATCCTTATG
ACGGCGCAAT ACGGGTTCCT GAATCTGGCG GCGTTCGCGC CGTTCCTGAT CCTCGGCCCC
GTCTCACTGG CCCATGTGGT GCGTGGCGCC CAGTCCTGGG GCATCATTTC CTCGGCCATC
GGCATCGGCG GCATTTTCGG TGGGGGCGTC AGCCTGTTCT GGCATGTTTC CCGTCCGTTG
GTGCTTTATG AAACGGCGGC TGCCGTCCTG GTGATTCCGC TGGTCCTGCT GGCCGCGCAG
GCGTCGGTTC CCTACCTGGC CCTGGGCGGC GTCGCCTTTG GCGCGGGGAT CGTGATCCTG
AACCTGGTCG CGCAGACCAC CATTCAACGG CAGGTGCCCG AGGAGGCGTT ATCGCGGATC
AACGCCCTGT TCGGCCTGGT CGCGCAAGGC CTGACACCGC TGAGCTACGC CATGTGCGGC
TTCCTGGCCC GTGCGGTCGG CATAAAGCCT GTTCTGGCGG CCAGCAGCGT CGTGGTCGGG
GTCAGCGTCG TGGTCCTGCT GATGCGCAGG GAAACCTGGG ACCTGCGGGA TGCGCCGGCC
GTCGCCGACG GCCGAAGTCA GGATAAGGGG GGCAGGACAA GGGGTCAGGA TAACCGGTCC
AGCCGGTAG
 
Protein sequence
MGSSTEMHVT QQPGPTDADR GRRFWQTRNF ILFFVATSAS TLGSAMVSVA LTFAVLSHGH 
SASILGLVLA AQAAPVVVLM VPAGAIADRW VRRSLMVGAD LLRCASQGLT AILMAGAHPS
VAVLIGLVTL VGVGNAFYGP AESGLIPVLA RPEDLRRVNS LLSLSGSITA ILGPSLGGML
VAIGSAPIAI GCDAVTYAIS AICLTAISTL RPARRATAPF QTQLLAGLRE FHQRRWLILM
TAQYGFLNLA AFAPFLILGP VSLAHVVRGA QSWGIISSAI GIGGIFGGGV SLFWHVSRPL
VLYETAAAVL VIPLVLLAAQ ASVPYLALGG VAFGAGIVIL NLVAQTTIQR QVPEEALSRI
NALFGLVAQG LTPLSYAMCG FLARAVGIKP VLAASSVVVG VSVVVLLMRR ETWDLRDAPA
VADGRSQDKG GRTRGQDNRS SR