Gene Gdia_1056 details


Gene Information

Locus tag: Gdia_1056
Symbol:
ID: 6974453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1183311
End bp: 1184597
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 58%
IMG OID: 643390578
Product: major facilitator superfamily MFS_1
Protein accession: YP_002275454
Protein GI: 209543225
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
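
The start and end coordinates, gene length, and protein length listed above are consistent with each other. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Information fields for Gdia_1056.
length = gene_length_bp(1183311, 1184597)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```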


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTGCTGA ACGAAGAGAA CGTGATGACC CAGCCTCGTA CCGCCACAGT CGTTACTGAT 
GAAACTCCTG CTATTCTGGG GTTCAATCGG GCCTTGCTCG CCTTAGGCGC TCTCAATTTC
TTCATGGCGG ACGTCCAGGC GGGGATAGGG CCCTTTCTGG GCGTTTTCCT TCAGCGTCAC
GGCTGGCAGA CGGGACCGAT CGGAACCGTC ATGACTGTGG GTGGAGTTGC GGGAATGCTC
GCCACCATTC CCGCAGGCGC GCTGATTGAT CACACGACGA AAAAGCGGTT GCTCGTCATT
GTGGCAGCGC TCTGCACGAT TTCTGCTTCA CTTCTTCTTC TAAGCTCGCA AGCCGTGCCC
GTTGTGACGG TCAGTCAGCT TGCAACCGCT CTGGCCGGCG CTGGAATTGG CCCTCTGATG
GCTGCCATAA CCCTTGGTAT CGTGCGCCAG AAAGGCTTCA ACACACAGAT CGGTCGTAAC
CAGGCCTGGA ACCATGCCGG CAATATGGCC GGGGCCGGAC TGTCCGGCTG GCTGGGATGG
CAGTTTGGCC TCTCAGCGAT TTTCTTTCTT GAAGTCGCCT TCGGTCTGTT CGCCATTTCT
GCGGTGCTCC TGATCCCGGA AAAATCCATA GATCATAAAG CTGCACGCGG ACTGGACGAT
GAACCCGTTC ACGATGAGGG GACGGCGGAG GGATTACGAC CCTTTCTGCG ACACAAGCCT
CTTCTCATTC TGGCGAGTTG TTTGTGTTTC TTCCATCTCG GAAATGCCGC GATGCTCCCG
CTGTACGGCA TGGCGGTCGT CAGTGCAGGC AAAGGTAATC CAGCCATGTT CACGGCGATG
ACCGTGATGG TCGCACAGGC TGTGATGATC GTCGTGAGCC TGCTGGCCAT ACGTTTCGTC
AGGGACCGTG GTTACTGGTT CGTCCTGCTG ATATCGTTTG CCGCCCTGCC GCTGCGTGGT
TTGATTGCGG GAAGCTTCAT CCAGCATTGG GGGGTGTGGC CGGTGCAGAT CCTAGATGGG
ATCGGTGCGG GGCTTCAAAG TGTTGCCGTG CCGGGTCTGG TGGCCAGACT GCTGAACGGA
ACCGGACGGA TCAATATCGG ACAGGGCGTG GTCATGACGG CGCAGGGCAT TGGAGCAAGC
CTCTCCCCGG CTCTGGGAGG ATGGCTTGCC GAAGATCTGG GATACGCCGT GGCGTTCTAT
ACTCTGGGTT GCTTTGCAAT CGTGTCACTG GGGCTCTGGA TAGGCTCGGC ATCGACCCTG
CGATCTGCCG ATCAGGTGTC GGCATGA
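
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it. The sketch below shows one way to recover the sequence length and the GC percentage reported in the Gene Information fields. The helper names are hypothetical, and the example uses only the last line of the sequence rather than the full gene.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases in blocks."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Last line of the gene sequence as printed above.
last_line = "CGATCTGCCG ATCAGGTGTC GGCATGA"
print(len(clean_sequence(last_line)))  # 27
print(round(gc_percent(last_line), 1))
```

Passing the whole gene sequence block to `gc_percent` gives a value that can be compared with the GC content field.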
 
Protein sequence
MLLNEENVMT QPRTATVVTD ETPAILGFNR ALLALGALNF FMADVQAGIG PFLGVFLQRH 
GWQTGPIGTV MTVGGVAGML ATIPAGALID HTTKKRLLVI VAALCTISAS LLLLSSQAVP
VVTVSQLATA LAGAGIGPLM AAITLGIVRQ KGFNTQIGRN QAWNHAGNMA GAGLSGWLGW
QFGLSAIFFL EVAFGLFAIS AVLLIPEKSI DHKAARGLDD EPVHDEGTAE GLRPFLRHKP
LLILASCLCF FHLGNAAMLP LYGMAVVSAG KGNPAMFTAM TVMVAQAVMI VVSLLAIRFV
RDRGYWFVLL ISFAALPLRG LIAGSFIQHW GVWPVQILDG IGAGLQSVAV PGLVARLLNG
TGRINIGQGV VMTAQGIGAS LSPALGGWLA EDLGYAVAFY TLGCFAIVSL GLWIGSASTL
RSADQVSA
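
The gene begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the alternative start codons and an initiating codon is read as methionine. The sketch below is a minimal illustration with a hand-built codon table, not the method used to produce this record. `translate_cds` is a hypothetical helper, and the start-codon set is the one listed for NCBI table 11.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G);
# table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(cds_text):
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(cds_text.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed above.
first_line = "TTGCTGCTGA ACGAAGAGAA CGTGATGACC CAGCCTCGTA CCGCCACAGT CGTTACTGAT"
print(translate_cds(first_line))  # MLLNEENVMTQPRTATVVTD
```

The output matches the first twenty residues of the protein sequence above.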