Gene Gdia_0954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0954 
Symbol 
ID6974351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1078402 
End bp1079955 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content65% 
IMG OID643390477 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002275353 
Protein GI209543124 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACCCG AGATGCGAGG ACGAATCCTC ACGGGAACCT GCGGTGTCTG CGCGGCCTGG 
ATGGCCATGA TGTCGACGCG GTTCTTCGCG CTTTCCCTTC CCGATATCGA GGGCGTGCTG
GGGATCGGGC CGGATGAGGG TTCATGGCTC GCCGCGAGCT ACCTGGCATT CGAGCCGGTC
GGTGTCGCTC TTGGATGCTG GCTGGCCGTC ATCTTCTCCC TGCGCCGGGT CATGCTGCCT
GCGATCCTGC TGTTCGTGAT CGCCTCCGCG ATCGCGGCGG TCACGCATGA CCTGACGATG
ATGATCATGG CACGCGCGGG CATGGGTTTC GCCGCCGGAA TCATCGTGCC GCTCGCCATC
ATCACCGAGC TGCGGACCTT CCCCCCGACA TGGCGGGCGC TAGCGATCGC GCTCTATTCC
ACGGCGGCGA CGATGGCGCC GCAGGCCGCA GCACCTCTGG ACATCTGGCT GGTGACCCGC
TGGGGCATAC CGGCAATCTT CTGGATCGCG ATCCCATTGG GGCTCATCGC GTTCGTCTGC
GGATATCTCG GCCTATGGCG GGAGCGAATT CGCTGGCTGT TATTTGCTCG CGCCGACCTG
CGCGTGGTCT TCGCCTTTTC GGTGGGTGCT ATCCTGCTCG GCGGCGGCGT CAGTCAGGGA
AACCGGCTGC ACTGGTCGGA GACACCGCTG ACCCTGTTCA CCGTCGTCAC GGGCATCGCG
CTGCTAACCG GCGTTCTGGT GCTGGGTGGA CGCCGGGTGG TCCATCCGAT CCTGATGAAG
CGATTGCTGA CCCGCCGCAA TGCGCTGCTC GGTGCGCTCA TCACCATTCC CTATCAGTTC
GCATCGGTCC TTTCGGGCAG CCTAGTGCCC GCCTTCCTCG CCGATATCCC CGCCTATCGG
CCGGAGCAGA TCGCTCCCGC GCTCGACGCT GTGTTCTGGC CGCAGGCCCT CGCTTACCCC
GTCTGCGTCA TGGCGTTGCG CTATGGATGG GTGGAAGCCC GGACCTTCCT CGTGCTGGGT
TTCGGCCTGA ACGCACTGGC CTGCCTGCTC GACCTCGACG TCACGAGCGA CTGGATTCCC
GGCAATTTCC TCACGGGACA AATCCTGCAG GGGATCGGCC TGCCGTGGAT CCTCCTGCCG
TTGCTGATGC TTTTCGTGGG CGACGTGGTA CCAAGCGAGG GGCTTTATGC CGCCGCCATC
TTCAATGTCG CGCGCAGCCT CGCCGGCACG GTCGCAGCGG CATGGGCGGC GACCGGACTA
CGGCTTCGGG CCGAGGGGCG CTACGGCGAA TTGCTTACCA GCACCGGGCT TCAGCCGCAC
CGTCGTATGG TCGCGGTATG GGACGGAGTT GGTGCCTTCC CCCGGACGAC GCCCGATCCC
GTGCTGCTGG ACCATCGCGC CCGGTCAGTG TTCGGCACGC TCGTGCGGCA CCAGTCGGTC
GTCATCGGGT TCTCGACGCT GCTGGCCGAT CTGGCGCTCA TGCTGGCCGT AACCTGCATC
ATCGCGGCCT GTATGCCACC GGCCGGCGCT GATCGACCGG AGGAGCAATC ATGA
 
Protein sequence
MIPEMRGRIL TGTCGVCAAW MAMMSTRFFA LSLPDIEGVL GIGPDEGSWL AASYLAFEPV 
GVALGCWLAV IFSLRRVMLP AILLFVIASA IAAVTHDLTM MIMARAGMGF AAGIIVPLAI
ITELRTFPPT WRALAIALYS TAATMAPQAA APLDIWLVTR WGIPAIFWIA IPLGLIAFVC
GYLGLWRERI RWLLFARADL RVVFAFSVGA ILLGGGVSQG NRLHWSETPL TLFTVVTGIA
LLTGVLVLGG RRVVHPILMK RLLTRRNALL GALITIPYQF ASVLSGSLVP AFLADIPAYR
PEQIAPALDA VFWPQALAYP VCVMALRYGW VEARTFLVLG FGLNALACLL DLDVTSDWIP
GNFLTGQILQ GIGLPWILLP LLMLFVGDVV PSEGLYAAAI FNVARSLAGT VAAAWAATGL
RLRAEGRYGE LLTSTGLQPH RRMVAVWDGV GAFPRTTPDP VLLDHRARSV FGTLVRHQSV
VIGFSTLLAD LALMLAVTCI IAACMPPAGA DRPEEQS