Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2303 |
Symbol | |
ID | 6975733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2551847 |
End bp | 2553484 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643391831 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002276673 |
Protein GI | 209544444 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.292812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0663224 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAACG TCGTTCCTGT GCCATCGGCT GGCGAGCCGG TCATCCACGC CTCGCCCCCC CGGCCGTCCT ATATCCCACC CTTCGGGGTG CGCACGGTGA TCGGCTGCCT GGGCATGCTG CTGGCCGTGC ATGTGGCGGG CTTCAACGAA CACGTGACCG AGATCGGCCT GACCGATATC CGCGGCGCCA TGCACATCGG CTACGATGAG GGAACGTGGC TGATCGCCAT CTACGAATCC TTCAACATCG CGGCCATGGC CTTCACGCCG TGGTTCTACA TGACGTTTTC CATCTACCGC TTCTCGATCT TCGTGACGGC CGTCATGGCG CTGCTGGCGA TCCCGGCGCC GTTCATGCCC GATGTCACGT CGCTGTGCAT CCTGCGGGCC TTCCAGGGGC TGATGGCCGG GTGCCTGCCG CCGGTGCTGA TGACGGTGAT GCTGAAATAC CTGCCGCCGG AAATCCGCGT GTTCGGCATC GGCGGCTACG CGATGAGCGC GACCTTCGGC CCCAACCTGG GCCTGCCGCT GGAGGCCTTC TGGTTCGAAC GTGTCGGCTG GCACTGGCTC TATTGGGAAA TCATCCCGCT TGCCGCGCTG TCGATCGCCA TGATCGCGTA CGGGCTGCCG CGCGATCCCA TGCATTTCGA ACGGTTCCAG AAGTTCAACT GGCTCGGCCT GCTGGTCGGC CTGCCCGCCA TCTGCGCGCT GGTCATCGTG CTCTACCAGG GGGACCGGCT GGACTGGTTC CGCTCGCCCG TCATCACCAA CCTGTCCTTC TGGGGCGGGG CGGCGTTCAT CGTCTTCGTC ATCAACGAGG CGTACCATCC CAGCCCGTAT TTCCGCGTGC AGTACTGGCG GTCGCGCAAC ATCCAGGCCT CGCTGCTGTC GCTGGTCGGC ATCCTGGCCA TCTGCGCCAT GATGGGCGAA ATCCCCGGCA TCTACCTGGA GGCCGTGCGC GGCTATCGCC CGATCCAGGC GGCCCCCGTC TCACTGGTCG TGGCGCTGCC GCAGCTCCTG ATGCTTCCGC TGATCGCGGC CATCTGCAAC AGCCGCAGGG TGGATTGCCG CTACGTGCTC TCGGGCGGAA TGCTCTGCCT GGCCGGCGCG GCCTGGCTGG GCACGTGGCT GACCGTGGAC TGGGTGCGGG ACAATTTCTA CGCGCTGCAG GTCCTGCAGA TCTTCGGCCA GCCCATGACC GTCATTCCCA CGCTGATGCT CGCCACCCTC GCGATGGGGC CCGCCGACGG TCCGTTCATC TCGGGCATGG TGAACATGCT CAAGGGCCTG GCCAACGCGG TGGCATACGC CGTGTTCGCG GCCCTGACCC GGCGGCGGGA GCAATATCAT TCCACCATGC TGCTGGACCA TCACGGCACG CACGGCCTGG CGCTGCAGGG GATGGGCGAT CCGGTCAACC GGCAGCTTGC CGCCACGTCG CCCGACAGCG CGCATGTCGC GCGCAACACG CTCCAGGTCT TTCATACCTT CGTGCACGAG CAGTCGCTCG TCCTCGCGCT GGCCGACATC TACTTCGTGC TGATCTGGGT CTGCCTCGGC TACGCGGTCA TGAACCTCAT CCTGCCGCGC CGGGTCTATC CGCCGCGCGC GCCGTCGCCG AACACTCCCG CCCGCTAA
|
Protein sequence | MNNVVPVPSA GEPVIHASPP RPSYIPPFGV RTVIGCLGML LAVHVAGFNE HVTEIGLTDI RGAMHIGYDE GTWLIAIYES FNIAAMAFTP WFYMTFSIYR FSIFVTAVMA LLAIPAPFMP DVTSLCILRA FQGLMAGCLP PVLMTVMLKY LPPEIRVFGI GGYAMSATFG PNLGLPLEAF WFERVGWHWL YWEIIPLAAL SIAMIAYGLP RDPMHFERFQ KFNWLGLLVG LPAICALVIV LYQGDRLDWF RSPVITNLSF WGGAAFIVFV INEAYHPSPY FRVQYWRSRN IQASLLSLVG ILAICAMMGE IPGIYLEAVR GYRPIQAAPV SLVVALPQLL MLPLIAAICN SRRVDCRYVL SGGMLCLAGA AWLGTWLTVD WVRDNFYALQ VLQIFGQPMT VIPTLMLATL AMGPADGPFI SGMVNMLKGL ANAVAYAVFA ALTRRREQYH STMLLDHHGT HGLALQGMGD PVNRQLAATS PDSAHVARNT LQVFHTFVHE QSLVLALADI YFVLIWVCLG YAVMNLILPR RVYPPRAPSP NTPAR
|
| |