Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0801 |
Symbol | |
ID | 6974198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 910847 |
End bp | 912175 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643390330 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002275206 |
Protein GI | 209542977 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.469185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTCTT CCACGGAGAT GCATGTGACG CAGCAGCCCG GTCCGACCGA TGCCGATCGT GGCCGGCGCT TTTGGCAGAC ACGCAATTTC ATCCTGTTTT TCGTTGCCAC CAGTGCCTCG ACGCTGGGAT CGGCCATGGT ATCGGTCGCG TTGACGTTCG CGGTGCTGTC GCACGGGCAT TCGGCATCGA TCCTGGGCCT GGTCCTGGCC GCGCAGGCGG CCCCGGTCGT GGTGCTGATG GTTCCGGCGG GGGCGATCGC GGACCGGTGG GTGCGGCGGT CCCTGATGGT GGGCGCGGAC CTGCTGCGCT GTGCCAGCCA GGGTCTGACC GCGATCCTGA TGGCGGGTGC CCATCCCTCG GTCGCCGTGC TGATCGGCCT GGTCACGCTG GTCGGGGTCG GCAACGCATT CTACGGCCCT GCGGAAAGCG GCCTGATTCC GGTTCTGGCG CGGCCCGAGG ATCTGCGCCG CGTCAACAGC CTGCTCAGTC TTTCCGGCTC GATCACCGCG ATACTGGGGC CGTCGCTGGG CGGCATGCTG GTTGCGATCG GCAGCGCACC GATCGCGATC GGCTGCGACG CGGTGACCTA TGCCATCAGC GCCATCTGCC TGACGGCGAT AAGTACCCTG CGCCCGGCGC GCCGGGCGAC CGCACCGTTC CAGACCCAGT TGCTGGCCGG CCTGCGCGAG TTCCATCAGC GGCGATGGCT GATCCTTATG ACGGCGCAAT ACGGGTTCCT GAATCTGGCG GCGTTCGCGC CGTTCCTGAT CCTCGGCCCC GTCTCACTGG CCCATGTGGT GCGTGGCGCC CAGTCCTGGG GCATCATTTC CTCGGCCATC GGCATCGGCG GCATTTTCGG TGGGGGCGTC AGCCTGTTCT GGCATGTTTC CCGTCCGTTG GTGCTTTATG AAACGGCGGC TGCCGTCCTG GTGATTCCGC TGGTCCTGCT GGCCGCGCAG GCGTCGGTTC CCTACCTGGC CCTGGGCGGC GTCGCCTTTG GCGCGGGGAT CGTGATCCTG AACCTGGTCG CGCAGACCAC CATTCAACGG CAGGTGCCCG AGGAGGCGTT ATCGCGGATC AACGCCCTGT TCGGCCTGGT CGCGCAAGGC CTGACACCGC TGAGCTACGC CATGTGCGGC TTCCTGGCCC GTGCGGTCGG CATAAAGCCT GTTCTGGCGG CCAGCAGCGT CGTGGTCGGG GTCAGCGTCG TGGTCCTGCT GATGCGCAGG GAAACCTGGG ACCTGCGGGA TGCGCCGGCC GTCGCCGACG GCCGAAGTCA GGATAAGGGG GGCAGGACAA GGGGTCAGGA TAACCGGTCC AGCCGGTAG
|
Protein sequence | MGSSTEMHVT QQPGPTDADR GRRFWQTRNF ILFFVATSAS TLGSAMVSVA LTFAVLSHGH SASILGLVLA AQAAPVVVLM VPAGAIADRW VRRSLMVGAD LLRCASQGLT AILMAGAHPS VAVLIGLVTL VGVGNAFYGP AESGLIPVLA RPEDLRRVNS LLSLSGSITA ILGPSLGGML VAIGSAPIAI GCDAVTYAIS AICLTAISTL RPARRATAPF QTQLLAGLRE FHQRRWLILM TAQYGFLNLA AFAPFLILGP VSLAHVVRGA QSWGIISSAI GIGGIFGGGV SLFWHVSRPL VLYETAAAVL VIPLVLLAAQ ASVPYLALGG VAFGAGIVIL NLVAQTTIQR QVPEEALSRI NALFGLVAQG LTPLSYAMCG FLARAVGIKP VLAASSVVVG VSVVVLLMRR ETWDLRDAPA VADGRSQDKG GRTRGQDNRS SR
|
| |