Gene Gdia_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1952 
Symbol 
ID6975378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2166724 
End bp2168172 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content67% 
IMG OID643391481 
Productsugar transporter 
Protein accessionYP_002276327 
Protein GI209544098 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAATT TTTCACCGCA GGCCCCTTCG GTTCCCGCCC TGTCCGATTT CGAGGTTTCG 
GACAATGCGG GGCGGACGCT CTGGCTCGCA GCCGCCGTGG CCGCCATCTG TGGCGGCCTC
TACGGCTACG ACACCGGTAT CATTTCGGGT GCGTTGCTTC TGATTACCCG GGATTTCCAT
CTGGGCAGCC TGTATCAGGA ACTGGTCGCC TCCGCGATCC TGGCGGGGGC GGTGCTGGGC
GCCGTCGGCA CCGGCTGGCT GTCGGAACGG TTCGGCCGCC GGACGTCGGT CATGATCGTC
ACCGCCGTGT TCGTGACCGG CGCGCTGGCC TGCGCCGCCG CCCCGGATGT GGACATGCTG
ATCGCGGCAC GCGTCTATCT GGGGCTGGGG GTCGGCGGAT CGACCCAGGT GGTTCCGATG
TATATCTCGG AACTGGCGCC GGCGGCCCGG CGCGGCAAGC TGGTCACGCT GTTCAACGTC
GCGATCGGGA TCGGCATCTT CGTCGCCAAC ATCATCGGTT TCGCCGCGCG CGACGCCTGG
GGCTGGCGGC CGATGATCGC GGTCGCGGCC CTGCCGGCGG CACTGGTATT CGTGTCCATG
TTCTTCCTGC CCAAGAGCCC CCGCTGGACG GCGGAAAACG AGGGACTGGA TTCCGCGGTC
ACGCATCTGG CGCGCGTGCG GACGTCGCGC AAGGAAGTCC GCAAGGAAAT CCGCAGGATC
CACGAAGCCG CGGAAGACGT CGATGACGCG CATCGCGGCT GGCGCGGCCT GATGCAGCCC
TGGGTGCGCC CGGCGCTGGT CGCGGCGCTG GGGGTGGCCT TCTTCACCCA GTGCGGCGGG
CTGGAGATGA TGATCTATTA CGCCCCGACC TTCCTGTCGG ACGCGGGCTT CGGCCATTCC
TCGGCGCTGC TGGCCAGCCT GGGGGTCTCG ATGGTCTATC TGGTCATGAC GATGCTGGGC
TCGGCGATCG TCGATCATGT CGGCCGGCGC CGCCTGATGC TGATCATGGG GCCGGGATCG
GTGGCCAGCC TGCTGGGGCT GGGGCTGATG TTCGCCATCC ATCCCGACAA GGGCAGCGTC
GGAAGCTGGA TGATCATCGT GTTCCTGCTG ATGTTCATGG CGTTCAATTC CGGCGGCATC
CAGGTCGTCG GCTGGCTGCT GGGGGCGGAA ATGTTCCCGC TGTCGATGCG CGGCACCGCC
ACCAGCCTGC ACGCCGCGAC CCTGTGGGGC AGCGACCTGC TGGTGACCAG CACGGCGCTG
ACGCTGGTCA ACCTGATCTC GCTGGGCGGG ACGATGTGGT TCTATGCCGG GGTCAATCTG
GCGTCGGTCG CGTTCATCTA CTTCCTGGTG CCAGAGACGC GCGGTGCATC ACTGGAAGAC
ATCGAAACCG CCCTGCATGA GGGGCGCTTC CGGCCCACCA GGGGCCATAC CGCGATCGTC
GAGACCTGA
 
Protein sequence
MQNFSPQAPS VPALSDFEVS DNAGRTLWLA AAVAAICGGL YGYDTGIISG ALLLITRDFH 
LGSLYQELVA SAILAGAVLG AVGTGWLSER FGRRTSVMIV TAVFVTGALA CAAAPDVDML
IAARVYLGLG VGGSTQVVPM YISELAPAAR RGKLVTLFNV AIGIGIFVAN IIGFAARDAW
GWRPMIAVAA LPAALVFVSM FFLPKSPRWT AENEGLDSAV THLARVRTSR KEVRKEIRRI
HEAAEDVDDA HRGWRGLMQP WVRPALVAAL GVAFFTQCGG LEMMIYYAPT FLSDAGFGHS
SALLASLGVS MVYLVMTMLG SAIVDHVGRR RLMLIMGPGS VASLLGLGLM FAIHPDKGSV
GSWMIIVFLL MFMAFNSGGI QVVGWLLGAE MFPLSMRGTA TSLHAATLWG SDLLVTSTAL
TLVNLISLGG TMWFYAGVNL ASVAFIYFLV PETRGASLED IETALHEGRF RPTRGHTAIV
ET