Gene Gdia_3110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3110 
Symbol 
ID6976544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3406265 
End bp3407674 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content71% 
IMG OID643392618 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002277455 
Protein GI209545226 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTGC GGTCCGTGCC CGATACGTCC GCGCCCGATG CCCCGGCGCG CCGAACGCTG 
GCGGAGGGTG AAGGCCTGCA CGGCACGGCG CGGGCGATGG CGATGTTCGC CGTGGCGCTG
TCGGTGCTGC TGTCGGTGCT GGACTATGCC ATCGCCAATG TGGCGCTGCC CACCATCGCG
CACGACATCC ATGCCAGCGC GTCATCGTCC ATCTGGGTGA TCAACGCCTA CCAACTTGCC
AGCGTGTCCA CACTGCTGCC GCTGGCGTCG CTGGGCGCGC GGGTGGGATT CGCGCGGCTG
TGCCGGATCG GGCTGGGCAC CTTCGTGGTC GCGTCCCTGC TGTGCGCGAT GTCGCACACG
CTGCTGGAAC TGGCCCTGGC CCGCGCGCTG CAGGGCGTGG GCGGGGCGTG CATCATGAGC
GTCAATATCG CGTTGGTCCG CTTCATCTAT CCGCATTCCC AGATCGGCAA GGGGATCTCG
CTGAACGGCA TGGTGGTGGC GACCGGCGTG GCGCTGGGGC CCACCATCGC CTCGGCGGTG
CTGTCGGTGG CCGACTGGCC GTGGATCTTC CTGATCAACC TGCCGCTGGG CGGGGCGGCG
ATGGTGGCGG CCGCGTTCTA TCTGCCGCGC ACGCCGCGCA GCCCGTCCTC GTTCGACCTG
GCCAGCGCGG TGCTGAACGT CGTGGCGTTC GGGGGCGTGA TCGTCGGGCT GGACAGCCTG
GTGCATCGTA GCGGGCCCGC GCTGGCGGTG TGGCTGCTGG CGGGCGGGGC GGGGGCGTTC
GTCCAGCTGG CCCGGCGCCA GGCCGGGCGG TCCGACCCGA TGCTGCCGAT CGACCTGCTG
GCGGTGCCGG ATTTCCTGGT CGCGTTCGGC GTGGGGTTCG GGGCGTTCGT CGCGTCGAAC
TTCTTCATCA TTTCCATGCC GTTCGCGCTG CAATCGGTGC TGCATCGCTC GGCGGTGGCC
ACCGGCCTGC TGATCACACC CTGGCCGGTC GGGATCGTCG CCATCTCGCT GTTCGTCGGG
CGGGTGGCGG ACCGGGTGCC GGCGGCCATC CTGTCCTCGA TCGGCCTGTG CGTGACGGGC
ACGGGCTTCG TCCTGCTGTG CCTGCTGCCG CCCGATGCCG GGAACCTGGA CATTGCCTGG
CGTATCGGCC TGGCCGGGGT CGGGTTCGGG ATTTTCCAGC CGCCGAACAA TCGCGCGATG
ATGGTGTCCG CCCCGCGCGG CCGGTCGGGC AGCGCCAGCG GCATGGTGTC GGTCGCGCGC
CTGTCGGGCC AGACCATGGG GGCGATCTTC GTGGCGCTGA CCTTCGGTTT CGTCACCCAT
GACCCCACCC TGCGCTGCAT GGAACTGGCG GCGGCGTGTG CCTTCCTGTC GGCGCTGCTC
AGCGCCAGCC GGCTGCGGGC CCTGCGGTGA
 
Protein sequence
MALRSVPDTS APDAPARRTL AEGEGLHGTA RAMAMFAVAL SVLLSVLDYA IANVALPTIA 
HDIHASASSS IWVINAYQLA SVSTLLPLAS LGARVGFARL CRIGLGTFVV ASLLCAMSHT
LLELALARAL QGVGGACIMS VNIALVRFIY PHSQIGKGIS LNGMVVATGV ALGPTIASAV
LSVADWPWIF LINLPLGGAA MVAAAFYLPR TPRSPSSFDL ASAVLNVVAF GGVIVGLDSL
VHRSGPALAV WLLAGGAGAF VQLARRQAGR SDPMLPIDLL AVPDFLVAFG VGFGAFVASN
FFIISMPFAL QSVLHRSAVA TGLLITPWPV GIVAISLFVG RVADRVPAAI LSSIGLCVTG
TGFVLLCLLP PDAGNLDIAW RIGLAGVGFG IFQPPNNRAM MVSAPRGRSG SASGMVSVAR
LSGQTMGAIF VALTFGFVTH DPTLRCMELA AACAFLSALL SASRLRALR