Gene Gdia_1653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1653 
Symbol 
ID6975069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1843517 
End bp1844809 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content69% 
IMG OID643391188 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002276045 
Protein GI209543816 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.389792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGTG ACGCCACGGC CACGATCGGG CCGGGCGGGG CATCGCGGGC GTCCGGGCTG 
GGCCCGGCCC GCGACCCGGC CTTCGACCGG GCGACGGCGC TGCATGTGGT CGCGGCCTCG
CTGCTGGCAT GGCTGCTGGA TGCCTGTGAC TTCTTCATCG TCCTGTTCAC GCTTGACGAC
GTCGCGCACA GCTTCCAGGT CTCGCTGCAA AGCATCCTGC TCGCACCGAC GCTGACGCTG
CTGACGCGCC CGATCGGCGC CTTCCTGTGC GGGCATGCCG CGGACCGGTA TGGCCGCAAG
CCGGTGATGA TCGCCACCAT CGTCGTCTAT TCCGTGATCG AGATCCTGTC CGCCTTCGCG
CCGACGGTGA CCGTCTTCCT GTTCCTGCGC GCCCTGTTCG GCGTGGCGCT GGGCGGCGAG
TGGGGGGTGG GCACGTCCCT GCTCATGGAA AGCATTCCGC GGTCATGGCG CGGTACCGCG
TCGGGCATCC TGCAGGCCGG CTATCCTGCC GGCTATCTGC TGGCATCGCT GCTCTTCCTG
CTGCTGCCTG TCGTGGGCTG GCGCGGACTG TTCATCCTCG GCGGCGGCGC GCTGGTGGCC
GCGCTGTATA TCTGGGTGCG GGTGCCGGAA AGCCCCGAAT GGCTGCAGCG CCATCGCCAG
CAGGGCACCG ACGCCGTGCG CGCGCCCGGA TTGTGGTCCA TCGTCAGGAA CAATGCGGCG
CTCTGCCTGT TCGCGGTGAC GCTGATGGCC GCGTTCAACT TCATGAGCCA CGGGTCCCAG
GACCTGTATC CGAAAGTGTT CCTGGGGCTG GAGCGCGGGC TGCCGCACCC CACGATCACG
CTGATCGTGG TGCTGTATAA CATCGCCGCC ATCGCCGGCG GCCTGTTCTT CGGCGTGCTG
TCGCAGAAGA TCGGCCGGCA GTACAGCATC GCCCTGGCGG GTATCCTGAC TCTGCCGTTC
CTGCCGCTGT GGGCGCTGTC CCATTCCGCC TTCTGGCTGG CGGCCGGTGC CGTCTGCATC
CAGTTCTGCG TCCAGGGCGC GTGGGGCGTG GTGCCCGCCT ACCTGAGCGA GCTTTCGCCC
CCGTCGGTCC GGGCCACCTT TCCGGGGCTG GCCTATCAGT GCGGCAACCT GATCGCCGCC
GGCAACGCCC TGCTGCAAAC CTGGATCGCG GCGTTCCTGG GAACGGGCCT GGCGCCGGCG
CTGATGCTGA CGGTCGGCGG CGCCGCGACC GTCGTGGTGA CCCTGATCCT GCTCAACGCT
ACATGGTACG CGCGATCCAG GCCGATCGGG TAG
 
Protein sequence
MNGDATATIG PGGASRASGL GPARDPAFDR ATALHVVAAS LLAWLLDACD FFIVLFTLDD 
VAHSFQVSLQ SILLAPTLTL LTRPIGAFLC GHAADRYGRK PVMIATIVVY SVIEILSAFA
PTVTVFLFLR ALFGVALGGE WGVGTSLLME SIPRSWRGTA SGILQAGYPA GYLLASLLFL
LLPVVGWRGL FILGGGALVA ALYIWVRVPE SPEWLQRHRQ QGTDAVRAPG LWSIVRNNAA
LCLFAVTLMA AFNFMSHGSQ DLYPKVFLGL ERGLPHPTIT LIVVLYNIAA IAGGLFFGVL
SQKIGRQYSI ALAGILTLPF LPLWALSHSA FWLAAGAVCI QFCVQGAWGV VPAYLSELSP
PSVRATFPGL AYQCGNLIAA GNALLQTWIA AFLGTGLAPA LMLTVGGAAT VVVTLILLNA
TWYARSRPIG