Gene Gdia_2389 details


Gene Information

Locus tag: Gdia_2389
Symbol:
ID: 6975819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2648837
End bp: 2650099
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 59%
IMG OID: 643391913
Product: major facilitator superfamily MFS_1
Protein accession: YP_002276755
Protein GI: 209544526
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
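
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2648837
end_bp = 2650099

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1263 420"
```

Both results agree with the Gene Length and Protein Length fields of the record.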


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGC CGCATACCAC TGCAGCCTGT ACTGGCAAAA CCCTCGCTAT TCCGAGGCTC 
AATCGGGCCT TGTTTGCCTT ATGCGCTCTC AATTTCTTCA TGGCGGACGT CCAAGCGGGG
ATAGGGCCCT TTCTGGGCGT TTTTCTTCAG CGTCACGGCT GGCAGACGGG ACCGATCGGA
ACCGTCATGA CCGTGGGAGG CGTTGCCGGA ATGCTCGCCA CCATTCCCGC GGGCGCGCTG
ATTGATCACA CGACGAAAAA GCGGTTGCTC GTCATTGTGG CAGCGCTCTG CACGATTTCC
GCGTCACTCC TTCTGCTAAG CTCGCAAGCG GTGCCGGTTG TGACGGTCAG TCAACTTGCA
ACCGCTCTGG CCGGAGCCGG GATTGGTCCT CTGATGGCGG CCATAACGCT CGGGATCGTG
CGCCAGAAAG GCTTCAACAC ACAAATTGGC CGTAATCAGG CCTGGAACCA CGCCGGCAAT
ATGGCCGGGG CCGGACTGTC TGGCTGGCTC GGTTGGCAGT TTGGCCTCTC AGCGATTTTC
TTTCTTGAAG TCGCCTTCGG TCTGTTCGCC ATTTCTGCGG TGCTCCTGAT CCCGGAAAAA
TCCATAGATC ATAAAGCTGC ACGCGGACTG GACGATGAAC CTGCTCACGA TGAGGGGACG
ACCGAGGGGC TACGATCCTT TCTGCGACAC AAGCCTCTTC TCATTCTGGC GAGTTGTTTG
TGTTTCTTCC ATCTCGGAAA TGCCGCGATG CTCCCGCTCT ACGGCATGGC GGTCGTCAGT
GCAGGCAAAG GTAATCCCGC CATGTTCACG GCGATGACTG TGATGGTCGC ACAGGCTGTG
ATGATCGTCG TGAGCCTGCT GGCCATACGT GTCGTCAGGA ACCGTGGGTA CTGGATCGTC
CTGCTGATAT CGTTTGCCGC CCTACCGCTG CGTGGTTTGA TCGCGGGAAG CTTCATCCAG
CATTGGGGGG TGTGGCCGGT GCAGATCCTC GATGGGATCG GTGCGGGGCT TCAGAGTGTC
GCCGTGCCGG GTCTGGTGGC CAGACTGCTG AACGGAACCG GACGCATCAA TATCGGACAG
GGTGTGGTCA TGACGGCGCA GGGCATTGGA GCAAGCCTTT CTCCGGCTCT GGGAGGATGG
CTTGCCGAAG ATCTGGGATA TCCGGTGGCG TTTTATAGTC TGGGCTGTTT TGCAATCTTG
TCACTGGGGC TCTGTATAGG CTCGGCATCG ATCCTACGCT CTGCCGATCA GGTGTCGGCA
TGA
 
Protein sequence
MTQPHTTAAC TGKTLAIPRL NRALFALCAL NFFMADVQAG IGPFLGVFLQ RHGWQTGPIG 
TVMTVGGVAG MLATIPAGAL IDHTTKKRLL VIVAALCTIS ASLLLLSSQA VPVVTVSQLA
TALAGAGIGP LMAAITLGIV RQKGFNTQIG RNQAWNHAGN MAGAGLSGWL GWQFGLSAIF
FLEVAFGLFA ISAVLLIPEK SIDHKAARGL DDEPAHDEGT TEGLRSFLRH KPLLILASCL
CFFHLGNAAM LPLYGMAVVS AGKGNPAMFT AMTVMVAQAV MIVVSLLAIR VVRNRGYWIV
LLISFAALPL RGLIAGSFIQ HWGVWPVQIL DGIGAGLQSV AVPGLVARLL NGTGRINIGQ
GVVMTAQGIG ASLSPALGGW LAEDLGYPVA FYSLGCFAIL SLGLCIGSAS ILRSADQVSA
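
The sequences above are printed in blocks of ten characters, so they need to be joined before any calculation such as GC content. The following is a minimal sketch of that cleanup, not the database's own method. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the coding-sequence check is a simplification that only accepts an ATG start codon together with the three stop codons of translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, table 11 stop codon."""
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# Demo input: only the first and last lines of the gene sequence above,
# joined as a truncated fragment for illustration (not the full gene).
first_line = "ATGACCCAGC CGCATACCAC TGCAGCCTGT ACTGGCAAAA CCCTCGCTAT TCCGAGGCTC"
last_line = "TGA"
demo = clean_sequence(first_line + "\n" + last_line)

print(len(demo), looks_like_cds(demo), round(gc_fraction(demo), 2))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field of the record.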