Gene Gdia_1820 details


Gene Information

Locus tag: Gdia_1820
Symbol:
ID: 6975242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2018106
End bp: 2019479
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 63%
IMG OID: 643391345
Product: putative transporter protein
Protein accession: YP_002276195
Protein GI: 209543966
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
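
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2018106
end_bp = 2019479
gene_length_bp = 1374
protein_length_aa = 457

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1


def record_is_consistent():
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )


if __name__ == "__main__":
    print("span (bp):", span_bp)
    print("expected protein length (aa):", expected_protein_aa)
    print("consistent:", record_is_consistent())
```

Under these assumptions the span works out to 1374 bp and the coding region to 458 codons, one of which is the stop codon, matching the 457 aa reported.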


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.398976
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGTTT CCCCCTGGCT GGCCGTCACC CTGACGTTGC GCCGTTTTTC CCTGGCGGTC 
CCCTGGCTGG CCGTGCTGTC CTCCAGCCTG ATTCCGCTGA CCGGCAATCT GACGCTCCTT
TACATCCTGC GGATCTTCCA GGGGCTGTCC GGTGGGTTTA CCGTTCCGCT GCTGATGACG
ACGGCCCTGC GGGTGCTGCC GCCGCCCATC CGGCTTTACG GCCTGGCGGC CTATGCCCTG
ACCGCCACCT TTTTTCCCAG CCTGAGCACG GCCTTTGCGG GCCTGTGGAC CGATCTGGTC
GATTGGCGCT TCGTGTTCTG GCAGTCCATT CCGCTCTGCA CGATCGCCTT CGTCCTGCTC
TGGTATGGAA TGCCCGTCGA ACAACCCCGG CACGAGCGCT TCTGGCGTTT CGACTGGCAG
GGGTTCCTGC TTGTCCTGAT CGGAATGGGC GCGTTTTCGA CCATGCTGCA AATGGGCGAC
TGGATGGACT GGTTCAACAG TCCGGCCATC TGCGTCATGG CGTTGCTCAG TGGGGTCTGC
ATTCCGCTTT TCGTGCTGAA CGAATGGTTC CATCCGTTGC CCCTGTTCAA GTTCCAGCTT
CTGGAACGGC GGAATTTCGC CTACGGCGCC AGCACCCTGC TGACATTCAT GATCATCTCG
CTGTCATCTT CCGCCCTGCC GGCAGATTTC CTGCGTGAAG CCGCCGGATA CAGGCCGGAG
CAGACCTATC CCATCACCCT GGAAATCGCG GCGATCCAGA TCGTCATGCT GCCGCTGATG
GCGGTGCTGC TGAACCGGAA AGGGGTGGAT TCCCGTATCG TCAGCCTTAT CGGCATGGCC
TGCATCCTGA CGGCCTGCAT CGGGGACTTC TTCGTGACGT CCAACTGGAA CCGGGACCAG
TTCTATCTGT GGCAGGCGTT TCAGGGCGTC GGCAACGCCA TGATCGTCAT GCCGTTGCTG
ATGATGTCCA CCAATGCCCT TGTCCCCGAG GAAGGGCCCT TTGCCTCCGC CATGGTCAAC
ACGCCCCGCG CCGTGGCCGA GGCCGTGGGG ATCTGGCTGA TCCAGCTTGT CCATCGCGAG
CGCGGCGCGC TGCATTCCGA CCGTATCACC GACCGGCTCG GCCAGGATCG GTTCCAGCTT
GTCCAGGGCA TGAATCCGGT GCTCCAGCGA CCCGCCGCGC TGACGCCGGA CGGGCTGCCG
GCCTTTCCCG GCAGCATGAC CGCCCTGCAC GCGCAGGTCA CGCGACAGAC CGCGACCCTG
ACCTACAGCG ACGACTGGCT CATCATTGCC GGGATCGTGG TGTGCCTGAT GGTGTGGGTT
TGTGTCCTTC CGGTCCGAAC CTATCCCCCC CGTATCGTGT TCCAGTCGAA ATAG
 
Protein sequence
MSVSPWLAVT LTLRRFSLAV PWLAVLSSSL IPLTGNLTLL YILRIFQGLS GGFTVPLLMT 
TALRVLPPPI RLYGLAAYAL TATFFPSLST AFAGLWTDLV DWRFVFWQSI PLCTIAFVLL
WYGMPVEQPR HERFWRFDWQ GFLLVLIGMG AFSTMLQMGD WMDWFNSPAI CVMALLSGVC
IPLFVLNEWF HPLPLFKFQL LERRNFAYGA STLLTFMIIS LSSSALPADF LREAAGYRPE
QTYPITLEIA AIQIVMLPLM AVLLNRKGVD SRIVSLIGMA CILTACIGDF FVTSNWNRDQ
FYLWQAFQGV GNAMIVMPLL MMSTNALVPE EGPFASAMVN TPRAVAEAVG IWLIQLVHRE
RGALHSDRIT DRLGQDRFQL VQGMNPVLQR PAALTPDGLP AFPGSMTALH AQVTRQTATL
TYSDDWLIIA GIVVCLMVWV CVLPVRTYPP RIVFQSK