Gene Gdia_3390 details


Gene Information

Locus tag: Gdia_3390
Symbol:
ID: 6976836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3714261
End bp: 3715214
Gene Length: 954 bp
Protein Length: 317 aa
Translation table: 11
GC content: 68%
IMG OID: 643392906
Product: Mammalian cell entry related domain protein
Protein accession: YP_002277731
Protein GI: 209545502
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID:
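
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3714261, 3715214)
print(length, protein_length(length))  # 954 317
```

Both values match the Gene Length and Protein Length fields above.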


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0881876
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCAGG ATCAAAAGAC CGACCTGGCG CGCCAGTTGG TGCGCGTGCG CTATGCCGAC 
GAATGGGTCG GCGTGCTGGT CCTGCTGTCG CTGGTGATCT GCTTCGCCGC GATCGTCGAG
GCCGGGGTCC TGCGCGACTG GCTGACACCC GCCGGCCGGC TGCAGATCGT CCTGCCCGAC
GGCGGGGTCA GCGGCCTGTC GGTGGGTAAC GATATCGAGG TGCTGGGCAT CCATGCCGGC
ACGATCCGGC GCATCCGCAT CAATCCCTCG GGCGGGATGT TCGCGGTGGC CGATATCGAC
CCGGATATCG AACCCTATAT CCGCCGTGAC AGCACGGCCA CCATACGCCG GCGCTTCGTC
GTGGCGGGGG CGGATTACAT CGACATCTCG CGCGGCACCG GCACGCCGAT GGACTGGCAT
TACGCGGTGC TGACCGCGCA CAGCGCCCCC AACCCCGCCG ACATGATCAC CCAGACCTTC
GCCGATATCA GGGCGCGCAT CCTGCCGGTG CTGGACAGCT CCCAGCACAT GATGTCGCAG
CTCGACGCCA CGATCACCGA CATGCATTCC GGCAAGGGCA CGGTGGGGCG CCTGATGACC
AGCGACGACC TGATCCGCCA GTCGGAAAAG ATGGTCGCCT CGCTCAATAC CGCCATCGCC
CAGTTGACCC CGGTGGAAAA GCGGCTGTCG GCGGTGATGG CCAAGGCGGA CAGTTCCATG
GCCAATGTCC GTGCGTCCAC CGACGATCTG CGCAAGGCGA CGCCGCGCCT GCCGGCGATC
ACCCGCGACC TGCAGGAGAC TTCGGCCGAA CTGCCGGTCC TGCTGACCCA GGCGCAGGTC
ACGGCGGCCA GCCTGCAGAA GCTGACCGAC CAGCTTCGCG GCCTGTGGCT GCTGGGCGGC
GGGGGCACGC CCGCGCCACA GCGGCGCCTG CCGGCCGCGA GAATCCAGCC ATGA
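
The gene sequence is printed in groups of ten bases, so it needs to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names, not part of the record) strip the grouping whitespace and compute the GC fraction of a nucleotide string; applied to the full sequence, the result can be compared with the GC content field.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGGCCCAGG ATCAAAAGAC CGACCTGGCG CGCCAGTTGG TGCGCGTGCG CTATGCCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```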
 
Protein sequence
MAQDQKTDLA RQLVRVRYAD EWVGVLVLLS LVICFAAIVE AGVLRDWLTP AGRLQIVLPD 
GGVSGLSVGN DIEVLGIHAG TIRRIRINPS GGMFAVADID PDIEPYIRRD STATIRRRFV
VAGADYIDIS RGTGTPMDWH YAVLTAHSAP NPADMITQTF ADIRARILPV LDSSQHMMSQ
LDATITDMHS GKGTVGRLMT SDDLIRQSEK MVASLNTAIA QLTPVEKRLS AVMAKADSSM
ANVRASTDDL RKATPRLPAI TRDLQETSAE LPVLLTQAQV TAASLQKLTD QLRGLWLLGG
GGTPAPQRRL PAARIQP
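
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as starts); the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; any codon with a non-ACGT base raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCCCAGG ATCAAAAGAC CGACCTGGCG"))  # MAQDQKTDLA
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence should stop at the terminal TGA codon.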