Gene Gdia_2013 details


Gene Information

Locus tag: Gdia_2013
Symbol:
ID: 6975439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2233636
End bp: 2235210
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 69%
IMG OID: 643391542
Product: protease Do
Protein accession: YP_002276388
Protein GI: 209544159
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DegQ family
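
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 2233636
END_BP = 2235210
GENE_LENGTH_BP = 1575
PROTEIN_LENGTH_AA = 524


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2235210 - 2233636 + 1 gives 1575 bp, and 1575 / 3 - 1 gives 524 aa, which agrees with the listed lengths.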


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.316391
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACG AGCTTCAGCC GATGTCTTCT CTCCGCGCCC GCCGGATGCG GGGCGGAATC 
CTGGCCGCGC TGGTTGCGGG AACCATGCTG GGGGGCGTGG CCGCCGACGG GCTGGTCCCC
GTGGCGCGGG CCGACGATAC CGGGGTGATC CGTCCCGATA CGCAGGTCCA GACCCTGCCG
AACTTCGTCA ACCTGGTGAA GCAGGTCCGG CCCGCCGTGG TGTCGATCAC CTCCAGCATC
CGTGCCGAGG ACCTGGGGGA CGAAGGCGGC GGCGGCGCCG AGGGGCAGCA GCAGATGCCC
TTCCCGTTCC CCTTCCCGTT CCAGATGATG CCGCAGCAGC AGCGCCGCAC GGTCGAGGCG
CGGGGATCGG GCTTCATCAT CTCGGCCGAC GGCTATGTCG TGACCAACAA CCATGTGGTC
AAGGGCGCGA CCAAGGTCAC GGTGACGCTG GATGACGGCA CCACCCTGCC GGCCAAGATC
GTCGGCCGCG ATTCCAAGAC CGACCTCGCG CTGCTGAAGG TCACGTCGCA GGGCAAGCTG
CGCTTCATCG AACTGGGCGA TTCCGACAAG GTCGAGCCCG GGGAATGGGT GGTCGCGGTC
GGCAATCCCT ACGGGCTGGG CGGCACGGTC ACCGCCGGCA TCGTCTCGGC GCGCGGGCGT
GACATCGGCG ACGGGCCGTA CGATTCGTTC ATCCAGGTCG ATGCCCCGAT CAATCGCGGC
AATTCCGGCG GCCCGCTCTT CACCCAGGAC GGCAAGGTCG TGGGCGTCAA TACCGCCATC
CTCTCGCCCT CGGGCGGGTC GATCGGCATC GGCTTCGCCA TCCCGTCCGA CGTGGTGAAG
AACGTCGTCT ACCAGTTGCA GAAGACCGGG CACGTCACCC GGGGTTACCT CGGCGTGGTC
GCGCAGGTGA TCACGCCCGC GATGGCCACG GCGCTGGGCC TGAAGCCCGC GGCGCCCGGC
GCGCCGCCCA GCGGCGCCCT GGTCGCCAGT GTCAGCAACG GCAGCCCGGC CGAAAAGGCG
GGGATCAAGG CCGGGGACGT GATCACCACC CTGAACGGGC AGAAGATCGA CAGCCCGCAT
GATCTGGCGG TCAAGGTGGC CTCGATCGTG CCGGGCAGCA AGGCGGCGGT GAACTATATG
CGCGGCACGG CCGCGCAGAG CACGACGGTC ACGATCGCCA ACCTCTCCGG CGCTCCGTCG
CCCGACGGCG CGGTCGGGGA CAGGAACGAC GGCGGTCCGC GCCTGGGCGT CTCGCTGTCG
CCCCTGACGT CGGACCTGCG CCAGCAACTG GGCCTGGACG GGTCGGTGCG CGGCGTCGTC
GTCAGCGACG TCCAGTCGGG TTCGGCGGCG GAACAGGCCG GAATCCACGC GGGCGACGTG
ATCCAGGCGG TGGGCAACAA GCCGGTGGAA AACCCCGGCG CTACCGTCAC CGCCGTCCGC
GCGGCGCTGA AATCCAACCA GTCGGTCCTG CTGCGCATCC TGCGCAACGG GCAGAACATC
TTCGTCGCCG TCACGCCGGG CTCGGATGAG GGCGACAGCG GCAATGGCAA CAGCGACCCC
GACGGCAACG ACTGA
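
The GC content reported in the record can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it strips the 10-base block spacing and counts G and C bases. The helper names are hypothetical, and the usage example includes only the first row of the listing, so it does not by itself reproduce the 69% figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence only; paste the full listing to check the record.
    first_row = "ATGTCTGACG AGCTTCAGCC GATGTCTTCT CTCCGCGCCC GCCGGATGCG GGGCGGAATC"
    seq = clean_sequence(first_row)
    print(len(seq), round(gc_percent(seq), 1))
```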
 
Protein sequence
MSDELQPMSS LRARRMRGGI LAALVAGTML GGVAADGLVP VARADDTGVI RPDTQVQTLP 
NFVNLVKQVR PAVVSITSSI RAEDLGDEGG GGAEGQQQMP FPFPFPFQMM PQQQRRTVEA
RGSGFIISAD GYVVTNNHVV KGATKVTVTL DDGTTLPAKI VGRDSKTDLA LLKVTSQGKL
RFIELGDSDK VEPGEWVVAV GNPYGLGGTV TAGIVSARGR DIGDGPYDSF IQVDAPINRG
NSGGPLFTQD GKVVGVNTAI LSPSGGSIGI GFAIPSDVVK NVVYQLQKTG HVTRGYLGVV
AQVITPAMAT ALGLKPAAPG APPSGALVAS VSNGSPAEKA GIKAGDVITT LNGQKIDSPH
DLAVKVASIV PGSKAAVNYM RGTAAQSTTV TIANLSGAPS PDGAVGDRND GGPRLGVSLS
PLTSDLRQQL GLDGSVRGVV VSDVQSGSAA EQAGIHAGDV IQAVGNKPVE NPGATVTAVR
AALKSNQSVL LRILRNGQNI FVAVTPGSDE GDSGNGNSDP DGND
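
The protein sequence can be checked against the gene sequence by translating it codon by codon. This sketch builds the codon table from the standard NCBI ordering of bases (TCAG) and amino acids; translation table 11 uses the same codon assignments as the standard code and differs in which start codons it permits. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The `translate` helper is illustrative, not part of any library.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First row of the gene sequence; compare with the start of the protein listing.
    print(translate("ATGTCTGACG AGCTTCAGCC GATGTCTTCT CTCCGCGCCC GCCGGATGCG GGGCGGAATC"))
    # Last row of the gene sequence, ending in the TGA stop codon.
    print(translate("GACGGCAACG ACTGA"))
```

The first call should reproduce the opening residues of the protein listing (MSDELQPMSS LRARRMRGGI) and the second its last four residues (DGND).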