Gene Gdia_3198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3198 
Symbol 
ID6976638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3503081 
End bp3504022 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content68% 
IMG OID643392711 
Productsignal peptide peptidase SppA, 36K type 
Protein accessionYP_002277543 
Protein GI209545314 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.216895 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACTG ATCCCGACCT GACGGTCGAC CGGCTGCGAC TGCGGCGTCG GCTGGTATTC 
TGGCGGATCG CCGCCGTTGC CTCGTTCGTG CTGGCCCTGG TCGGCGTGGG CGGTCATTCG
CTGATGCGCA GCGGCGGTTT CGCGTCGCGC GATCATCTGG TGCGCGTGCG CGTGGCGGGC
ATCATCGGAT CGGACAATCG CAAGCTGATC GACCTGGTCG ACAAGGCGGC GAAAACCGAT
TCGGTACGCG GCATGATCCT GGCCGTGGAC AGCCCCGGCG GATCGGTCAG CGGGGGCGAG
GCACTGCATG ACGCGGTGGC CCGTTTCGCG GCCCGCAAGC CGGTGGCGGT GACGATGGGG
GGCCTGGCCG CTTCGGCGGG GTACATGATT TCGGTGCCGG CGCAGCGCGT GTTCGCGGTC
CAGTCCACCC TGACGGGGTC GATCGGCGTG ATCATGGAAG CACCGGACGT GTCGGGCCTG
CTGGACCGGG TAGGGGTCAA GGTCGATCAA CTGGTCTCCG GCCCGATGAA GGGCCAGCCT
TCGGGAACGC AGCCCCTGTC GCCGGAAGGG CGGCAGATGC TGCAGGGGGT GGTGGCGGAC
CTGTTCGACC AGTTCGTCAC CATGGTGGCC GACGGGCGGC ATATGCCGGT CGAACGGGTG
CGGACCCTGG CCGACGGCCG GCCCTATACC GGCCGGCAGG CGCTGTCGCT GGGACTGATC
GACCAGATCG GGGACGAGCG CGACGCCAAG GCGTGGCTGA CCAGTACCCG GCACCTGAGC
GGGACGATTC CGGTCGTGGA CCTGAAAGTG ACGACCGGGC AGGGCTGGAT GCACCGGATC
ACGCGCAGCA TGCTGGGCGT CGTTTTCGGT GACGAGTGGG CAGGAAGCGT GCTTTCGCAA
GGCGTTGCGC TTGACGGGGC TGTTGCGATC TGGAAACCTT GA
 
Protein sequence
MATDPDLTVD RLRLRRRLVF WRIAAVASFV LALVGVGGHS LMRSGGFASR DHLVRVRVAG 
IIGSDNRKLI DLVDKAAKTD SVRGMILAVD SPGGSVSGGE ALHDAVARFA ARKPVAVTMG
GLAASAGYMI SVPAQRVFAV QSTLTGSIGV IMEAPDVSGL LDRVGVKVDQ LVSGPMKGQP
SGTQPLSPEG RQMLQGVVAD LFDQFVTMVA DGRHMPVERV RTLADGRPYT GRQALSLGLI
DQIGDERDAK AWLTSTRHLS GTIPVVDLKV TTGQGWMHRI TRSMLGVVFG DEWAGSVLSQ
GVALDGAVAI WKP