Gene GM21_1150 details

Gene Information

Locus tag: GM21_1150
Symbol:
ID: 8136472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1340345
End bp: 1341226
Gene Length: 882 bp
Protein Length: 293 aa
Translation table: 11
GC content: 62%
IMG OID: 644868761
Product: signal peptide peptidase SppA, 36K type
Protein accession: YP_003020969
Protein GI: 253699780
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
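
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1340345
END_BP = 1341226
GENE_LENGTH_BP = 882
PROTEIN_LENGTH_AA = 293


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1341226 - 1340345 + 1 gives 882 bp, and 882 / 3 - 1 gives 293 aa, matching the listed values.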


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 98
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGG GATGCATGCT CTATGGCTCC TTGTTCATAG TAGGGATGTT GCTTCTTTTC 
CTTGCCTGCG TCGGCATCGT CAAAGCGTTA TTGAACGACG GGGATAGCTT GAAGGGCGAC
GGGGTAGGCC TGGTCGAACT GAAGGGGCCG ATCATCGACG GGCAGGAGAC GGTGCGGCAG
TTGCGCGAGT TGAAAAAGGA CAAGCGGGTG AAGGCGGTTG TTTTGCGCAT CGACTCCCCG
GGCGGGGTCG TCGGCCCCTC CCAGGAGATC CACGCCGCCG TCAAGGGGGT GGCGAAGGTG
AAGAAGGTGG TGGTGTCCAT GGGGAGCGTC GCCGCTTCGG GAGGGTACTA CGCGGCTGCG
CCGGCCACCC TCATCTACGC CAACCCCGGC ACCATCACCG GGAGCATCGG CGTGCTGATG
AAGTTTTCCA ACATCGAGGG GCTGATGGAC AAGGTGGGGC TGAAGGCCTT CACCATCAAG
ACCGGCAAGT TCAAGGACGT AGGCTCCCCG GCCCGCACCA TGAGCGACGA GGAGAGGGGG
ATGCTGCAGG GGGTGATCGA CAGCACGCAC CAGCAGTTCA TAAGGGCGGT CGCCGAGGGG
AGGAAGCTTC CGGTCGAACA GGTGCGCGCC ATCGCCGACG GCAGGATCTT TTCCGGAGAG
CAGGCGCTGG CTGCAAAGCT CGTGGACCGG ATCGGCACCC TGCAGGACGC GGTCGAGGAA
GCGGGGAGGC TTGGGGGTGT CAAGGGGGAG CCCGAGTTGA TACGCCCCCC CAGGAAGAAG
ACAAGAATTT TCGGTGTTCT GAGTGAGCGG GCCGAGCAGC ATCTGGAACA GTTTTCCGGC
TCAGACAGTG GCGTCAGTCT CGACTACAAG TTGGGTTGGT AA
 
Protein sequence
MKKGCMLYGS LFIVGMLLLF LACVGIVKAL LNDGDSLKGD GVGLVELKGP IIDGQETVRQ 
LRELKKDKRV KAVVLRIDSP GGVVGPSQEI HAAVKGVAKV KKVVVSMGSV AASGGYYAAA
PATLIYANPG TITGSIGVLM KFSNIEGLMD KVGLKAFTIK TGKFKDVGSP ARTMSDEERG
MLQGVIDSTH QQFIRAVAEG RKLPVEQVRA IADGRIFSGE QALAAKLVDR IGTLQDAVEE
AGRLGGVKGE PELIRPPRKK TRIFGVLSER AEQHLEQFSG SDSGVSLDYK LGW
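
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative; only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base grouping used in the
    record can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and the first 20 residues of the protein.
first_row = "ATGAAAAAGG GATGCATGCT CTATGGCTCC TTGTTCATAG TAGGGATGTT GCTTCTTTTC"
print(translate(first_row) == "MKKGCMLYGSLFIVGMLLLF")
```

Passing the full gene sequence to `translate` should give the complete protein, with the final TAA codon acting as the stop.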