Gene GM21_1526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1526 
Symbol 
ID8136855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1788683 
End bp1789921 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content62% 
IMG OID644869138 
Productpeptidase U32 
Protein accessionYP_003021340 
Protein GI253700151 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.31073e-27 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGGGCAGA AGCCGCGCCA AGGACGGGTG AAGCCCGAGC TTTTGGCGCC TGCGGGGAAC 
ATGGAGAAAC TGAAGGTCGC CATCCGTTAT GGCGCCGACG CGGTGTACCT GGGGGGCAAA
TCGTTTGGCC TTAGAAACCT GGCCGGCAAC TTCAGCCTCC CGGAGCTCTC GCACGCAGTC
GACTACGCGC ACCGGCACGG CGTCAAGGTG TATCTGACCG TGAACGCCTT CGCCGACAAC
CGCGATTTGA TCGAGCTGGA GCGCTACCTG GAGGAGATCC GCGAGATCCC CTTCGACGCC
CTGATCGCCG CCGACCCCGG CGTCGTTGCG CTGATAGCCG AGCGCTGCCC CGGCCGCGAC
ATCCACCTTT CCACCCAGGC CAACACCACC AACTGGCGCT CCGCCCGCTT CTGGCAGGCG
CAGGGGGTGA AACGGGTGAA TCTCGCCCGT GAGATGTCGC TGGAGCAGAT TGCCGAGACG
GCGGGGATGT GCAACGAAAT CGAGCTTGAA GTCTTCGTCC ACGGCGCCAT GTGCATCTCG
TATTCGGGGC GCTGCCTCCT CTCCCTTGCC ATGACCGGCA GGGACGCCAA CAAGGGAGAG
TGCACCCAGC CCTGCCGCTG GAACTACGCC ATCGTAGAGG AGAGCCGTCC CGGGGAGTAT
TTCCCCATCC ACGAGGACGA AAGCGGGAGC TTCATCTTCA ACTCGAAGGA TCTCTGCCTG
ATCGAGCAGC TTCCGGATCT GGTGGAGAGC GGCGTCCACT CCTTGAAGAT AGAGGGGAGG
ATGAAAGGGA TCTACTACGC GGCCAGCGTG ATCCGCATCT ACCGCGAGGC GCTGGACAGC
TACTGGGAGG ACCCGGTGAA CTACCGGTTG AACCCGGCGT GGCTGGAGGA GCTAAGCAAG
ATCAGCCACC GCGGGTACAC CACCGGGTTT TTGTTGGGCA AGCCGCGCGA CGTGGACCAC
GAGTACCTCT CGCGTTACGT GAGGAATTTC GAGTTTGTGG CGCTGGTCGA GGGGGAAGCT
AAAGGGGGAG GCACCCTGGT TGCGGTGAGG AACAGGTTGC AGTTGGGCGA CGCGTTGGAA
CTGATCGGTC AAGGCACGTG CTTCACCAGA TTCATATTGG AGTCGATGGA AGACGAGGAC
GGCGTCCCGC TCCAGGTAGC CCATCCAAAC CAGCGGGTAG TACTGAAAGA ACTCACCGGT
GCAGGAGAGT ACGATCTGAT CAGGAGAGAA AAAACTTGA
 
Protein sequence
MGQKPRQGRV KPELLAPAGN MEKLKVAIRY GADAVYLGGK SFGLRNLAGN FSLPELSHAV 
DYAHRHGVKV YLTVNAFADN RDLIELERYL EEIREIPFDA LIAADPGVVA LIAERCPGRD
IHLSTQANTT NWRSARFWQA QGVKRVNLAR EMSLEQIAET AGMCNEIELE VFVHGAMCIS
YSGRCLLSLA MTGRDANKGE CTQPCRWNYA IVEESRPGEY FPIHEDESGS FIFNSKDLCL
IEQLPDLVES GVHSLKIEGR MKGIYYAASV IRIYREALDS YWEDPVNYRL NPAWLEELSK
ISHRGYTTGF LLGKPRDVDH EYLSRYVRNF EFVALVEGEA KGGGTLVAVR NRLQLGDALE
LIGQGTCFTR FILESMEDED GVPLQVAHPN QRVVLKELTG AGEYDLIRRE KT