Gene GM21_3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3233 
Symbol 
ID8138585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3751792 
End bp3753009 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content64% 
IMG OID644870837 
Productpeptidase U32 
Protein accessionYP_003023017 
Protein GI253701828 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTT CCGTCGCCAC AAACTTCCAG CCCGACCTGA TTCCCGCCAT CAAGGGATAT 
CCGGTCGCCG AACTTTTCGG CAAGCTACCC TCCGACAGCG TGGGGGGCGG CCGCGCCTCG
TTCATGCTGG CCCCCCTGGG ACAGGAGCAG TTCCGGGCCC ACGTAAGGGA AGCGGGGAAG
AACGGGGTCG GCTTCAACTA CCTGATCAAC CCAGCCTGCA TGGATAACCG CGAGTTCACG
CGCCAAGGGC AGGCGGCTCT CGACCAGCTC CTCGATTTCG TGGACGGCTG CGGCGTCACC
GCGGTCACCG TCTCGCTTCC TTTCCTGCTG CCGATCATCA AGAAACGGCA TCCGCGGCTC
AAGGTGCGGG TCGGTGTCTA TGCCCGCGTC GACTGCGTGG CGAAGGCCCG CTTTTGGGAG
GATCTCGGGG CGGACTGCGT CACGCTGGAA TCAATCGCCA TCAATCGCGA CTTCGGCATG
CTGCAGGCGA TACGTCAGGC GGTGCAACTG GAGCTGCAAC TCATCGCCAA CTCCAACTGC
ATGATCTTCT GCCCGCTCTC GGGGCAGCAC ATGGTGAACC TCTCGCACGC CTCGCAAAAG
GGGCACGCCA GCCGCGGCTT CATGATCGAT TACTGCGCGC TCAGGTGCTC TGCGCAGAAA
CTGGCCGACC CTTCCCTGTA CCTTCGTTCC GAGTTCATCC GGCCGGAGGA TCTGGGGAGC
TACACCGAAC TTGGCTTTAA CTCCTTCAAG ATACTTGAGC GCGGCGCACC GACCCCGGTC
CTCGTCGAGC GGGTCCGCGC CTACAGCGAA GGAAGGTTCT CGGGAAACCT GCTGGACCTG
ATCCAGCCCT ACGGCTACAA GCGCACCCCG GGCAAGGTGA AGGGGCGGTT AAGCGGCCTG
CGCAGGTTTG CCCGGTACTT CCTGCGCCCC GGCGTGATCA ATCTCGCCGG GTTGGTCCGG
CTGAAACGGC TGGCGGAGAA GCGGGGGCTC ATCGAGGAGC TGGAAGGAGC CCCGGTCTAC
CTGGACAACG CCAAACTGGA CGGGTTCCTG GCGGGCTTCC GGGCGATCGA CTGCCGGAAG
ACGGACTGCG CACGCTGCGG CTACTGCGCC GCCTGGACCG AGAAGGCGCT GCGGCTGGAC
CCTGTCTACC GCGACGAGAT GCTGCGACTG TACCGCGACG CCTTCGATGA GATGTATTCC
GGCGAATTAT GGGACTGA
 
Protein sequence
MKFSVATNFQ PDLIPAIKGY PVAELFGKLP SDSVGGGRAS FMLAPLGQEQ FRAHVREAGK 
NGVGFNYLIN PACMDNREFT RQGQAALDQL LDFVDGCGVT AVTVSLPFLL PIIKKRHPRL
KVRVGVYARV DCVAKARFWE DLGADCVTLE SIAINRDFGM LQAIRQAVQL ELQLIANSNC
MIFCPLSGQH MVNLSHASQK GHASRGFMID YCALRCSAQK LADPSLYLRS EFIRPEDLGS
YTELGFNSFK ILERGAPTPV LVERVRAYSE GRFSGNLLDL IQPYGYKRTP GKVKGRLSGL
RRFARYFLRP GVINLAGLVR LKRLAEKRGL IEELEGAPVY LDNAKLDGFL AGFRAIDCRK
TDCARCGYCA AWTEKALRLD PVYRDEMLRL YRDAFDEMYS GELWD