Gene Gmet_0888 details

Gene Information

Locus tag: Gmet_0888
Symbol:
ID: 3738847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 978332
End bp: 979543
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 65%
IMG OID: 637778167
Product: peptidase U32
Protein accession: YP_383855
Protein GI: 78222108
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
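
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and do not come from the source database.

```python
# Values copied from the record; everything else is an illustrative assumption.
START_BP = 978332
END_BP = 979543


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                   # 1212
print(protein_length(gene_length(START_BP, END_BP)))   # 403
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.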


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.0335605
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0000000000241821
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGATTC CCGAGCTCCT CGCGCCGGCC GGCAACCTGG AAAAGCTGAA GGTGGCCATC 
CACTACGGCG CCGACGCCGT CTACCTGGGA GGGGAGAAGT TCGGGCTCCG CAGCCTCGCC
GACAACTTCA CCCTGGCCCA CATGGCCGAG GGGATTGCCT ATGCCCACGA CCGGGGGGTA
AAGGTCTACC TGACGGTCAA CGCCTTTCCC GACAACAGCG AACTGGAGGA CCTGGACCGC
TACCTGGAGG GGGTTGCTAC AGTCCCCTTC GACGCTTACA TCGCCGCCGA CCCGGGGGTG
ATCGCCGCCA TCCGCCGCAT ATCGCCAGAT CGTCCCATCC ACCTTTCCAC CCAGGCCAAC
ACCACCAACT GGCGCAGCGT CCTCTTCTGG CAGAAGCAGG GGATCGCCCG GGTGAACCTG
GCCCGGGAAA TGTCCCTGGA TGCGATCCGC GAGACCCGGG AGCGGGTCAC GGCCGAACTG
GAAGTCTTCG TCCACGGCGC CCTCTGCGTC TCCTACTCGG GTCGCTGCCT CCTCTCCAGC
GTCATGACCG GCCGCAACGC CAACAAGGGG GAGTGCGCCC ACCCCTGCCG CTGGAGCTAC
GCCCTGGTGG AAGAGACCCG ACCGGGCGAG TACTTCCCCG TGGTGGAGGA CGAGCGGGGG
ACCTTCATCT TCAACTCCAA GGATCTCTGC CTCATCCGCC ATATCCCTGA ACTGGTGGGG
GCCGGCGCCG ATTCCCTCAA AATAGAGGGG CGCATGAAGG GAATCCACTA CGTGGCCTCG
GTGGTGCGAG TCTACCGGGA GGCCCTGGAC AGTTATGCCG CCGATCCTCG CGCCTGGCAC
ATGCAGTCCG AGTGGCTCGA AGAGCTCTCC AAGATCAGCC ACCGGGGGTA CACCACCGGC
TTCTTCCTGG GAAAACCAGT GGATGTGGAC CTGGAATTCG ACTCCCGCTA CCGGCGCAGC
CACGAATTCG TCGGCGTGGT GGAAGAGGCG CACCCCGACG GCACCGTTAC CGTGGAAGTC
CGCAACCGGA TCGTGGCAGG GACCACGGTG GAGGTCATCG GCCGGCGGAT GCGCTCCACC
CTCCACCGGC TCGACGCGTT CACCGACATG GATGGCAACA GCCTCTCCGA GGCCCATCCG
AACCAGCGGA TCCGCGTGAG TCTTCCCGTA GCGGCGGAAC GCTACGACCT TATCCGGCGG
GAAAAGCCAT AG
 
Protein sequence
MKIPELLAPA GNLEKLKVAI HYGADAVYLG GEKFGLRSLA DNFTLAHMAE GIAYAHDRGV 
KVYLTVNAFP DNSELEDLDR YLEGVATVPF DAYIAADPGV IAAIRRISPD RPIHLSTQAN
TTNWRSVLFW QKQGIARVNL AREMSLDAIR ETRERVTAEL EVFVHGALCV SYSGRCLLSS
VMTGRNANKG ECAHPCRWSY ALVEETRPGE YFPVVEDERG TFIFNSKDLC LIRHIPELVG
AGADSLKIEG RMKGIHYVAS VVRVYREALD SYAADPRAWH MQSEWLEELS KISHRGYTTG
FFLGKPVDVD LEFDSRYRRS HEFVGVVEEA HPDGTVTVEV RNRIVAGTTV EVIGRRMRST
LHRLDAFTDM DGNSLSEAHP NQRIRVSLPV AAERYDLIRR EKP
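
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be cleaned before use. The following is a minimal sketch of one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is included here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from the record, used as sample input.
first_line = "ATGAAGATTC CCGAGCTCCT CGCGCCGGCC GGCAACCTGG AAAAGCTGAA GGTGGCCATC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the reported GC content, bearing in mind that the record rounds to a whole percent.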