Gene Gmet_0040 details


Gene Information

Locus tag: Gmet_0040
Symbol:
ID: 3739617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 51627
End bp: 53999
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 68%
IMG OID: 637777319
Product: peptidase U32
Protein accession: YP_383015
Protein GI: 78221268
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
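
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(51627, 53999)
print(length, protein_length(length))  # 2373 790
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields.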


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAAAC CCTACGAACA AAAGAAAAAA CCGGAACTTC TCGCCCCGGC CGGCTCCCTG 
GAGGCCTTCT TCGCCGCCAT GGAGAAGGGG GCTGACGCGG TCTACGCGGG GCTCCGGGAG
TTCTCAGCCC GAGCCAAGGC GAAAAACTTT CCCCTCACCC AGATGGAGCG GATGGTGGCC
TACGCCCACG GCCTCGGCCG CAAGGTCTAC ATCACCCTCA ACACCCTGGT GAAGGAGGGG
GAACTGCCGC AGCTCGTGGA GACCCTCGCC GCCCTGGAGG CCATGGGGGC CGATGCCGTC
ATCCTCCAGG ACCTGGGGGT TGCCCGTCTC ATCCGGGACC ACTTTCCGGG CCTTCCCCGC
CATGCCTCCA CCCAGATGAC CATCCACAAC CTCCCCGGCG CCCGGATGCT CAGGGATATG
GGGTTCGAGC GGGTGGTCCT GGCACGGGAG CTCCACCTCG ACGACATCCG CCATATCAGC
CGGGAGTCGG GGGTGGAGAT CGAGTGCTTC ATCCACGGCG CCCTCTGCTT CGCCATCTCG
GGGCAGTGCT ACTTCTCGTC ATTCCTCGGG GGGCACAGCG GCAACCGGGG GCGCTGCGCC
CAGCCCTGCC GCCGCCACTA CCGCTATCGA CAGAACGAGG GGTACTTCTT CTCCACCAAC
GACCTCTCCA CCGTGGACCT CATCCCCGAC CTGTCCGCCG CTGGTGTGGC GTCCCTGAAG
ATCGAGGGGC GGATGAAGTC GGCCGAGTAC GTGGCAAGCG TGGTGGAGGC CTACCGCCTC
GTCCTCGACG CCCCGGAGCG CAAGCGGGCC GAGGCGGCGG CCCGGGCCAA GGAGATCCTC
AAGTTCTCCT TCGGCCGGGT CCCCACCAAG GGGTTCATGG CCTCCCGCAC CCCCACCGAC
ATCGCCATCC CCACCCTCCG GGGGGCTACG GGGCGCTTTC TCGGGGAAGT GAAGAGCGTG
CGGGGGGACC GGCTCACCTT CGAGACGAAA GACCGCCTCT TCGTGGGGGA CCGGGTCCGG
GTGCAGCCCA AGAGCGACAT GGCCGGCAAG GCCTTCACGG TGAAGGATAT CTTTGTGGGG
AAGGAGCGGG TGAAGTCGGC GAAGGAACGG AGCATCGTCA CGGTGGTGTC CCCTTTCGCC
TTCAAGGCGG GGGACGCGGT CTTCAAGGTC TCCTCCGAGA CCGCCTTCAC CATGAGCGAG
AACGCCTGCC TGAAACGCCT CGACGCGGTC AAGCCGGGGA AGATCCCTTG TGCCCTTGAG
CTTTCCCTTG CCGGGGAGAC CCTCCGCATC GGCGCTCAGG CCGCCGGTGC CGAATTTGCC
GCCGAGTTTC CGGTCGGTCT TCTGGAGCCC TCCACCACCA GCGACATGGC GGGGGTCCTT
CGGGCCCAGT TCTCCCGCAC CGGCGAGACC CCCTTCGAGC TTCTGGGTCT TGACGCTCCG
GGCTTCCCCT CGGTCCTCAT CCCGCCGGCT CGGCTCAAGG ATATCCGTCG CGACTTCTAT
CGCCGACTGG GCGAGAAGGT GGCGGCCGGC GTTGCCCAGC GCCGGGCCGA AGCGCGGAAA
CGGGCACTGG CCTCCCTGGT CCCGCCGGGG CACCCGCGTC GTGAGGTCAA GGGCGAGGTG
ACGGTCCGGA TCGAGCACCT GCGCGATACC GCAATCCTTC GCCAGCAGGG GGTAGACGCC
ATCATCCTTC CGGTCTCCCG GGCCAACATC CACCAGGTTC CCCTTGCGGC GCGCAAGCTC
CGGGGGGATG AGGGGCGGAT CATCTGGCAC CTCCCCTTCG TCATCTTCGA CGCGGACCTC
CCCTTCTACC AGGAGGCGGT CGCCCTCCTC ACGGGGCAGG GGTTCCGCCG CTTCGAGCTC
TCGAACCTCT CCCACTTCCC CCTCCTTGCA GGACGGGACG TGGAGCTCTC CACCGACTAC
CGCCTCTTCT CCCTCAACAC CCAGGCCATC ATGGCGTGGC ACGAGCTGGG GGTGACCACT
TCCACCCTCT ACATCGAGGA CGACGTGGAG AACATGGCGG TACTCCTCGG CGCGCCGGTG
CCGGTGCGAC GGCGGGTCCT GGTCTACGGC GGGGTGCCGG CCATGACCAC CCGCGTCGCC
ATCAAGGGGG TGAAGGGGGA CGCCCCCCTG GTCTCTGACC GGGGCGAAGA GTATGAGGTG
GCGGTGCGGG GAGACCTCAC CACCATCACC CCGGCGGTCC GCTTCTCCGT CACCCACTTC
CGGGGCCGGC TCCAGGAGGC GGGGTGCGGC TCTTTCGTGG TGGACCTCTC CCAGGCCCCC
CGGGAGCGGT GGCGCCCCAT CCTCGACGCC CTTGCCCGGG GCGAGGGGGT GCCGGGGACG
AGCGAATTCA ACTTTGTCAT GGGATTGGTA TAG
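
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the GC content field can be recomputed as the fraction of G and C bases in the cleaned sequence. The helper names here are hypothetical, and the record does not say how its own GC content value was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return gc_count / len(seq)


# First line of the gene sequence only; paste the full sequence to compare
# against the GC content field of the record.
first_line = "ATGCATAAAC CCTACGAACA AAAGAAAAAA CCGGAACTTC TCGCCCCGGC CGGCTCCCTG"
print(f"{gc_content(first_line):.0%}")
```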
 
Protein sequence
MHKPYEQKKK PELLAPAGSL EAFFAAMEKG ADAVYAGLRE FSARAKAKNF PLTQMERMVA 
YAHGLGRKVY ITLNTLVKEG ELPQLVETLA ALEAMGADAV ILQDLGVARL IRDHFPGLPR
HASTQMTIHN LPGARMLRDM GFERVVLARE LHLDDIRHIS RESGVEIECF IHGALCFAIS
GQCYFSSFLG GHSGNRGRCA QPCRRHYRYR QNEGYFFSTN DLSTVDLIPD LSAAGVASLK
IEGRMKSAEY VASVVEAYRL VLDAPERKRA EAAARAKEIL KFSFGRVPTK GFMASRTPTD
IAIPTLRGAT GRFLGEVKSV RGDRLTFETK DRLFVGDRVR VQPKSDMAGK AFTVKDIFVG
KERVKSAKER SIVTVVSPFA FKAGDAVFKV SSETAFTMSE NACLKRLDAV KPGKIPCALE
LSLAGETLRI GAQAAGAEFA AEFPVGLLEP STTSDMAGVL RAQFSRTGET PFELLGLDAP
GFPSVLIPPA RLKDIRRDFY RRLGEKVAAG VAQRRAEARK RALASLVPPG HPRREVKGEV
TVRIEHLRDT AILRQQGVDA IILPVSRANI HQVPLAARKL RGDEGRIIWH LPFVIFDADL
PFYQEAVALL TGQGFRRFEL SNLSHFPLLA GRDVELSTDY RLFSLNTQAI MAWHELGVTT
STLYIEDDVE NMAVLLGAPV PVRRRVLVYG GVPAMTTRVA IKGVKGDAPL VSDRGEEYEV
AVRGDLTTIT PAVRFSVTHF RGRLQEAGCG SFVVDLSQAP RERWRPILDA LARGEGVPGT
SEFNFVMGLV
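
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact amino acid string that NCBI publishes for translation table 11, whose codon assignments are the same as those of the standard code. It is an illustration, not the pipeline that produced this record: `translate_cds` is a hypothetical helper, and it does not handle alternative initiation codons such as GTG or TTG, which is sufficient here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acid assignments for codons enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a coding sequence; layout whitespace is ignored, a trailing stop is dropped."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Remove the terminal stop codon symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# Start of the first line and the whole last line of the gene sequence above.
print(translate_cds("ATGCATAAAC CCTAC"))                     # MHKPY
print(translate_cds("AGCGAATTCA ACTTTGTCAT GGGATTGGTA TAG"))  # SEFNFVMGLV
```

Both fragments match the corresponding ends of the protein sequence listed above.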