Gene Information |
Locus tag | GSU3347 |
Symbol | |
ID | 2686448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 3677009 |
End bp | 3679381 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637128041 |
Product | U32 family peptidase |
Protein accession | NP_954387 |
Protein GI | 39998436 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
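The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3677009
END_BP = 3679381
PROTEIN_LENGTH_AA = 790


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                           # 2373
print(expected_protein_length(length))  # 790
```

Under those assumptions both results agree with the Gene Length (2373 bp) and Protein Length (790 aa) fields.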
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAACA CCAACGAACC AATGAAAAAA CCCGAACTTC TCGCCCCTGC CGGCTCCCTT GAGGCATTCT TCGCCGCCAT GGAAAAGGGC GCCGACGCGG TCTATGCCGG TCTGCGCGAT TTCTCCGCCC GGGCCAAGGC CAAGAACTTC AGCACCTCCC AGATGGAGCG GATGACCACC TATGCCCACA GCCTGGGCCG CAAGGTCTAC GTCACCATCA ATACCCTTGT GAAGGAAGCG GAGCTGCCGC AGCTCGTGGA GACCCTCGCT GCCCTGGAGG CCATGGGCAC CGACGGGGTC ATCCTCCAGG ATCTGGGGGT GGCACGGCTG ATCCGCGACC ACTTTCCCGG CATTCCCCGC CACGCATCCA CCCAGATGAC GATCCACAAC CTGGCTGGGG TCCAGGTGCT GGGCGAGATG GGCTTCGAGC GGGTCGTGCT CGCCCGCGAG CTCCACCTGG ACGACATCCG CCGCATCTGC GGATCGACGC CGGTGGAAAT CGAATGCTTC ATCCATGGAG CGCTCTGCTT CGCCATCTCG GGGCAGTGCT ACTTCTCGTC CTTTCTCGGC GGGCACAGCG GCAACCGGGG GCGCTGTGCC CAGCCCTGCC GCCGCCACTA TCGCTATCGG AGCAAGGATG GGTACTACTT CTCCACCAAT GACCTCTCCA CGGTGGATCT GATCCCTGAC CTGGCCGCCG CCGGCGTGGC GTCGCTCAAG ATCGAGGGAC GGATGAAGTC GGCCGAGTAC GTGGCGAGCG TGGTGGAGGC CTACCGGATG GTGCTGGACG CGCCGGAGCG GAAACGCGCC GAGGCCACCG GCCGCGCCAA GGAGCTTCTG AAGCTCTCCT TCGGCCGGGT GCCGACCAAG GGCTTCATGG GCTCCCGCAC CCCCACCGAC ATTGCCATCC CCACCCTGCG CGGGGCCACG GGCCGGTTTC TGGGAGAGAT CACCGGGGTG AAGGGCAACA GAATCACCTT CGAAACCAAG GACCCACTCT TCGTGGGGGA CCGGATCAGG GTGCAGCCGA AAAGCGACAT GGCCGGCCGC GCCTTCACGG TGAAAGAACT TTTTGCCGGT CAGGGCAAGG TGAAATCGGT CCGGGAAAAA AGCATTGTCT CCGTGATCTC GCCTTTCCCA TTCAAGGTGG GGGATGCAGT CTTCAAGGTC TCGTCCGAAA CCGCGTTCAC CATGAGTGAA AATGCCTGCC TCAAGCGCCT GGATGCGGTC AAGCCCTGCG CCATCCCCTG CGACCTGTCG CTAGCCCTGG ACGGGGAGAC CCTTCAGGTG ACCGGCCTGG CTGCCGGGGG GCGGGTTGAG GCCGCCTTCC CCGTGGGTGT TTTGGAACCT GCCCGGACCG AAGACATGAC CGGCGTGCTC AGGGCCCAGT TCTCCCGCAC CGGCGACACC CCCTTCGAAC TGAGGGGCCT CGACGCGCCC GGCTTCCCCC GCGTTCTCAT CCCGCCGGCA AAGCTCAAGG AAATCCGCCG GGAATTCTAT CGGCTTCTGG CCGAGGAGGC CGTGGCCGGT GCCCGGATCC GCAAGGCGGA GGCCCGGCGA CGGGCACTGG CGGCCCTCGT TCCGGCGGCA CAGCCGCGTC GGGAGCCGAG GTCGGAAGTC ATCGTGCGAA TCGAGCACCT GCGGGACGCC AGCCTCCTGC GGCAGCCGGG TGTCGATGGC ATCACCCTGC CGGTCTCCCG GGCCAACATC CACCAACTGC CCCTTGCGGC CCGCAAGCTG CGAGGGGACG CGGATCGGAT CACCTGGCAC CTGCCGTTCA TCATGTTCGA TGACGACCTC 
CCCTTCTACC GGGAGGCGGT GGATGTCATC CTGGCCCACG GATTCCGGCG TTTCGAGCTA TCCAACCTCT CCCACGCGGC CCTGCTGAAG GGACGCGACG CCGAGCTTGC CACCGACTAT CGCCTGTTCT CGCTCAACAC CCAGGCGATC CTTGCCTGGC ACGAACTGGG GGTCACAACC GCCACCCTCT ACATCGAGGA TGACGCCGAG AACATGGCGC GGCTCCTGGG GGCCGCCGTG CCGGTGAAAC GGCGGGTGCT GGTCTACGCC GGCGTACCGG CCATGACCAC TCGCATCGCC ATCCGCGGGG TAAAAAACGA TGCCCCTCTC GTCTCGGACC GGGGCGACGA GTATGACGTG GCGATCCGGG GAGACCTCAC CACGATTACC CCAGCCACAC GCTTCTCCAT CACCCAATTC CGGGGACAGC TCCAGGAGAC GGGCTGCGGC ACCTTCATCG TGGACCTGTC CCAAGCGCCC CGTGAGCAGT GGCGACCGAT CCTCGACACC TTTGCCCGGG GCGGCGAACT GCCGGGGACC AGCCCCTTCA ACTTCGTAAT GGGCCTCGTG TAA
|
Protein sequence | MLNTNEPMKK PELLAPAGSL EAFFAAMEKG ADAVYAGLRD FSARAKAKNF STSQMERMTT YAHSLGRKVY VTINTLVKEA ELPQLVETLA ALEAMGTDGV ILQDLGVARL IRDHFPGIPR HASTQMTIHN LAGVQVLGEM GFERVVLARE LHLDDIRRIC GSTPVEIECF IHGALCFAIS GQCYFSSFLG GHSGNRGRCA QPCRRHYRYR SKDGYYFSTN DLSTVDLIPD LAAAGVASLK IEGRMKSAEY VASVVEAYRM VLDAPERKRA EATGRAKELL KLSFGRVPTK GFMGSRTPTD IAIPTLRGAT GRFLGEITGV KGNRITFETK DPLFVGDRIR VQPKSDMAGR AFTVKELFAG QGKVKSVREK SIVSVISPFP FKVGDAVFKV SSETAFTMSE NACLKRLDAV KPCAIPCDLS LALDGETLQV TGLAAGGRVE AAFPVGVLEP ARTEDMTGVL RAQFSRTGDT PFELRGLDAP GFPRVLIPPA KLKEIRREFY RLLAEEAVAG ARIRKAEARR RALAALVPAA QPRREPRSEV IVRIEHLRDA SLLRQPGVDG ITLPVSRANI HQLPLAARKL RGDADRITWH LPFIMFDDDL PFYREAVDVI LAHGFRRFEL SNLSHAALLK GRDAELATDY RLFSLNTQAI LAWHELGVTT ATLYIEDDAE NMARLLGAAV PVKRRVLVYA GVPAMTTRIA IRGVKNDAPL VSDRGDEYDV AIRGDLTTIT PATRFSITQF RGQLQETGCG TFIVDLSQAP REQWRPILDT FARGGELPGT SPFNFVMGLV
|
| |
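The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch, not part of the original record, of stripping the whitespace, computing a GC fraction, and translating a coding sequence. It uses the standard sense-codon assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it allows). The helper names are hypothetical, and only the first 30 bases of the gene sequence are used as sample input.

```python
# Standard codon table, built in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_30 = "ATGCTCAACA CCAACGAACC AATGAAAAAA"
print(translate(first_30))  # MLNTNEPMKK
```

The translated fragment matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field, although this short fragment is not representative of the whole gene.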