Gene GSU3347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3347 
Symbol 
ID2686448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3677009 
End bp3679381 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content66% 
IMG OID637128041 
ProductU32 family peptidase 
Protein accessionNP_954387 
Protein GI39998436 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAACA CCAACGAACC AATGAAAAAA CCCGAACTTC TCGCCCCTGC CGGCTCCCTT 
GAGGCATTCT TCGCCGCCAT GGAAAAGGGC GCCGACGCGG TCTATGCCGG TCTGCGCGAT
TTCTCCGCCC GGGCCAAGGC CAAGAACTTC AGCACCTCCC AGATGGAGCG GATGACCACC
TATGCCCACA GCCTGGGCCG CAAGGTCTAC GTCACCATCA ATACCCTTGT GAAGGAAGCG
GAGCTGCCGC AGCTCGTGGA GACCCTCGCT GCCCTGGAGG CCATGGGCAC CGACGGGGTC
ATCCTCCAGG ATCTGGGGGT GGCACGGCTG ATCCGCGACC ACTTTCCCGG CATTCCCCGC
CACGCATCCA CCCAGATGAC GATCCACAAC CTGGCTGGGG TCCAGGTGCT GGGCGAGATG
GGCTTCGAGC GGGTCGTGCT CGCCCGCGAG CTCCACCTGG ACGACATCCG CCGCATCTGC
GGATCGACGC CGGTGGAAAT CGAATGCTTC ATCCATGGAG CGCTCTGCTT CGCCATCTCG
GGGCAGTGCT ACTTCTCGTC CTTTCTCGGC GGGCACAGCG GCAACCGGGG GCGCTGTGCC
CAGCCCTGCC GCCGCCACTA TCGCTATCGG AGCAAGGATG GGTACTACTT CTCCACCAAT
GACCTCTCCA CGGTGGATCT GATCCCTGAC CTGGCCGCCG CCGGCGTGGC GTCGCTCAAG
ATCGAGGGAC GGATGAAGTC GGCCGAGTAC GTGGCGAGCG TGGTGGAGGC CTACCGGATG
GTGCTGGACG CGCCGGAGCG GAAACGCGCC GAGGCCACCG GCCGCGCCAA GGAGCTTCTG
AAGCTCTCCT TCGGCCGGGT GCCGACCAAG GGCTTCATGG GCTCCCGCAC CCCCACCGAC
ATTGCCATCC CCACCCTGCG CGGGGCCACG GGCCGGTTTC TGGGAGAGAT CACCGGGGTG
AAGGGCAACA GAATCACCTT CGAAACCAAG GACCCACTCT TCGTGGGGGA CCGGATCAGG
GTGCAGCCGA AAAGCGACAT GGCCGGCCGC GCCTTCACGG TGAAAGAACT TTTTGCCGGT
CAGGGCAAGG TGAAATCGGT CCGGGAAAAA AGCATTGTCT CCGTGATCTC GCCTTTCCCA
TTCAAGGTGG GGGATGCAGT CTTCAAGGTC TCGTCCGAAA CCGCGTTCAC CATGAGTGAA
AATGCCTGCC TCAAGCGCCT GGATGCGGTC AAGCCCTGCG CCATCCCCTG CGACCTGTCG
CTAGCCCTGG ACGGGGAGAC CCTTCAGGTG ACCGGCCTGG CTGCCGGGGG GCGGGTTGAG
GCCGCCTTCC CCGTGGGTGT TTTGGAACCT GCCCGGACCG AAGACATGAC CGGCGTGCTC
AGGGCCCAGT TCTCCCGCAC CGGCGACACC CCCTTCGAAC TGAGGGGCCT CGACGCGCCC
GGCTTCCCCC GCGTTCTCAT CCCGCCGGCA AAGCTCAAGG AAATCCGCCG GGAATTCTAT
CGGCTTCTGG CCGAGGAGGC CGTGGCCGGT GCCCGGATCC GCAAGGCGGA GGCCCGGCGA
CGGGCACTGG CGGCCCTCGT TCCGGCGGCA CAGCCGCGTC GGGAGCCGAG GTCGGAAGTC
ATCGTGCGAA TCGAGCACCT GCGGGACGCC AGCCTCCTGC GGCAGCCGGG TGTCGATGGC
ATCACCCTGC CGGTCTCCCG GGCCAACATC CACCAACTGC CCCTTGCGGC CCGCAAGCTG
CGAGGGGACG CGGATCGGAT CACCTGGCAC CTGCCGTTCA TCATGTTCGA TGACGACCTC
CCCTTCTACC GGGAGGCGGT GGATGTCATC CTGGCCCACG GATTCCGGCG TTTCGAGCTA
TCCAACCTCT CCCACGCGGC CCTGCTGAAG GGACGCGACG CCGAGCTTGC CACCGACTAT
CGCCTGTTCT CGCTCAACAC CCAGGCGATC CTTGCCTGGC ACGAACTGGG GGTCACAACC
GCCACCCTCT ACATCGAGGA TGACGCCGAG AACATGGCGC GGCTCCTGGG GGCCGCCGTG
CCGGTGAAAC GGCGGGTGCT GGTCTACGCC GGCGTACCGG CCATGACCAC TCGCATCGCC
ATCCGCGGGG TAAAAAACGA TGCCCCTCTC GTCTCGGACC GGGGCGACGA GTATGACGTG
GCGATCCGGG GAGACCTCAC CACGATTACC CCAGCCACAC GCTTCTCCAT CACCCAATTC
CGGGGACAGC TCCAGGAGAC GGGCTGCGGC ACCTTCATCG TGGACCTGTC CCAAGCGCCC
CGTGAGCAGT GGCGACCGAT CCTCGACACC TTTGCCCGGG GCGGCGAACT GCCGGGGACC
AGCCCCTTCA ACTTCGTAAT GGGCCTCGTG TAA
 
Protein sequence
MLNTNEPMKK PELLAPAGSL EAFFAAMEKG ADAVYAGLRD FSARAKAKNF STSQMERMTT 
YAHSLGRKVY VTINTLVKEA ELPQLVETLA ALEAMGTDGV ILQDLGVARL IRDHFPGIPR
HASTQMTIHN LAGVQVLGEM GFERVVLARE LHLDDIRRIC GSTPVEIECF IHGALCFAIS
GQCYFSSFLG GHSGNRGRCA QPCRRHYRYR SKDGYYFSTN DLSTVDLIPD LAAAGVASLK
IEGRMKSAEY VASVVEAYRM VLDAPERKRA EATGRAKELL KLSFGRVPTK GFMGSRTPTD
IAIPTLRGAT GRFLGEITGV KGNRITFETK DPLFVGDRIR VQPKSDMAGR AFTVKELFAG
QGKVKSVREK SIVSVISPFP FKVGDAVFKV SSETAFTMSE NACLKRLDAV KPCAIPCDLS
LALDGETLQV TGLAAGGRVE AAFPVGVLEP ARTEDMTGVL RAQFSRTGDT PFELRGLDAP
GFPRVLIPPA KLKEIRREFY RLLAEEAVAG ARIRKAEARR RALAALVPAA QPRREPRSEV
IVRIEHLRDA SLLRQPGVDG ITLPVSRANI HQLPLAARKL RGDADRITWH LPFIMFDDDL
PFYREAVDVI LAHGFRRFEL SNLSHAALLK GRDAELATDY RLFSLNTQAI LAWHELGVTT
ATLYIEDDAE NMARLLGAAV PVKRRVLVYA GVPAMTTRIA IRGVKNDAPL VSDRGDEYDV
AIRGDLTTIT PATRFSITQF RGQLQETGCG TFIVDLSQAP REQWRPILDT FARGGELPGT
SPFNFVMGLV