Gene GSU2556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2556 
Symbol 
ID2685493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2818455 
End bp2819669 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content67% 
IMG OID637127246 
ProductU32 family peptidase 
Protein accessionNP_953602 
Protein GI39997651 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGATCC CCGAACTCCT GGCCCCGGCC GGGAACCTCG AAAAACTCAA AGTGGCCGTC 
CACTACGGCG CCGACGCCGT CTACCTGGGC GGTGCCCGCT TCGGACTGCG GAGCCAGGCC
GACAATTTCA CCCCCGCCAC CATGGCCGAG GCCGTGGCCT ATGCCCACGA CCGGGGGGTG
AAGGTCTACC TCACGGTCAA CAGCTATCCC GACACCGACG AGCTGGAGGA ACTGGACCGC
TACCTGGAAG AGGTAGCGCC GATCCCCTTC GACGCCTTCA TCGCCGCCGA TCCCGGGGTC
ATCGCCACCA TCCGCCGCAT CGTCCCGGAC CGCACCATCC ATCTCTCGAC CCAGGCCAAT
ACCACCACCT GGCGCAGCGC CCTCTTCTGG CAGCAGCAGG GCATCAGCCG CATCAACCTG
GCCCGGGAGA TGTCCCTGGA GGCGATCCGC GAAACCCGCC GCCGCGTGTC GGCCGAACTG
GAGGTCTTTG CCCACGGCGC TCTCTGCGTG GCCTATTCGG GGCGCTGCCT CCTCTCCGCC
GTCATGACCG GGCGCCATGC CAACCGGGGG GAGTGCACCC ATCCCTGCCG CTGGAGCTAC
GCCCTGGTGG AGGAAAGCCG GCCCGGCGAG TACTACCCGG TCACTGAGGA CGAAAACGGC
ACGTTCATCT TCAACTCCCG GGACCTCTGC CTCATCCGCC ACATTCCCGA GCTGGTGGAG
GCGGGAGTCG ACTCCCTCAA GATCGAGGGG AGAATGAAGG GAATCCACTA CGTGGCGTCG
GTGGTGCGGG TCTACCGGGA GGCGCTCGAC CGCTATGCCG CCGACCCCGC CGGCTACGCG
TTCCGTCCCG AGTGGCTGGA GGAACTGTCC AAGGTGAGCC ACCGGGGATA CACCACCGGC
TTCCTCCTGG GCCGCCCCGA GGCGGCGGAC CTGGAGTACG ACTCCCGTTA TCTGCGCAGC
CATGACTTTC TCGCCGTGGT GGACGAGATC CTACCCGACG GCACCGCCAT CCTTGCCGTC
CGCAACCGCA TCCGGCCAGG CTGGACCATG GAGCTGATGG GGCCGGGCAT GCGCTCGGAT
ACCTTCAGGC TCGACACCTT CACCGACGAG AACGGGGCTC CCCTGACCGA AGCCCACCCG
AACCAACGGA TCCGGACGAT ACTCCCCGAA GCGGCCGCCC CCTGGGATCT GCTACGGCGG
GAACGGGACG ACTGA
 
Protein sequence
MQIPELLAPA GNLEKLKVAV HYGADAVYLG GARFGLRSQA DNFTPATMAE AVAYAHDRGV 
KVYLTVNSYP DTDELEELDR YLEEVAPIPF DAFIAADPGV IATIRRIVPD RTIHLSTQAN
TTTWRSALFW QQQGISRINL AREMSLEAIR ETRRRVSAEL EVFAHGALCV AYSGRCLLSA
VMTGRHANRG ECTHPCRWSY ALVEESRPGE YYPVTEDENG TFIFNSRDLC LIRHIPELVE
AGVDSLKIEG RMKGIHYVAS VVRVYREALD RYAADPAGYA FRPEWLEELS KVSHRGYTTG
FLLGRPEAAD LEYDSRYLRS HDFLAVVDEI LPDGTAILAV RNRIRPGWTM ELMGPGMRSD
TFRLDTFTDE NGAPLTEAHP NQRIRTILPE AAAPWDLLRR ERDD