Gene GSU0080 details

Gene Information

Locus tag: GSU0080
Symbol: degQ
ID: 2687866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 91907
End bp: 93322
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 63%
IMG OID: 637124746
Product: protease degQ
Protein accession: NP_951142
Protein GI: 39995191
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
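
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the listed gene length agrees with that reading) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(91907, 93322)
print(length)                  # 1416
print(protein_length(length))  # 471
```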


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGTCC GTCGTCTGCT ACTGATATCG TTGGTCTTTG TCACAACTCT CACCGCCTGC 
TCGAAGAAGG AAGAAAAGCT CTTCTATGAG TCGGGCCGTG CCGACGCGCC GGTCAAGGAG
GTTCCCAGAG ACATCCTTGC CACCCAGCAG GCCTTCGTCG AGCTGGTCAA GAAGGTTACT
CCGTCGGTCG TGAACATCTC CACCGTCAGC CGGAGAAAGA TCGAGCAGCC CTTCTTCGAG
TTTTCCCCCT TCTTCAATGA TTTCTTCGAC AATCGCCCCC GGTTCCGGCG GGAACAGAGC
CTCGGCTCTG GCTTCATCAT CAACCGGGAA GGGTACATCG TCACCAATGA CCATGTGGTG
CGCGACGCCG AAAGCATCAA GGTCAAACTC TCCAATGAGA ACGTCTACGA CGGCCACATC
GTCGGCAGCG ACCCCAAGAC CGACATCGCG GTCATCAAGA TCGACTCGCG GGAGGAACTC
CCCGTGGCGG TCCTGGCCGA TTCGGACAAG CTTCAAGTGG GGCAGTGGGC GGTGGCCATC
GGCAACCCCT TCGGCCTGGA CCGGACCGTG ACCGTCGGCG TGGTGTCGGC CACCGGCCGG
TCCAACATGG GAATCGAGAC CTATGAAGAT TTCATCCAGA CCGACGCCTC CATCAACCCG
GGCAATTCGG GGGGGCCGCT GCTGAACGTC CACGGCGAGG TGATCGGCAT CAACACCGCC
ATCGTGGCCG CCGGTCAGGG GATCGGCTTT GCCATCCCGG TCAACATGGC AAAGCAGATC
GTAACTCAGC TCATCACCAA GGGCAAGGTC ACCCGCGGCT GGCTCGGCGT TACCATTCAA
CCGGTCACCG ACGATCTTGC CAAGGAATTC GGCCTGAAAA AGGCCCAGGG CGTCCTGGTG
AGTGATGTGG TTAAGGGGAG CCCCGCTGCC GGCGCCGGTA TCCGGCAGGG GGACATCATC
CTCAGGTTCG CCGGCAAGGA GATCAAGGAT GCCCAGCACC TCCAGCGGGT GGTGGGCGAC
ACGGCGCCGG GGACAAAGGT GCCGGTGGTG GTCTTCCGAG AAGGGAAAGA GGTTCAACTC
TCCCTGGCGA CGGCCAGTTC CGACAGTGCC CAGGCACGCC AGGCGCGCCC TCAGGGAGGG
GCGCCCGACA CCCTTGGCCT CGCCGTGGAG GAACTACCCC GCGAATACCG TCAGGAAGGT
TTCACCGGCG TCCTGGTGGT CCAGGTGGAT GATGGGAGCG CCGCCGGCGA GGCGGGCATC
CGGGAGGGGG ACGTGATCGT GGCGGTGAAC CGGCGGCCCG TGGCGAACCT GGCAGAGTAC
GACCGCGTCA TGCGCGAGGC GGCCCGGCGC GGTTCGGTAG TGCTTCTGGT GCGACGAGGC
GAGGCGAGCA TCTATTTCTC CCTCAGGCTC AGGTAG
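
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks have to be stripped before any computation. As a rough sketch, a GC fraction could be computed as follows; `gc_fraction` is a hypothetical helper, and the example runs on the first line of the sequence only, so its result is not the whole-gene GC content reported in the record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 bases of the gene sequence, copied with the original block spacing.
first_line = "ATGTTCGTCC GTCGTCTGCT ACTGATATCG TTGGTCTTTG TCACAACTCT CACCGCCTGC"
print(f"{gc_fraction(first_line):.3f}")  # 0.517
```

Passing the full 1416 bp sequence to the same function gives the whole-gene value to compare against the listed GC content.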
 
Protein sequence
MFVRRLLLIS LVFVTTLTAC SKKEEKLFYE SGRADAPVKE VPRDILATQQ AFVELVKKVT 
PSVVNISTVS RRKIEQPFFE FSPFFNDFFD NRPRFRREQS LGSGFIINRE GYIVTNDHVV
RDAESIKVKL SNENVYDGHI VGSDPKTDIA VIKIDSREEL PVAVLADSDK LQVGQWAVAI
GNPFGLDRTV TVGVVSATGR SNMGIETYED FIQTDASINP GNSGGPLLNV HGEVIGINTA
IVAAGQGIGF AIPVNMAKQI VTQLITKGKV TRGWLGVTIQ PVTDDLAKEF GLKKAQGVLV
SDVVKGSPAA GAGIRQGDII LRFAGKEIKD AQHLQRVVGD TAPGTKVPVV VFREGKEVQL
SLATASSDSA QARQARPQGG APDTLGLAVE ELPREYRQEG FTGVLVVQVD DGSAAGEAGI
REGDVIVAVN RRPVANLAEY DRVMREAARR GSVVLLVRRG EASIYFSLRL R
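
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. This simplified `translate` function (an illustrative name) does not handle alternative start codons, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, with the original block spacing.
first_line = "ATGTTCGTCC GTCGTCTGCT ACTGATATCG TTGGTCTTTG TCACAACTCT CACCGCCTGC"
last_line = "GAGGCGAGCA TCTATTTCTC CCTCAGGCTC AGGTAG"
print(translate(first_line))  # MFVRRLLLISLVFVTTLTAC
print(translate(last_line))   # EASIYFSLRLR
```

Both outputs match the corresponding ends of the protein sequence listed above.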