Gene GSU0331 details


Gene Information

Locus tag: GSU0331
Symbol:
ID: 2686703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 361506
End bp: 362900
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 58%
IMG OID: 637124997
Product: trypsin domain/PDZ domain-containing protein
Protein accession: NP_951391
Protein GI: 39995440
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
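
The coordinate and length fields in this record are mutually consistent, which the following minimal sketch illustrates. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for a CDS of length_bp bases, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(361506, 362900)
print(length)                     # 1395
print(protein_length_aa(length))  # 464
```

Both results match the Gene Length and Protein Length fields listed for GSU0331.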


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00109847
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATGG TGTCCTTGCG GTTCCTTAAG ACCATGCTCA CCGTTATCTG CCTGATGGCG 
TTGTCGGCGG GAGAAGTCCC AGCCAAAGTC ATGGCTCCCG ATTTTGTCAC ACTTGCCGAA
AAGCTGAAGC CAACCGTCGT CAACATCAGT ACATCAAAGA ATCCGGCCCA GACAGCTCGC
CCCCGTCGCC AACCCTCTCC CTTCAATGAC CCGTTCCATG ATTTCTTCGA TCGCTTTTTT
GACGAGGCAC CTCGCCGTCA GCAACGGGAA CGGAGTCTCG GCTCCGGGTT CATCATCAGT
GATCAGGGCT TTATCATCAC CAATAACCAT GTGGTTGCCG GTGCCGACGA GATCAAGGTG
CGCCTCTCCG ACGGCCGCGA GTTCAAGGCG GAATTGAAGG GCGCCGACGA AAAACTCGAC
CTGGCCCTCA TCAAGATTGA GTCCAAAGAT CAACTCCCCG TTGCGATTCT CGGCAACAGC
GATGAAATCA AAGTGGGCGA GTGGGTGATG GCGATCGGCA ATCCGTTCGG CCTTGCCCAG
ACCGTTACCG CCGGAATCGT CAGCGCCACC GGTCGCGTCA TCGGCAGCGG GCCCTATGAC
GATTTCATCC AGACCGATGC CTCCATTAAC CCCGGTAACT CGGGAGGCCC CCTTTTCAGC
GCCGAAGGAA AAGTCATCGG CATCAATACC GCCATCATCG CCGGCGGTCA GGGAATCGGG
TTTGCCATCC CCATCAACAT GGCCAAAGAT GTCATTCCCC AGCTCGAGGA AAAGGGAAAG
GTCATCCGCG GCTGGCTTGG GGTGACGGTT CAGCCCATAA CTCCCGATCT GGCCCGCTCG
TTTGGCCTTG AGGGAGAGCG GGGTGCGCTC ATCGCCGACG TGGTGAAGGA TGGCCCCGCC
GCCAAGGCCG GACTCAAGAG CGGGGATATC GTGCTTGAAT TCGACGGTAA GAAAATCCGG
GAAATGAACG AGCTCCCGCG TATCGTAGCC GCCACCCCTG TGGGGAAGGC CGCATTGGTC
AAGGTGCTGC GTGATGGCAA GATGCAGGAT GTCGAAGTAT CTGTCGGGCG CTTGGCGGAT
ACGGGCGATG AGTCAGATCA GAAGAATGGT GAAGATAAAC TTGGCATGGC AGTCAGGGAG
CTGACACGCG ATCTTGCCGC GCGGATGGGG CTTAAGGAGA CTCAGGGCGT CGTTGTCACG
GGTGTCAAGT CTGGCAGTCT GGCCGAGGAA GCGGGAATCC TGCCGGGCGA TATCGTTCGG
GAGATAGGAG GGCGTTCCAT TACTACTATG GCGGATTACG AAACAGCGAT CCGAGCCGTG
AAGAAGGGAG ACGTAGTCCG CTTTCTGCTG CGCCGCGGCG GTGGCAACCA CTTCCTGGCA
ATCCGGGTCG AATAG
 
Protein sequence
MKMVSLRFLK TMLTVICLMA LSAGEVPAKV MAPDFVTLAE KLKPTVVNIS TSKNPAQTAR 
PRRQPSPFND PFHDFFDRFF DEAPRRQQRE RSLGSGFIIS DQGFIITNNH VVAGADEIKV
RLSDGREFKA ELKGADEKLD LALIKIESKD QLPVAILGNS DEIKVGEWVM AIGNPFGLAQ
TVTAGIVSAT GRVIGSGPYD DFIQTDASIN PGNSGGPLFS AEGKVIGINT AIIAGGQGIG
FAIPINMAKD VIPQLEEKGK VIRGWLGVTV QPITPDLARS FGLEGERGAL IADVVKDGPA
AKAGLKSGDI VLEFDGKKIR EMNELPRIVA ATPVGKAALV KVLRDGKMQD VEVSVGRLAD
TGDESDQKNG EDKLGMAVRE LTRDLAARMG LKETQGVVVT GVKSGSLAEE AGILPGDIVR
EIGGRSITTM ADYETAIRAV KKGDVVRFLL RRGGGNHFLA IRVE
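
The gene and protein sequences above can be cross-checked against each other, and against the GC content field, with a short sketch like the one below. It assumes the sequences are pasted in as plain strings containing the spaces and line breaks shown here. The helper names (`clean`, `gc_content`, `translate`) are illustrative and not taken from the source. The codon table is the standard one, which NCBI translation table 11 shares apart from its permitted start codons.

```python
# Codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11
# (table 11 differs from the standard code only in its permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGAAAATGG TGTCCTTGCG GTTCCTTAAG"))  # MKMVSLRFLK
print(translate("ATCCGGGTCG AATAG"))                  # IRVE
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence (after removing spaces), and running `gc_content` on the same input, would be one way to confirm that the record is internally consistent.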