Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU0331 |
Symbol | |
ID | 2686703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 361506 |
End bp | 362900 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637124997 |
Product | trypsin domain/PDZ domain-containing protein |
Protein accession | NP_951391 |
Protein GI | 39995440 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00109847 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGG TGTCCTTGCG GTTCCTTAAG ACCATGCTCA CCGTTATCTG CCTGATGGCG TTGTCGGCGG GAGAAGTCCC AGCCAAAGTC ATGGCTCCCG ATTTTGTCAC ACTTGCCGAA AAGCTGAAGC CAACCGTCGT CAACATCAGT ACATCAAAGA ATCCGGCCCA GACAGCTCGC CCCCGTCGCC AACCCTCTCC CTTCAATGAC CCGTTCCATG ATTTCTTCGA TCGCTTTTTT GACGAGGCAC CTCGCCGTCA GCAACGGGAA CGGAGTCTCG GCTCCGGGTT CATCATCAGT GATCAGGGCT TTATCATCAC CAATAACCAT GTGGTTGCCG GTGCCGACGA GATCAAGGTG CGCCTCTCCG ACGGCCGCGA GTTCAAGGCG GAATTGAAGG GCGCCGACGA AAAACTCGAC CTGGCCCTCA TCAAGATTGA GTCCAAAGAT CAACTCCCCG TTGCGATTCT CGGCAACAGC GATGAAATCA AAGTGGGCGA GTGGGTGATG GCGATCGGCA ATCCGTTCGG CCTTGCCCAG ACCGTTACCG CCGGAATCGT CAGCGCCACC GGTCGCGTCA TCGGCAGCGG GCCCTATGAC GATTTCATCC AGACCGATGC CTCCATTAAC CCCGGTAACT CGGGAGGCCC CCTTTTCAGC GCCGAAGGAA AAGTCATCGG CATCAATACC GCCATCATCG CCGGCGGTCA GGGAATCGGG TTTGCCATCC CCATCAACAT GGCCAAAGAT GTCATTCCCC AGCTCGAGGA AAAGGGAAAG GTCATCCGCG GCTGGCTTGG GGTGACGGTT CAGCCCATAA CTCCCGATCT GGCCCGCTCG TTTGGCCTTG AGGGAGAGCG GGGTGCGCTC ATCGCCGACG TGGTGAAGGA TGGCCCCGCC GCCAAGGCCG GACTCAAGAG CGGGGATATC GTGCTTGAAT TCGACGGTAA GAAAATCCGG GAAATGAACG AGCTCCCGCG TATCGTAGCC GCCACCCCTG TGGGGAAGGC CGCATTGGTC AAGGTGCTGC GTGATGGCAA GATGCAGGAT GTCGAAGTAT CTGTCGGGCG CTTGGCGGAT ACGGGCGATG AGTCAGATCA GAAGAATGGT GAAGATAAAC TTGGCATGGC AGTCAGGGAG CTGACACGCG ATCTTGCCGC GCGGATGGGG CTTAAGGAGA CTCAGGGCGT CGTTGTCACG GGTGTCAAGT CTGGCAGTCT GGCCGAGGAA GCGGGAATCC TGCCGGGCGA TATCGTTCGG GAGATAGGAG GGCGTTCCAT TACTACTATG GCGGATTACG AAACAGCGAT CCGAGCCGTG AAGAAGGGAG ACGTAGTCCG CTTTCTGCTG CGCCGCGGCG GTGGCAACCA CTTCCTGGCA ATCCGGGTCG AATAG
|
Protein sequence | MKMVSLRFLK TMLTVICLMA LSAGEVPAKV MAPDFVTLAE KLKPTVVNIS TSKNPAQTAR PRRQPSPFND PFHDFFDRFF DEAPRRQQRE RSLGSGFIIS DQGFIITNNH VVAGADEIKV RLSDGREFKA ELKGADEKLD LALIKIESKD QLPVAILGNS DEIKVGEWVM AIGNPFGLAQ TVTAGIVSAT GRVIGSGPYD DFIQTDASIN PGNSGGPLFS AEGKVIGINT AIIAGGQGIG FAIPINMAKD VIPQLEEKGK VIRGWLGVTV QPITPDLARS FGLEGERGAL IADVVKDGPA AKAGLKSGDI VLEFDGKKIR EMNELPRIVA ATPVGKAALV KVLRDGKMQD VEVSVGRLAD TGDESDQKNG EDKLGMAVRE LTRDLAARMG LKETQGVVVT GVKSGSLAEE AGILPGDIVR EIGGRSITTM ADYETAIRAV KKGDVVRFLL RRGGGNHFLA IRVE
|
| |