Gene Information |
Locus tag | GM21_0228 |
Symbol | |
ID | 8135534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 271231 |
End bp | 272604 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644867849 |
Product | protease Do |
Protein accession | YP_003020071 |
Protein GI | 253698882 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 86 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCAG CAATCGTAGC AAGATTTCTG GTACTTAGCG TATTTCTCTC CTCCATTTTC TCGACCGTCG CGGGCGCCTC GGTGCTCACT CCGGACTTCG CCAAGCTGGC GAAGAAGCTG AAGCCCGCTG TGGTCAACAT CAGCACCTCG AAGACCATAG CGGTGAAAAA GCGGCAGATC GCCCCCGGAC GCGACCCCTT CCAGGAATAT TTCGAGAAGT TTTTCGAGGG GCCTCACCAG CGTCCGCAAA AGCAGAGGAA CCTCGGCACC GGTTTCATCA TCAGCGACGA CGGCTACATC ATCACCAACA ACCACGTGGT GAAGGATGCC GACGAGATCA AGGTGAAGCT CTCCGATGGC AGGGAGTTCG CGGGGGATGT GAAGGGGCGC GACGAAAAGC TGGACCTTGC CCTGGTGAAG ATCGACGCCA AGGGTCACCT TCCCGTGGCG CCTTTGGGGG ACAGCGACAA GATGGAAGTG GGGGACTGGG TGATGGCGAT AGGCAACCCC TTCGGGCTCT CCCAGACCGT GACGGCGGGG ATCATCAGCG CCCAGGGGCG CGTGATCGGT TCCGGCCCGT ATGACGACTT CATCCAGACC GACGCCTCCA TCAACCCCGG CAACTCAGGG GGGCCGCTAT TCAACACCGA AGGGGAGGTG ATCGGCATCA ACACTGCGAT CGTCGCCGGC GGCCAGGGAA TAGGGTTTGC CATACCGGTC AACATGGCGA AGGAGATCCT CCCGCAGCTG AAGTCGGCCG GCAAGGTGAC ACGCGGTTGG CTGGGCGTAT CGGTGCAGTT GGTGACCCCG GACCTCGCCA AATCCTTCGG GCTTGACAGC GAGAAGGGGG CGCTGGTGGC CGACGTGGTG AAAGAGAGCC CCGCGGAGAA GGCCGGCCTC AAGGGGGGCG ACATCATCCT CGAGTACGAC GGGCATCCCA TCAAGGAGAT GGGGGAGCTT CCGCGCCGCG TGGCCGCCAC CCCGGTAGGA AAGAAGGTGA AACTGGTGGT GCAGCGCGAG GGGCGTCAGG AGACGTTGCA GGTGACCGTC GAGCAGTTGA AGGACGACGA CCAGGATAGC GCGGTCGCCA GCGACCGGCT CGGGGTGAAG GTGACGGAGC TTACCCCGGA GCGCGCGCAG CAGTTGCGGG TGCAGGGGGA CAAAGGGGTC GTGGTGACCG ATGTGGAGCC GGACAGCCTT GCCGACCGCG CCGGCATCCA GGAGGGAGAC CTGATCAGGG AGATCAACGG AGTGCGCGTA AGCGGTGTGA GCGATTACAG CAAGTTGATC GCGGCGGCGA AGAAGGGGGG GTATCTGAAG ATGCTGCTGA GGCGCGGCGA CGCCTCCCTG TTCGTGGCGC TCAGGCTCGA ATAG
|
Protein sequence | MQAAIVARFL VLSVFLSSIF STVAGASVLT PDFAKLAKKL KPAVVNISTS KTIAVKKRQI APGRDPFQEY FEKFFEGPHQ RPQKQRNLGT GFIISDDGYI ITNNHVVKDA DEIKVKLSDG REFAGDVKGR DEKLDLALVK IDAKGHLPVA PLGDSDKMEV GDWVMAIGNP FGLSQTVTAG IISAQGRVIG SGPYDDFIQT DASINPGNSG GPLFNTEGEV IGINTAIVAG GQGIGFAIPV NMAKEILPQL KSAGKVTRGW LGVSVQLVTP DLAKSFGLDS EKGALVADVV KESPAEKAGL KGGDIILEYD GHPIKEMGEL PRRVAATPVG KKVKLVVQRE GRQETLQVTV EQLKDDDQDS AVASDRLGVK VTELTPERAQ QLRVQGDKGV VVTDVEPDSL ADRAGIQEGD LIREINGVRV SGVSDYSKLI AAAKKGGYLK MLLRRGDASL FVALRLE
|
| |
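
The length and composition fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source record; the function names are hypothetical. It assumes that the start and end coordinates are 1-based and inclusive (which matches the reported 1374 bp), that the CDS ends with a single stop codon that is not counted in the protein length, and that GC content is the percentage of G and C among the A/C/G/T characters of the gene sequence, ignoring the spaces used to group bases.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C among A/C/G/T characters; spaces and other symbols are skipped."""
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for c in bases if c in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(271231, 272604)           # Start bp, End bp
protein_aa = expected_protein_length(length_bp)   # compare with Protein Length
print(length_bp, protein_aa)
```

To check the GC content field, pass the full Gene sequence string to `gc_percent` and compare the rounded result with the reported value.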