Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3457 |
Symbol | |
ID | 8138829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4000636 |
End bp | 4002066 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871077 |
Product | protease Do |
Protein accession | YP_003023237 |
Protein GI | 253702048 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 102 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGCA TCAGACTCCT CGCTGTCTGC CTTATCGTCT CCACGTTAAT GATAGCCTGC AAGAAAAAGG AGCAGGCATC CTTCTACGAA TCCGAGCGCA AGGAGAGCGT CCCTGCCCCG GTCCAGGAGG TTCCCAAGGA CATCCTGGTG ACGCAGCAGG CCTTCACCGC GCTGGTCAGG ACCGTGACCC CCTCCGTCGT CAACATCTCC ACCATCGGCA AGAAAAAGCT GGTCCGCCCC TTCTTCGAGT CGTCCCCGTT CTTCGACGAG TTCTTCGGCG AAAGAACACG CCCGCAGTAC CGCCGAGAAC ACAGCTTGGG CTCCGGCTTC ATTCTGAACA AGGAAGGGTA TATCGTCACC AACGATCACG TGGTGCGGGA CGCCGAAACC ATACAGGTGA AGCTTTCCAA CGAAAGCGTC TACAAGGGGA AGGTGATCGG CTCGGACCCG AAGACGGACA TCGCGGTGAT AAAGATCGAC GCCAAAGAGC CGCTGCCGGC AGCAGTCCTC GGGGATTCCA ATAAGCTGCA GGTCGGGCAG TGGGCGATTG CCATAGGGAA CCCCTTCGGC CTCGACCGGA CCGTCACCGT GGGCGTTGTC TCCGCAACCG GCAGATCGAA CATGGGGATC GAGACCTACG AGGATTTCAT ACAGACCGAC GCCTCCATCA ACCCGGGGAA CTCCGGCGGT CCGCTGCTTA ACATCTACGG CGAGGTGATC GGAATCAACA CCGCCATCGT CGCCGCCGGA CAGGGAATCG GCTTCGCCAT CCCGGTAAAC ATGGCGAAGC AGGTGGTGAC GCAGTTGATC AGCAAGGGGA ACGTGAGCCG CGGCTGGCTC GGCGTCTCGA TCCAGTCGGT GACCGAGGAG ATGGCGAGCT CCTTCGGGCT TCCCAAAGCG TACGGCGCGC TGGTGAACGA CGTCGTTGCA GGCGGCCCGG CTGCGAAGGC CGGCGTCATG CAGGGGGACG TGATCACGAG CTTTGCCGGG ACAGCGGTGA AGGACGTGCG CCAGTTGCAG CGCCTGGTCG GGGAAACGCC TATCGGGAAG AAGGTCCCGG TGGAGCTCTA TCGCGACGGC AAGAAGATCA CCGTCCAGAT AACCACCGCA CCTGCCGACA GCGCCCAGGC CCAGATGCAA AGACCTGCCG AGCGGGAAGC GGGGACACTG GGGCTCTCGG TCGAGGAACT GGGAGCGGAG ATGCGTTCCC GCGGCGTCAC CGGCGTGGTG GTGAGCGACC TCGAACCGGG CGGGATCGCT GAGGAGAGCG GCATCCAGCG CGGCGACATC ATCGTCTCCG TGAACCAGAG GAAGGTGAGG AATCTGGCCG AGTACCAGAA GGCGATGGCG GACGCCGGCA AACGCGGCGC CGTGGCGCTC CTGGTCCGGA GAGGCTCGGC CAGCATCTAC TTCGCACTAA AATTAAGATA G
|
Protein sequence | MKSIRLLAVC LIVSTLMIAC KKKEQASFYE SERKESVPAP VQEVPKDILV TQQAFTALVR TVTPSVVNIS TIGKKKLVRP FFESSPFFDE FFGERTRPQY RREHSLGSGF ILNKEGYIVT NDHVVRDAET IQVKLSNESV YKGKVIGSDP KTDIAVIKID AKEPLPAAVL GDSNKLQVGQ WAIAIGNPFG LDRTVTVGVV SATGRSNMGI ETYEDFIQTD ASINPGNSGG PLLNIYGEVI GINTAIVAAG QGIGFAIPVN MAKQVVTQLI SKGNVSRGWL GVSIQSVTEE MASSFGLPKA YGALVNDVVA GGPAAKAGVM QGDVITSFAG TAVKDVRQLQ RLVGETPIGK KVPVELYRDG KKITVQITTA PADSAQAQMQ RPAEREAGTL GLSVEELGAE MRSRGVTGVV VSDLEPGGIA EESGIQRGDI IVSVNQRKVR NLAEYQKAMA DAGKRGAVAL LVRRGSASIY FALKLR
|
| |