Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_0879 |
Symbol | |
ID | 5162652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 1043702 |
End bp | 1045132 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640548375 |
Product | protease Do |
Protein accession | YP_001229658 |
Protein GI | 148262952 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000781376 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCC GTATTATTAT TTTTTTGTTG ATGCTGGCCA CCCTTCTTTC GGCTTGCAAA AAGAAGGAAG AAGCCTATTT CTTTGAATCA AACCGCAAAG GGACTGCCGA GGCCCCGGTA AAAGAAGTAC CCAAGGACAT CCTTGCCACC CAGCAGGCTT TCGCCAATGT GGTAAAAGCG GTTAACCCGG CAGTGGTCAA TATTTCCACG GTGAGCAAGA AAAAGCTCGT GCAGCCGTTC TTCGAGATGT CCCCCCTTTT CGATGATTTT TTCGGCGGCA GGGGTGGGAC GCCCCAATAC CGCCGTGAAA ACAGCCTGGG TTCCGGCTTT ATCATCAACC GGGACGGATA CATCATCACC AACGATCATG TGGTGCGGGA TGCGGAGAGC ATCCAGGTAA AACTTTCCAA CGAAAATGTT TATAGCGGCA AAGTGGTCGG CAGCGATCCC AAAACCGACA TCGCGGTGAT CAAGATCAAC GCCAAGGAAC AGCTCCCGGT GGCAGTGCTG GGCGATTCCG ACAAACTCCA GGTAGGACAG TGGGCCATTG CTATCGGCAA CCCGTTCGGT CTCGACCGGA CCGTTACCGT CGGGGTGGTG TCGGCGACCG GCCGTTCCAA TATGGGGATC GAAACCTACG AGAACTTCAT CCAGACGGAC GCTTCCATCA ACCCGGGCAA TTCCGGCGGA CCGCTCTTGA ATGTCTATGG CGAGGTAATC GGCATCAATA CTGCCATTGT AGCGGCAGGC CAGGGCATCG GCTTCGCCAT TCCCATAAAT ATGGCGAAAC GGGCAGTGCC GCAGTTGATA AAAAAGGGGA ATGTCAGCCG CGGTTGGCTG GGTGTTTCCA TTCAGCCGGT GACGGAAGAG ATTGCCCAGT CCTTTGGCTT GAAACGGGCA CAAGGTGCCT TGGTGAGCGA TATAATGGCG GGGAGCCCTG CTGCCAAGGC CGGCCTCAGG CAGGGTGACA TCATAACCGG GATTGCCGGC AAAGAGATAA AGTCCGTCCA GCAACTCCAG TTGCTGGTGG CTGATATGCC GGTCGGCTCT CCGGTGGAGA TAGAGGTTTT CCGCGAAGGC CGGGCAAAAA AACTTTCAAT CATCCCTGCC TCCGCTGACA GTGCCGCGGG TGCGAAGCCT AAATCGGTCG AGACGGAAAC GGCATGGGTC GGATTGTCGG TAGAGGAACT CCCCCGTGAT ATACGACTGA AGGGTCTGCA GGGTGTTGTC GTGACGAGCG TTGAGCCGGG CAGCCTTGCT GCCGACAGCG GCATCCAGCA GGGCGATGTC GTTGTTTCAG TCAACCAGAG AAAGATCGCC GGTGTGAACG ACTATGCAAA GGCCATGAAA GATGCTGAAA AGAAAGGGTC CGTAGCCCTG CTGGTAAGGC GGGGGGATGC CAGCATCTAT TTTGCGATCA GAATAAAGTA G
|
Protein sequence | MKIRIIIFLL MLATLLSACK KKEEAYFFES NRKGTAEAPV KEVPKDILAT QQAFANVVKA VNPAVVNIST VSKKKLVQPF FEMSPLFDDF FGGRGGTPQY RRENSLGSGF IINRDGYIIT NDHVVRDAES IQVKLSNENV YSGKVVGSDP KTDIAVIKIN AKEQLPVAVL GDSDKLQVGQ WAIAIGNPFG LDRTVTVGVV SATGRSNMGI ETYENFIQTD ASINPGNSGG PLLNVYGEVI GINTAIVAAG QGIGFAIPIN MAKRAVPQLI KKGNVSRGWL GVSIQPVTEE IAQSFGLKRA QGALVSDIMA GSPAAKAGLR QGDIITGIAG KEIKSVQQLQ LLVADMPVGS PVEIEVFREG RAKKLSIIPA SADSAAGAKP KSVETETAWV GLSVEELPRD IRLKGLQGVV VTSVEPGSLA ADSGIQQGDV VVSVNQRKIA GVNDYAKAMK DAEKKGSVAL LVRRGDASIY FAIRIK
|
| |