Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_2048 |
Symbol | |
ID | 5163087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 2396175 |
End bp | 2397896 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640549543 |
Product | TPR repeat-containing protein |
Protein accession | YP_001230811 |
Protein GI | 148264105 |
COG category | [N] Cell motility [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.732994 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AGTATTGTGT TGTCAACCTC TGGTTCTTTG TGGTTCTCTC AGGATGCGCG ACCGGCCATG TAGCCGAGCC GCCTCTTCTT CAGGCTCAAG CCCTCCATCC CGACGTTAAT ATCGCTGAAT CACGCTCCAT GTATATCTAT TCGCTCTCCC GTATCCATGT TCTGGAAGGT GATTTCGATG GGGCTTTATC CCTCCTCCAG GCGGCGGTTG AGGCAGATCC CAAATCCGCT TTTCTTCGCA AATCCATTGC TCAGGTCTAT TTACAGATGA ACAGGTTCCA GGACGCGCTG GAATCTTGTC AAACCGCCAT CAAACTTGAT CCCGGCTTTG TCGAGGCACA GATCCTGGCA GGAAATATTC TGGTCGGTCT GCAGCGGGAT AAAGAGGCCA TTCCTTACTA CAAAAAGGCC CTTGAAATTG ACCCGTCCAA GGAAGACATC TACCTCCACC TGGCCATCGC CTACGTGAAG GGGTTTGAAT ACGAGGAGGC TGTCAACACC CTTAAGGTGC TTCTCAAGGT CAATCCCGAT TCCGCCATCG GTTATTACTA TCTGGGGAAG ACTTACGATC AGATGAAGCT TTCCAAGGAT GCCGCCAACT ACTATAAAAA GGCGGTGGAG CTGAAGCCGG ATTTTGAGCA GGCTATCATT GATCTGGGGA TTTCCCAGGA AATGCAGGGG CTCGCCGGTG AGGCAATTAA TACCTACAAT GAATTGTTGC GGATCAATCC GGTTAATTAC AATGTCATTC AACATCTGGT TCAGCTGTAT ATCCAGCAGA AGCGCCTTAA TGACGCCCTT ACGTTGTTGA AAAATATGGC CGACAGCGGT ATCGGCGGAC AGGAAACACA CCGTAAGATC GGCCTCATTT ATCTGGAGAT GGAGCGTTAC GACGACGCAA TCAAGGAATT TACCGAGATA CTCGGGCAGG AGCCGGACGC TCAGCAGGTT CGATATTATC TGGCATCCAC TTATGAGGAG ATGGAAGATT TCGACCGTGC CATCGAAGAA TTCAAGAAGA TCCCCCCGTC ATCAGCCCAT TATTTTGATG CTTTGGGACA TCTCGGCTTC CTTTATAAGG AAAACGGAGA GCCGGAGAAG GGGATTGCAC TTCTTAAGGA AGCAATCACC AACCAGCCGA ATCGAATCGA ACTTTATCTG AATCTTGCCG GACTCTATGA ATCGATGGAT CAATTTGCCG AGGGGCTCCG GGTGTTGACG GATGTGGAAG GGAATTTCCC CAATGATCCT CGGCTGAGCT TCCGCATGGG CGTTCTTTAT GACAAGATGG GTAACAAGGA CGAATCTATT GCCCGGATGA AAAAGGTCAT TGCCCTGGCG CCGAACGATG CGCAGGCATT GAACTATCTC GGCTATACCT ACGCAGAGCT TGGCGTCAAT CTGGATGAGG CGTTGCAGTA TCTGAACAAG GCCGTTTTGC TCCGCCCGGA TGACGGCTTC ATTCTGGACA GCCTCGGTTG GGCCTATTAC AAAATGAAGC GCTACGACCA GGCGGTGTTC CATCTGGAAC GGGCCGTCCA GCTGGTTGAC GAGGACGCCA CCATAATTGG TCACCTGGCC GATGCATATT TTGCCAACAG GGAATACCGC AAGGCGCTTA CACGTTATCG CCGCGTCCTG CAGCTGGAGC CTGAGCGCAA GGACATCGCC GAGAAGATAA AGAAGATCAT GGCGGAGACC GGTGAAAAAT GA
|
Protein sequence | MKKKYCVVNL WFFVVLSGCA TGHVAEPPLL QAQALHPDVN IAESRSMYIY SLSRIHVLEG DFDGALSLLQ AAVEADPKSA FLRKSIAQVY LQMNRFQDAL ESCQTAIKLD PGFVEAQILA GNILVGLQRD KEAIPYYKKA LEIDPSKEDI YLHLAIAYVK GFEYEEAVNT LKVLLKVNPD SAIGYYYLGK TYDQMKLSKD AANYYKKAVE LKPDFEQAII DLGISQEMQG LAGEAINTYN ELLRINPVNY NVIQHLVQLY IQQKRLNDAL TLLKNMADSG IGGQETHRKI GLIYLEMERY DDAIKEFTEI LGQEPDAQQV RYYLASTYEE MEDFDRAIEE FKKIPPSSAH YFDALGHLGF LYKENGEPEK GIALLKEAIT NQPNRIELYL NLAGLYESMD QFAEGLRVLT DVEGNFPNDP RLSFRMGVLY DKMGNKDESI ARMKKVIALA PNDAQALNYL GYTYAELGVN LDEALQYLNK AVLLRPDDGF ILDSLGWAYY KMKRYDQAVF HLERAVQLVD EDATIIGHLA DAYFANREYR KALTRYRRVL QLEPERKDIA EKIKKIMAET GEK
|
| |