Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_0828 |
Symbol | |
ID | 3755706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 851871 |
End bp | 854531 |
Gene Length | 2661 bp |
Protein Length | 886 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637781693 |
Product | TPR repeat-containing protein |
Protein accession | YP_387324 |
Protein GI | 78355875 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGTTCT CTGTGGTTAT GCGTCTTCTT GCCGCTCTTG CTCTTGTTCT GTCTCTTCTT GCTGCCGGTT GTTCCGGCGA AACCAGCGAA GATTTTCTGG CTGAAGGCCG CAAGCTCATG GAGCAGGGCA ATTCGTCCGG CGCCATTGTC TTCTTTAAGA GTGCCCTTGA AAAAGAACCC CAGCTGTACG AGGCCCGTCT GGCTCTCGGG CAGGCTTATG CGGCCGAAGG AAAGCTTGAT CAGGCCGAAA CCGCCTTTCA GAAGTGTCTG CGGCAGAATG CATCCGATCC GGAACTGCGC CTTGCCCTTG CCCGCCTGTA TACGCAGCTG CGCAAAAGCC GCGACACCAT CGAGCACGTG AATGCGTATG AAGGGCTGAA AGGCCGCAGC GCTGAAACCG AAGAACTGAA AGGAATCGCG CTTGCCATGG GCGGCAATGC CCCCGGGGCC GAAGATGCGT TGCGCTCGGC GCTGCAGCTG GAACCCAGAA GAATTTCGTC CCGCCTTGCC CTTGCCCGCC TGTATGCTTC CACCGGCAGA GTGGATGAGG CGCGCACCAT GGTGGGCGAG GCCCTTGCCA TCAGCCCTGC AGATCACGAT TCGTTGCTGC TTGCCGCGGA ACTTACACGC TACAGCGGTG ATGATGCCGC CATAACTGCT GCTTACCGGA CACTTTCCGA ACAGTACCCC GACGATAAAT ATGCCCGTTA CATGATCGGC GCGCAGGAAA TGAAAACCGG TAATCTGGAT GCGGCCCGTC AGACGACGGC GGCCATGCGC ACCCGGTTCG GTGATGATGC GCTGGTGCTC ATGCTGGAAG GCATGCTTGC CTATGCCGAC AAGGATTACG AGGCGGCAGC CCGTGCTTTT CAACGCAGTG TATCGGCACG CCCCACGCTG GACGCTTTTT TCCGTCTGGG GCTGGCGCAG GTCGGGCTGG GCGATCTGGA AACCGCGCTC AGCCAGTTCC GCGTTGTGCT TGACAAACAG CCCCAGCACG CGGAGGCCCG CCGTATGGTG GCCGCCACGC TGCTGCGCCA GAACAGAATT GAAGATGCAC AGGCAGAAGC CCGCCGTCTT GTAGAGATGC ACCCCGACTA CCCGTGGGGA CATTTTCTGC TGGCCTCGGC CGCCATGGCA AGGGGTGACA TGGAACTGGC CGCCAGAGAA CTGGATGTGG TCACCAGTCT TGCTCCCGCC ATGGCCGAAG CTCATCTGCA GAAAAGCTAC ATCAATATCA GCAGCGGCCG GTTTGATCTT GCAGAAAGTG ATCTGAACAG CGCTGTGAGC GCTTCTCCCG GCAACATACA GGCCCGGGTG GCTCTTTTTC AGTTCAATAT GGGCCGGGGC CGCTTTGATG AGGCCGAAAA GGTGCTGACC GAAGGCATGA ACGGCACGCC CAGAGATGCC GTGCTGTGGA ATTACATAGC CGGCATGTAC CAGCTGCGCA GGGACGATGC CAAGGCGCTG GAGGCGCTGG AAAAAGCGCA GCAGGCCGAT CCTGATCTGC GGGACAGTTA CATGACTGCC AGCCGTATCC GCGCTGCCGC AGGCGATGAA GCCGGTGCGC TGGAGCAGCT GGAGCGCTAT CTGCAGCGGC ATCCGGACGA CGGCCGTTTT CTGGTGGTTT CCGCCGTGCT GCTGGATCTG ACAGGAGATG CGCAAAAGGC CGGTGCACGT CTGGACAGGG CGCGCGAGCT GGATGAACCC AGCGCTCTGC CCGCCACCGT CAGCCGTCTG ATGAGCAGGG GCGAGACGGA TAAGGCCCGT CAGCTGCTGC AGGATGCGTA TGCCGCGTCC GGCAGTCTGC GCGACCTTTC CATGCTTACG GGCTTTCTGA CAGGGCAGGG CCGGACGGAC GAAGCGCTGG CCCTTTACGA AGCTTATGCC GCCAAGGATC CTGTGGCTGC TGCGCGCGGG ATTTTTGCGG TGCATACACT CAAGCGTGAC TATCAGAAGG CGCTGGAACA GGCCCGCAAG CTGAGCGATC TTGAGGCATC AAGCCCCGAG GGGGCACTGC AGGAGGCCGC CACCATGGAA CGCATGGGCG ATCCTGCTGC GGCCCTTGCC CGGCTGGAAG AGGCCTACAG AAAATTTCAG GCTCCGCAAC TGCTTATCGC CCAGGCATCG GTGGCAAGGC GCATGGATAA CATAGATAAA GCCGAAGCCT ACATACGCAC CTGCCTTAAG GCCGCACCGG ATTTTGTACC GGCAAAGGTG GCCGCAGCGG AGCTGGCACA CCAGCGCGGC CGGCTGGACG AAGCCGCAGC CGCCTATGAA GAGATTCTGC AGCGTGTGCC GGATGACGCC GCAGTCATGA ATAATCTTGC CATGATCTAC GTCCGTGACG AAGCAACCCT CAGAAGGGCG CTGCAGCTGG CGCTGACGGC ATATGTGCGT CAGCCCGATT CGCCGCAGAT CATGGATACA CTGGGCCTGT GCCTGACCGC GGCGGGACGC CCCGCCGAGG CTGTGCGCGT GCTCAGACGC GCCAGTGCGC TGCTGCCCGA TGATCAGTCC ATCCGCTATC ATTACGCTCA GGCTCTTGCG GCCGCGGGCG AAAAGGGCGC AGCATTGAAA GAAGTCAGCA CCGCTCTGCA GGGCAGCGAG TTTTCCGAGG CAGGGGAAGC CCGGAAGCTG CTCAAAAAAC TGGGCGGGTA A
|
Protein sequence | MQFSVVMRLL AALALVLSLL AAGCSGETSE DFLAEGRKLM EQGNSSGAIV FFKSALEKEP QLYEARLALG QAYAAEGKLD QAETAFQKCL RQNASDPELR LALARLYTQL RKSRDTIEHV NAYEGLKGRS AETEELKGIA LAMGGNAPGA EDALRSALQL EPRRISSRLA LARLYASTGR VDEARTMVGE ALAISPADHD SLLLAAELTR YSGDDAAITA AYRTLSEQYP DDKYARYMIG AQEMKTGNLD AARQTTAAMR TRFGDDALVL MLEGMLAYAD KDYEAAARAF QRSVSARPTL DAFFRLGLAQ VGLGDLETAL SQFRVVLDKQ PQHAEARRMV AATLLRQNRI EDAQAEARRL VEMHPDYPWG HFLLASAAMA RGDMELAARE LDVVTSLAPA MAEAHLQKSY INISSGRFDL AESDLNSAVS ASPGNIQARV ALFQFNMGRG RFDEAEKVLT EGMNGTPRDA VLWNYIAGMY QLRRDDAKAL EALEKAQQAD PDLRDSYMTA SRIRAAAGDE AGALEQLERY LQRHPDDGRF LVVSAVLLDL TGDAQKAGAR LDRARELDEP SALPATVSRL MSRGETDKAR QLLQDAYAAS GSLRDLSMLT GFLTGQGRTD EALALYEAYA AKDPVAAARG IFAVHTLKRD YQKALEQARK LSDLEASSPE GALQEAATME RMGDPAAALA RLEEAYRKFQ APQLLIAQAS VARRMDNIDK AEAYIRTCLK AAPDFVPAKV AAAELAHQRG RLDEAAAAYE EILQRVPDDA AVMNNLAMIY VRDEATLRRA LQLALTAYVR QPDSPQIMDT LGLCLTAAGR PAEAVRVLRR ASALLPDDQS IRYHYAQALA AAGEKGAALK EVSTALQGSE FSEAGEARKL LKKLGG
|
| |