Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0470 |
Symbol | |
ID | 4662066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 597018 |
End bp | 600230 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639818679 |
Product | TPR repeat-containing protein |
Protein accession | YP_965920 |
Protein GI | 120601520 |
COG category | [S] Function unknown |
COG ID | [COG1729] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.524501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.148928 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAAGA CACCCGCCGG TCTTGCCCGG TTCCTCCTTG CGGCATGGCT TATCCTGCCG CCCATGGCCC ACGAAGCGTT CGCCGCCTCG TGGCAGTGGG CGTCGATGCC GCGTCGGGAA CGTGTCACCA TCGCCCTCGA CGCGCCGCAG GCAGACATTC GCCACAGCCG TACGGGTCGG CAGGAGATAA CGCTTCCGCT TGGGGGGTCG GGTCTGCTGC AACGTACCGG GCCAACCCCG GCGTCGGCCC GCATTCTCGA CGACGTGAAG GTCGAGGGGG CAGACGTCCG CATCTCCACC CGCACCCCGG GCTTCGGCTA CATACTCACC CGCCCCGACC CAGGGCATGT CGTCATCGAC CTTTTTGAAG ACCCGCTTGG CAACTCGTGG CAGCCGGATA CCACGGCACC TGCCGCGACA GCGACAGCGG CGACCGCAAC CGCCCCGGCG ACAACTGCAC CAGTTGGACA AGCGGCGGAG GCGATGGCGC CACAGGCGCC ACAGACCACC GTCACACCGC AACGCCCCTC TGCCCAGACG GAGGCGGCAC CTGCCGCGCC CGCCCCAGTG GCCCCTCCAG TGGCACCCCC TGCGCCGCTC AAATCACGTG GCATCGTGGA GACGCCCCTT ACAGACGCTC GGCCCGCCAC AGGGACCGTG AGCGGACAAC TGGCCCCCGG CAACCCCGTC ACCCCGCTTC CGGCGCAGGA ACCGGCTGCG CCGACAGCCG ATGCGGCCCC GGCACCCGGC CCCACCACCG CCCCGCCGCC CCTGCCCGCC CCGCACACGG GGGCCGACAG GCAGCGTGCC TACTTCACGG TCCCCTACGC ACTGCGGGGA CGCGTCAACT TCGGCGGCCC CGAAGACTGG CCACAGGAAC AGGCCGTTTC GGCTTCGTTC GGTGCGCCGC AAGGCAATGC CACCAACGGT GCCACAGCAC AGGCTGACAA TGCCGTGGGC GGGCGCATGG CCCCCCGCGA CGGCACCGCC GTCACAGCCC CCGCCGGGAC CCCGCAGCAG CCCGCCGGAC AGGCAACGGC TGACCAGACA GCGCAGGGGC AGACTGCCGA CGCCCCCGCG CCGCTGCAAC CCGTCACCCC GCAGGCCGAC CCCACGGCAC AGGCCAGCGC CTCCGCGCCC GCCAATGCCA CGGTCGCCCA TTCTCCCGCA GGCAACGGCA CGGCAGCCAA CGCCACCGGC GTGGTCTATG TGGACGAGAA GGGCAATCCC GTTCCCCCGC CACCCGACCC GCCACAGTTG CTGGCAGAGG CAAAGAGCCT CATCTCCACC AAGGACTGGC CCGGTGCGCT GGAACGTCTC GGTCTGCTCA AGGGGTTGCC CGACATCCCG TCCGACATGC GCGAGGAAGT GCTATACCTC ATCAGTGACA CGCTCTTCGC CCAGCACAAG GACAGCATCC TCGAGGGCTA CGAGAGCATC ATGGACGCCA CCAGCGAAGC CATGAACTAC AATATCCGCT CGCCGCGGGT GCCGCTGGCC CTGCTGCGCC TCGGCCTTCT CAATCTGCGG GCCGGCAACA CGCGCGAGGC CGAAGCCTAC TTCGCGCTGA TGAAGCGCCA GTACCCGCAC GACGACAACA TCCCCCTCGC CTACTTCTAT CTTGGCGAAG ACCAGTTCAG GAAGGGCCAG TACCAGAAGG CCGCCGACCA GTTCCAGTAC ATCCTGCAGA ACCACCCCGA AAGCCGCTAC GTACGCGAAT CGTCGGTGTT CCTTGCACGG TCGCTGCACC GCCTCGGCTA CCTCGAACAG GCGTCTGCCA TCATGGACTT CGTGGACAAG CGCTGGCCGC GCCTCTACCT CGAAACCCCC GAATACCTGC TCATGGCCGC CGACGTGGAG ACGCAGACAG GGCGTCTCGA CCAGGCCCGC GCCTCATACT GGACGTACTT CAACATCCAC CCCGAAGGTG CGGAGAACGA CGTGGTGCTT GCCAAGCTTG GCGACATCTA CGCGCAGCAG AAACAGGACA AGGCCGCCCG CGAAATCTAT GAAGAGGCCC TGCGACGCTT CCCCGACAAG GACGGCGGGC TCATCGCCCT GCTTCGCCTC ACGGAACAGG GCATCTATGA CAAGCCCGAT GTGGCAGCCA TGTTCTCGGT CTTCGACAAG CCCGGCGCCA GCGACCCCGC AGAGGCGTAC AACCGCATCA TCGAGGGACA TCCGAAAAGC GCCCTGGTCC CCATGGCCCG CATCAAGCTC GCCATGTGGC ACCTGTGGAA GCAGAAGTAC CCCGAAGCCC TCGAAGCCAT GGCCGAATTC GCCGCACAGC ACGGCAAGCA CGAACTGCTG GACAAGGCAC GCGAGGTGGC CGTACGCGCC TTCGGCTTGC TCGCCGCAGA CGCCGTCAAG GAAGGCGATT ACGACAGGGT GCTGCGATTC TGGGAAGACT ACCCCATCGT CCGTGAACAG GCGAAGAACT TCGGCCCCGA ACTCAGGCTC GCTCTTGGCA TGAGCTTCTG GAAGAAGGAC AGGCCGGGCC AGGCGCTGGA AGTGCTTGAA CCGCTCATCA AGCAACCGCC CGACGCCAAA TACGGTGAGG CGGCCATGAA CCTCTCGCTC ACGGTCTACC TCGGCACCGA AAGCTGGCAG CCCATCCTCG ACCTCGCCGA GAGCGTCGCG GGCTGGAAGC TCTCGCCCCC GGCACAGCGC CAGCGCGACT ATGCCGTGGC GCTCGCCCAT GAGAACCTGA AGCAGCAGGA CAAGTCCGTT CCCCTGTGGG AGAAGCTGGA CAAGGACCCC GACCTGCCCG AAGACCAGAA GGCCTATGTA ACCTTCTTCC TCTCCCGTGA CGCCGAACGC AAACGCGACC TCCAGCAGGC CTATATGCTC AACAAGGACG CCCTCGCCCG GTTCGTGGCC CTCGGAGAGA AGGACAAGGA AAAGGCCGAC AACGCCCGCA TCCGCGACTG CATCGCCTCG CTGATGGACA TCACCGAAGC GGCGGGACGC ACCCGTGAGG CCCTCGACTG GGCAGGGCAG TTCGCCCATT ACCTCACCAA GGACACGCCC GAGTACACCG CCCTGGACTA CCGGGTGGCG CGGCTGCACC GCAAGCTGGG CGACCTCGGC GAATGGCGGC GCATCCTCGA CGGCATCATC GCCAAGGAAC CGGACTCGGT CTACGGCAAG ATGGCCGCGT CGGAACTTCG CACCTACGAC GTGACGCGCG GTGCATCCTC ATTCACCAAC TGA
|
Protein sequence | MRKTPAGLAR FLLAAWLILP PMAHEAFAAS WQWASMPRRE RVTIALDAPQ ADIRHSRTGR QEITLPLGGS GLLQRTGPTP ASARILDDVK VEGADVRIST RTPGFGYILT RPDPGHVVID LFEDPLGNSW QPDTTAPAAT ATAATATAPA TTAPVGQAAE AMAPQAPQTT VTPQRPSAQT EAAPAAPAPV APPVAPPAPL KSRGIVETPL TDARPATGTV SGQLAPGNPV TPLPAQEPAA PTADAAPAPG PTTAPPPLPA PHTGADRQRA YFTVPYALRG RVNFGGPEDW PQEQAVSASF GAPQGNATNG ATAQADNAVG GRMAPRDGTA VTAPAGTPQQ PAGQATADQT AQGQTADAPA PLQPVTPQAD PTAQASASAP ANATVAHSPA GNGTAANATG VVYVDEKGNP VPPPPDPPQL LAEAKSLIST KDWPGALERL GLLKGLPDIP SDMREEVLYL ISDTLFAQHK DSILEGYESI MDATSEAMNY NIRSPRVPLA LLRLGLLNLR AGNTREAEAY FALMKRQYPH DDNIPLAYFY LGEDQFRKGQ YQKAADQFQY ILQNHPESRY VRESSVFLAR SLHRLGYLEQ ASAIMDFVDK RWPRLYLETP EYLLMAADVE TQTGRLDQAR ASYWTYFNIH PEGAENDVVL AKLGDIYAQQ KQDKAAREIY EEALRRFPDK DGGLIALLRL TEQGIYDKPD VAAMFSVFDK PGASDPAEAY NRIIEGHPKS ALVPMARIKL AMWHLWKQKY PEALEAMAEF AAQHGKHELL DKAREVAVRA FGLLAADAVK EGDYDRVLRF WEDYPIVREQ AKNFGPELRL ALGMSFWKKD RPGQALEVLE PLIKQPPDAK YGEAAMNLSL TVYLGTESWQ PILDLAESVA GWKLSPPAQR QRDYAVALAH ENLKQQDKSV PLWEKLDKDP DLPEDQKAYV TFFLSRDAER KRDLQQAYML NKDALARFVA LGEKDKEKAD NARIRDCIAS LMDITEAAGR TREALDWAGQ FAHYLTKDTP EYTALDYRVA RLHRKLGDLG EWRRILDGII AKEPDSVYGK MAASELRTYD VTRGASSFTN
|
| |