Gene Dvul_0470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0470 
Symbol 
ID4662066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp597018 
End bp600230 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content67% 
IMG OID639818679 
ProductTPR repeat-containing protein 
Protein accessionYP_965920 
Protein GI120601520 
COG category[S] Function unknown 
COG ID[COG1729] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.524501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.148928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAGA CACCCGCCGG TCTTGCCCGG TTCCTCCTTG CGGCATGGCT TATCCTGCCG 
CCCATGGCCC ACGAAGCGTT CGCCGCCTCG TGGCAGTGGG CGTCGATGCC GCGTCGGGAA
CGTGTCACCA TCGCCCTCGA CGCGCCGCAG GCAGACATTC GCCACAGCCG TACGGGTCGG
CAGGAGATAA CGCTTCCGCT TGGGGGGTCG GGTCTGCTGC AACGTACCGG GCCAACCCCG
GCGTCGGCCC GCATTCTCGA CGACGTGAAG GTCGAGGGGG CAGACGTCCG CATCTCCACC
CGCACCCCGG GCTTCGGCTA CATACTCACC CGCCCCGACC CAGGGCATGT CGTCATCGAC
CTTTTTGAAG ACCCGCTTGG CAACTCGTGG CAGCCGGATA CCACGGCACC TGCCGCGACA
GCGACAGCGG CGACCGCAAC CGCCCCGGCG ACAACTGCAC CAGTTGGACA AGCGGCGGAG
GCGATGGCGC CACAGGCGCC ACAGACCACC GTCACACCGC AACGCCCCTC TGCCCAGACG
GAGGCGGCAC CTGCCGCGCC CGCCCCAGTG GCCCCTCCAG TGGCACCCCC TGCGCCGCTC
AAATCACGTG GCATCGTGGA GACGCCCCTT ACAGACGCTC GGCCCGCCAC AGGGACCGTG
AGCGGACAAC TGGCCCCCGG CAACCCCGTC ACCCCGCTTC CGGCGCAGGA ACCGGCTGCG
CCGACAGCCG ATGCGGCCCC GGCACCCGGC CCCACCACCG CCCCGCCGCC CCTGCCCGCC
CCGCACACGG GGGCCGACAG GCAGCGTGCC TACTTCACGG TCCCCTACGC ACTGCGGGGA
CGCGTCAACT TCGGCGGCCC CGAAGACTGG CCACAGGAAC AGGCCGTTTC GGCTTCGTTC
GGTGCGCCGC AAGGCAATGC CACCAACGGT GCCACAGCAC AGGCTGACAA TGCCGTGGGC
GGGCGCATGG CCCCCCGCGA CGGCACCGCC GTCACAGCCC CCGCCGGGAC CCCGCAGCAG
CCCGCCGGAC AGGCAACGGC TGACCAGACA GCGCAGGGGC AGACTGCCGA CGCCCCCGCG
CCGCTGCAAC CCGTCACCCC GCAGGCCGAC CCCACGGCAC AGGCCAGCGC CTCCGCGCCC
GCCAATGCCA CGGTCGCCCA TTCTCCCGCA GGCAACGGCA CGGCAGCCAA CGCCACCGGC
GTGGTCTATG TGGACGAGAA GGGCAATCCC GTTCCCCCGC CACCCGACCC GCCACAGTTG
CTGGCAGAGG CAAAGAGCCT CATCTCCACC AAGGACTGGC CCGGTGCGCT GGAACGTCTC
GGTCTGCTCA AGGGGTTGCC CGACATCCCG TCCGACATGC GCGAGGAAGT GCTATACCTC
ATCAGTGACA CGCTCTTCGC CCAGCACAAG GACAGCATCC TCGAGGGCTA CGAGAGCATC
ATGGACGCCA CCAGCGAAGC CATGAACTAC AATATCCGCT CGCCGCGGGT GCCGCTGGCC
CTGCTGCGCC TCGGCCTTCT CAATCTGCGG GCCGGCAACA CGCGCGAGGC CGAAGCCTAC
TTCGCGCTGA TGAAGCGCCA GTACCCGCAC GACGACAACA TCCCCCTCGC CTACTTCTAT
CTTGGCGAAG ACCAGTTCAG GAAGGGCCAG TACCAGAAGG CCGCCGACCA GTTCCAGTAC
ATCCTGCAGA ACCACCCCGA AAGCCGCTAC GTACGCGAAT CGTCGGTGTT CCTTGCACGG
TCGCTGCACC GCCTCGGCTA CCTCGAACAG GCGTCTGCCA TCATGGACTT CGTGGACAAG
CGCTGGCCGC GCCTCTACCT CGAAACCCCC GAATACCTGC TCATGGCCGC CGACGTGGAG
ACGCAGACAG GGCGTCTCGA CCAGGCCCGC GCCTCATACT GGACGTACTT CAACATCCAC
CCCGAAGGTG CGGAGAACGA CGTGGTGCTT GCCAAGCTTG GCGACATCTA CGCGCAGCAG
AAACAGGACA AGGCCGCCCG CGAAATCTAT GAAGAGGCCC TGCGACGCTT CCCCGACAAG
GACGGCGGGC TCATCGCCCT GCTTCGCCTC ACGGAACAGG GCATCTATGA CAAGCCCGAT
GTGGCAGCCA TGTTCTCGGT CTTCGACAAG CCCGGCGCCA GCGACCCCGC AGAGGCGTAC
AACCGCATCA TCGAGGGACA TCCGAAAAGC GCCCTGGTCC CCATGGCCCG CATCAAGCTC
GCCATGTGGC ACCTGTGGAA GCAGAAGTAC CCCGAAGCCC TCGAAGCCAT GGCCGAATTC
GCCGCACAGC ACGGCAAGCA CGAACTGCTG GACAAGGCAC GCGAGGTGGC CGTACGCGCC
TTCGGCTTGC TCGCCGCAGA CGCCGTCAAG GAAGGCGATT ACGACAGGGT GCTGCGATTC
TGGGAAGACT ACCCCATCGT CCGTGAACAG GCGAAGAACT TCGGCCCCGA ACTCAGGCTC
GCTCTTGGCA TGAGCTTCTG GAAGAAGGAC AGGCCGGGCC AGGCGCTGGA AGTGCTTGAA
CCGCTCATCA AGCAACCGCC CGACGCCAAA TACGGTGAGG CGGCCATGAA CCTCTCGCTC
ACGGTCTACC TCGGCACCGA AAGCTGGCAG CCCATCCTCG ACCTCGCCGA GAGCGTCGCG
GGCTGGAAGC TCTCGCCCCC GGCACAGCGC CAGCGCGACT ATGCCGTGGC GCTCGCCCAT
GAGAACCTGA AGCAGCAGGA CAAGTCCGTT CCCCTGTGGG AGAAGCTGGA CAAGGACCCC
GACCTGCCCG AAGACCAGAA GGCCTATGTA ACCTTCTTCC TCTCCCGTGA CGCCGAACGC
AAACGCGACC TCCAGCAGGC CTATATGCTC AACAAGGACG CCCTCGCCCG GTTCGTGGCC
CTCGGAGAGA AGGACAAGGA AAAGGCCGAC AACGCCCGCA TCCGCGACTG CATCGCCTCG
CTGATGGACA TCACCGAAGC GGCGGGACGC ACCCGTGAGG CCCTCGACTG GGCAGGGCAG
TTCGCCCATT ACCTCACCAA GGACACGCCC GAGTACACCG CCCTGGACTA CCGGGTGGCG
CGGCTGCACC GCAAGCTGGG CGACCTCGGC GAATGGCGGC GCATCCTCGA CGGCATCATC
GCCAAGGAAC CGGACTCGGT CTACGGCAAG ATGGCCGCGT CGGAACTTCG CACCTACGAC
GTGACGCGCG GTGCATCCTC ATTCACCAAC TGA
 
Protein sequence
MRKTPAGLAR FLLAAWLILP PMAHEAFAAS WQWASMPRRE RVTIALDAPQ ADIRHSRTGR 
QEITLPLGGS GLLQRTGPTP ASARILDDVK VEGADVRIST RTPGFGYILT RPDPGHVVID
LFEDPLGNSW QPDTTAPAAT ATAATATAPA TTAPVGQAAE AMAPQAPQTT VTPQRPSAQT
EAAPAAPAPV APPVAPPAPL KSRGIVETPL TDARPATGTV SGQLAPGNPV TPLPAQEPAA
PTADAAPAPG PTTAPPPLPA PHTGADRQRA YFTVPYALRG RVNFGGPEDW PQEQAVSASF
GAPQGNATNG ATAQADNAVG GRMAPRDGTA VTAPAGTPQQ PAGQATADQT AQGQTADAPA
PLQPVTPQAD PTAQASASAP ANATVAHSPA GNGTAANATG VVYVDEKGNP VPPPPDPPQL
LAEAKSLIST KDWPGALERL GLLKGLPDIP SDMREEVLYL ISDTLFAQHK DSILEGYESI
MDATSEAMNY NIRSPRVPLA LLRLGLLNLR AGNTREAEAY FALMKRQYPH DDNIPLAYFY
LGEDQFRKGQ YQKAADQFQY ILQNHPESRY VRESSVFLAR SLHRLGYLEQ ASAIMDFVDK
RWPRLYLETP EYLLMAADVE TQTGRLDQAR ASYWTYFNIH PEGAENDVVL AKLGDIYAQQ
KQDKAAREIY EEALRRFPDK DGGLIALLRL TEQGIYDKPD VAAMFSVFDK PGASDPAEAY
NRIIEGHPKS ALVPMARIKL AMWHLWKQKY PEALEAMAEF AAQHGKHELL DKAREVAVRA
FGLLAADAVK EGDYDRVLRF WEDYPIVREQ AKNFGPELRL ALGMSFWKKD RPGQALEVLE
PLIKQPPDAK YGEAAMNLSL TVYLGTESWQ PILDLAESVA GWKLSPPAQR QRDYAVALAH
ENLKQQDKSV PLWEKLDKDP DLPEDQKAYV TFFLSRDAER KRDLQQAYML NKDALARFVA
LGEKDKEKAD NARIRDCIAS LMDITEAAGR TREALDWAGQ FAHYLTKDTP EYTALDYRVA
RLHRKLGDLG EWRRILDGII AKEPDSVYGK MAASELRTYD VTRGASSFTN