Gene Dvul_3073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_3073 
Symbol 
ID4661947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008741 
Strand
Start bp159736 
End bp162396 
Gene Length2661 bp 
Protein Length886 aa 
Translation table11 
GC content70% 
IMG OID639813993 
ProductTPR repeat-containing protein 
Protein accessionYP_961272 
Protein GI120586927 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID[TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGAATA AGCTGTGCCT TTCCGTCTGC TGTGCGATTC TCGTTCTCAC TGCCTCGTCA 
TGCGGGGCGG ACAAGTCATC GGACTTCCTC ACCGAAGGGC GCAAGCTCAT GGACGCGGGC
AATGCCGCGG GCGCGGTGGT GCTGTTCCGC AATGCACTTG AGAAGACCCC CGCCGATTAT
ACACTGCACC TTGAACTGGG CAGGGCCTAT ATCGCCCTTG GCAAGCTCGA CCTCGCCGAG
GGCGAACTGC AGAAGTGCCT GCGCCAGCAA CCCGACGACC CGCCGCTGAA CCTCGCCCTC
GGCGAACTCT ACGTGGCCCG CAACGAACCG GCCAAGGCGC TGCCGCATCT GGCCCGCTAC
GAACAGGGCG CGGGCGAGAC GGCCGTCTCG CGTGAACTGG CGGGGCTGGC CCATGCGCAG
GCGCGGTCGC CGGAAGAGGC GCGCGCGGCC CTCGAACGCT CCATCGCCCT CGACGCCAAG
CGCGTCACGC CACGTCTGGC GCTGGCGCGG CTGTTCCTCT ACAGGGGCGA CCTCAAGCGC
GCCGTCGCCA CCGTGGACGA GGCACTCGCC GTCGCCCCCG AAGACAGGTC GGCCCTCACG
CTGCGTGGCG ACCTGCTGCT GCGGCAGGGC GACGCGCCCG GTGCGCTGGT GGTGTTCCGC
AAGGTCGCCA GCCTGGCCCC CGGCGACGAC TATGCCCGCT ACATGTCGGG CCTTCTGGCC
CTGCAGACCG GCGACGAGGC CGGGGCCGCC GCCGTTGCGG CATCCATGAA CAAGGACTTC
AAGGACAACT CGCTGACCCT GATGCTGGAC GGCGCCCTCG CCGCGCAGCG CAAGGACTAC
ACGCTTGCGG CGTCGCTGTT CCAGCGCAGC GTGGCCATGC GTCCAAGCCT CGAGGGCTAC
TACAAGCTTG GCATGGCCCT CTACGGCAAG GGCGACCTCG AGACGGCCCT CAGCCAGTTC
AACCGCGTGC TCGAAGCCAC ACCGGAATAC GATGCGGCCC GCCGCATGAC CGTCACCATC
CTGCTTGCGC AACGGCGAGT GGCCGAGGCA CGGCAGGAGG CGCAGAAGCT CGTGGAGCGC
AACCCCTCCG ACGCCGCGGC GCACTTCATG CTGGCCTCGG CGCAGATGGC GGCGGGCGAC
AGGGCCGGGG CCGAACGCTC GTTCGAGGCG GGGCTTGCCC TGCAACCGGC CCATGTGCCC
GCACTCCTGC AACTCAGCCG CCTGAAGCAG GCCGACGGAC GGCCCGACGA GGCGCTTGAA
GACCTCAAGG CCGCCGTTGT CGCCGCACCC GACGACCTCG CCGTGCGCAA CGCCCTCTTC
GCCTACCATC TGGGGCGGGG CGACACGGGC AAGGGCGTCC AGGTCGTCCT CGAGGGCCTG
CGTGGCACTC CGCAGGACGC CATCCTCTAC ACCATGCTGG TGCCCGTCTA CGTCAACATG
GGCGACGAGG CGAAGGCCCT CGACGCGGTG GCGCAGGCGC ACAGGGCCGA CCCCGACTTC
CCCGACGCCT ACCTCGCCGG GCTGCGCCTC CACGCCGGGG CCGGCAGGGC TGAACAGGCC
CTTGCCGAGA GCGAGGCCTA TCTCGCCCGC AAGCCCGATG CACCCGGCTT CCTGCTGGCG
TCGGGCGCCC TGCTCGACCT TCTTGGCAGG ACGGCGGAGG CCGACGCCCG CTTCGACAAG
GCGCTGGCCG CCAACGACCC CCGCGTCTCG TTCGCGGTGG CCGAACGCGC CGTGGCCTCG
GGGCAGGACG AGAAGGCCCG TAGCGTTCTC GAAGAGGCCC TGCGTCAGCA CGACCAGACG
ACCCTGCGCG ACGCGCTCGC CATCCAGCTT GCGCGCATGG GCAAGCCCGA CGACGCGCTC
GCCCTCTATA CCGCCATCGA GACGCAACGC CCGCGCGAGG CGCTGCTGGG CCGCTACCGG
CTGCTGACGC ACCTCGACCG GCATCAGGAG GCGGCGGACA CGGCACGTGA ACTGGGACGG
CGTGAGAGCG CGTCGCCGCT GCCAGTCCTG CTCGAAGCCG CCGCGTTCGA GCGCATGGGG
CAGCGCCCGC AGGGCGTGGC CCTGCTCGAA GCGGCCCACC GCAAGAGCGG CGACCCCGAA
CTGCTGCTGG CCATGGGCGG TATGCTCGAA CGCAACGGCG ACGAGGCCCG TGCCGAGACC
TGCTACCGCG ACGCCCTCAA GGCGCGGCCC GACCATGTGC CCACCCTGCT GGCCTACGCG
GGGCTGGACA TGCGCCGCAA GGAATACGCC AAGGCCACGG CCCTCTACGA GAAGGCCGTG
AGCATCGCCC CCGACGACGT GGTGGCGCTG AACAATCTCG CCATGGCCTA CCTCGAGAAG
GCCAGCCGTG ATGCGACGCC GCAGAAGGCC CTGCGTCTGG CGTTGCAGGC TTACACCCGC
GCTCCCGACA ACCCCGCCGT GCTCGACACC CTCGGCGTGT GCATGATGGC CAACGGACGT
GCCGACGACG CAGCGCGCGC CTTCGGTCGT GCCGTGGTCG CCGTGCCCGG CAACCCGTCG
CTGCGCTACC GCCATGCCGA GGCGCTCTTC AAGGCGGGCC GCAAGGACAT GGCGGCTGAA
GAACTGCGTG TGGCCCTTCA GACGCCCGAC TTCCCCGAGG CGGCCAAGGC ACGCGACCTG
TTGCGCAAGA CCGGCAACTG A
 
Protein sequence
MWNKLCLSVC CAILVLTASS CGADKSSDFL TEGRKLMDAG NAAGAVVLFR NALEKTPADY 
TLHLELGRAY IALGKLDLAE GELQKCLRQQ PDDPPLNLAL GELYVARNEP AKALPHLARY
EQGAGETAVS RELAGLAHAQ ARSPEEARAA LERSIALDAK RVTPRLALAR LFLYRGDLKR
AVATVDEALA VAPEDRSALT LRGDLLLRQG DAPGALVVFR KVASLAPGDD YARYMSGLLA
LQTGDEAGAA AVAASMNKDF KDNSLTLMLD GALAAQRKDY TLAASLFQRS VAMRPSLEGY
YKLGMALYGK GDLETALSQF NRVLEATPEY DAARRMTVTI LLAQRRVAEA RQEAQKLVER
NPSDAAAHFM LASAQMAAGD RAGAERSFEA GLALQPAHVP ALLQLSRLKQ ADGRPDEALE
DLKAAVVAAP DDLAVRNALF AYHLGRGDTG KGVQVVLEGL RGTPQDAILY TMLVPVYVNM
GDEAKALDAV AQAHRADPDF PDAYLAGLRL HAGAGRAEQA LAESEAYLAR KPDAPGFLLA
SGALLDLLGR TAEADARFDK ALAANDPRVS FAVAERAVAS GQDEKARSVL EEALRQHDQT
TLRDALAIQL ARMGKPDDAL ALYTAIETQR PREALLGRYR LLTHLDRHQE AADTARELGR
RESASPLPVL LEAAAFERMG QRPQGVALLE AAHRKSGDPE LLLAMGGMLE RNGDEARAET
CYRDALKARP DHVPTLLAYA GLDMRRKEYA KATALYEKAV SIAPDDVVAL NNLAMAYLEK
ASRDATPQKA LRLALQAYTR APDNPAVLDT LGVCMMANGR ADDAARAFGR AVVAVPGNPS
LRYRHAEALF KAGRKDMAAE ELRVALQTPD FPEAAKARDL LRKTGN