Gene Dvul_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2026 
Symbol 
ID4662513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2357913 
End bp2361977 
Gene Length4065 bp 
Protein Length1354 aa 
Translation table11 
GC content64% 
IMG OID639820269 
Producthypothetical protein 
Protein accessionYP_967469 
Protein GI120603069 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.368957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA AACAGAATGA CGCCACGAAA CATCCACACA TACGGGAACT GCTCAGGCGC 
ATCACCCCTG CCACACGGCG TGGACGCGTC ATTCTCTGGT CGTTGCTGGG GTTGTACATC
TTCTGGCTTG TCATCGGCGG TCTCGTACTT CCCCCCGTTG TCCGCTCTGA ACTCGAACGC
ACCATGGCAC AACACCTTCG GGCCACCTGC ACCGTAGAGA AGGTGACCAT CAACCCCTTC
ACCCTGCGTA TCCGTGTCCT CGGCGTGAAG GTCCCCGATG CCAGCGGCGA AGGGGTGCTC
TTCGGCTTTC GTGAACTGAG CATCGCCCCA AGTCCGGCAG CCCTGTTCCG GCTGGCACCT
TCGCTCGCTT CGGCGCGCCT CGTAGAACCC GTTGTGGACA TCACCTATTT CGGCGAAGGC
CGCTTCTCGT TCTCGGACAT CGTGCCCCCG TCCGAGGCGA CGACAGACGA CAAGGCTACT
CCGGTATTCC CCTTCGTCAT CAGCGATTTC GAACTCGTGG ACGGCAGCTT CATCTTCCGT
GACGAACCGC GCGGCGTGAC GCACACCATC GCGGACATCG ACTTCATCGT GCCTTTCACC
TCCTCGCTGG ACATGTTCCG CGACACCCCC ATCACCCCAT CCCTCAATGC CACAGTCGAC
GGCAGCCGCA TGACCGTGGC TGGCAGACTG CTTCCCTTCG CCGAGACACA ACGAACGGAA
TTCGACATCG CCACCGAGGA TGTGGCCCTC GAACAATTCA AGGCCTATCT TGCGCCCTTC
ACTCCACTGC GGCTTGAGCA GGGCAAGGCT CGTCTCGAGC TCGACCTGCT GGTTGAAAGG
CTTCCCTCCG GGCAGGTCGA ACTGGGGCTT GGAGGAGCAC TGCGCCTTTC CGACATCCTT
CTCAATACGC CCGACGGGAA AAAGGCGGCA GCACTACGTG AAGCCGAACT GCGTCTGCAC
AAGTTCACAC TGGCGGAACG CCGCGTGGAA CTCGAGTCCG CAACTGTGGA CGGCCTTTAT
GTCAAGGCAG TGCGCGACAC CGACGGCACC GTCGACTGGC AACGCTGGAT AAGCCCGGCC
TCTGGCAAGG CAGCCCCTGT CACGCCTGCG ACATCGCGCA CGGCCATGCA GAACGCGACC
GGGGCAGCCA TGACGCAGAA TGCGACCGGG GCAGCCTCCA TGCCTGCCTC CGCCACCGCC
ACGGACAAGG CTTCGGCAAA GAACCCTGCG GCGGGAACGC CCCCCGTAGC GGCCACCTCC
GGCAGTAGTC CCGAGGGACA TTCCACCGGC AAATCCGCAG CCCCCACAGC CGACAGCAAG
GCCTTCATCG TCGAAGGGGC TGCCTTGCAT CTGAGCGATG CGACGCTTGT CTGGCACGAC
GCATCACTCT CCGGCACACG CGAGATAGCG GTCACCGGGC TCGACGTGCA GATTCCCCGA
TTCTCCACGG GTGACAACAA GACCATGCCG TTTGCCCTCA CTTTCGGATT GAATGGTCAG
GGGCGTTTCC ATGTAGATGG CGAGGCGACG CTCTCTCCGT TGAAGGTGAG CGCCGCCATC
GACAGCACAG GACTGCCGCT TGCCGCCGCG CGGCCCTTCG CCGGGGGCAC ACCTGCCTCC
GACATCGCAG GGAGCTTCGG GGGTAGAGCC AAGGTGGTCT TTCAATCCTC TCCCGCGCTG
CAACTGACCG TATCCGAAGG GGCGCTCATG GTGGACGACC TCGCCCTCGC CGCACAGGGA
AAACAGGGCC ATGCCCTTGG CGTCAAACAC ATCGGGCTCA AGGGACTCGC CGTCGATTAC
GGCAAACAGT CCATCCGGGC AGCGGTGCTC GCGCTGACCT CCCCCTCGGT CAATCTCATC
CTCGGTGACG ACGGGCTGCC GCTGCTTCCG GCATCCACCG GAGACTCACA GCCCGACACG
GGCAAGAAGG TCAAAGGCGA CAGGCAACGA CGCGCACAAG GCAAGGCGGG CTCTTCCACA
AAGGTGGAGT CCAAAGCCCG GGGCACGCAA GAGAAGGCCA GACGCGGCGA CACCAAGACC
GCAGACAGGG ACTGGAACCT TGTGCTGGAC AGCCTCGAAC TGGATGGAGG TACCGTCAAC
ATCACCGAAC GCGGCGCTAA GGCACCGACC CTGCAGGTGT CCGACCTTCG CGTGCGCACG
GGAGCGCTCT CGCCAGACCT GACCCAGCGG CTGCCCTTCG ACGCATCCAT GCGCTGGCAG
AAGGACGGGC AACTCGCCCT CAAGGGGAAT GTGCGCATCC GGCCTCTCGA CCTCGACCTG
AACGTCAAGG CCACAAAGGT CGACCTTGCG CCGCTCGACA TCCCCCTTGC CATGTCCACG
GCCATGCAGG CCGGGGGGCG CCTTTCCGGC GATGTCCGGC TGGGCGCACG GGAACGCGGC
GATGACATCC AGATGACGGC ATCGGGCAGG ACGCAACTGG ATGACGCCCG TCTGCGCAGG
CGCGGAGACC GGCGTGACCT CATCTCGCTT CGGAGGCTGG CGGTCAGGGA CTTCCGGTAC
GGTTCGTCCC CGCTGCGCGT CGAGATTGGC GACATACTCC TCGACCGGCC GCAGGTGTTC
CTCGTGCTGC ACAAGGACGG CACCACCAAC GTACTGCGCG CCCTCGACCC GGAAGGCGCG
GAACGCAGGG CTGCGGCCAT CCGTACGGCA GAAAAGGCCA AGGCCGCAGA AGGAGCGAAG
AAGCAGGGAA CGCAGACCGG GGCCGAGGCC TCCTCGGGTC TTGCGTCGAA GCCGGTGTCC
CCCGCCCCTG TGGCTGCCGG AGAGACGACT GCCGAGGCAG ACGCGTCGGG GGCAGACGCT
TCGGGGGCAG GAGCGGACGC ATCCTCCGCT GCGGGAGCCC GGGCAGAAGC ACCAGCATCG
CTGTTCGACA GGTTCACGCT GGGCGAGGTG ACCGTCCGCG GTGGCAAGAT AGCCTTCCGC
GACGAGCGTT TCTCTCCGGC CTTCGACACC TCGCTGGACA AGGTCGATGC AGCCGTGACC
GGATTCACCA TGGCCCCGGA AAGCCGCGCC GAGGTCTCGG CTGGCGGAAC GCTCGAGGGT
GTGCCCGTCA AGCTCACGGG GACGCTCAAC CCCGTATCGA CGCCCCCCTT CGCCGACATC
GTCTTCTCCA TGGAAGGGCT GGACCTCGTT CCGGTATCAC CATACGCGCT TCAGTATATC
GCCTACCCGG TCGACAAGGG ACGCCTCACA GCCCGTTTGC AATTGCAGAC ATCCGAATGG
GTACTGAGCG CCGATTCGAA GTTCCTGCTG GAAGACATCG AACTTGGCGA CAAGGACTCG
CGTCCCGATG CCCCGGACTA TCCGGTCAAA CTCGGACTGG CCCTCTTGCG GGGGCTTGAT
GGCAACGTGT CCATCGACCT GCCGGTACGG GGGCGACTCG ACGACCCCAA CTTCAGGCTT
GGAGGCGTGG TGGTCCAGGC CGTCATGAAC CTCATGGTCA AGGTGGTCAC ATCGCCCTTC
GCCCTCGTCG GCAGCGTGGT GCGCCTTGCC GGAGGCGGCG GGCAGGACAT GCGCAACGTC
CCCTTCGAAC CCGGGCGGGA AACGCTCTCC GAAAGGGCCG AGGCACAACT GGCAAGCGTG
GCGGAGGTAC TGCGGCAAAG GCCCGGACTT TCACTGGAGG TGCGCGGCAT GGTCGACCCT
GCGACCGATG GGCAGGGGCT TCGCGAAGTC GCCCTTCTGC GCCGGATGCA GGAGGCGAAG
TATGCTTCGC TGTGGCGCGG TGAGCGCGCC AAGACGACGG TCGAGGCCAT CACCATCGAA
GACGACGAGT ACGACGACCT TCTCGAGTCC GTCTACAAGG ATGCCCCCTT CGACAAACCC
CGCAATGTTC TGGGACTCGT AAAAGACCAG CCTCGCGAGG TCATGGAAAA GGCCTTCTAT
GAGCACGAGG ACGTCACGGA CGACGACCTG ACAGCCCTCG CACAGCAGCG GGCACGTGCC
GTGCGCGACA GATTGCTTGA AATCGACCCG GCACTCGGCG CACGGCTTTC ACTCGCCGCT
GCCACAGGCA AGGGGAAGAG CGCGGCAGAG ATGCTCTTGC GCTAG
 
Protein sequence
MTAKQNDATK HPHIRELLRR ITPATRRGRV ILWSLLGLYI FWLVIGGLVL PPVVRSELER 
TMAQHLRATC TVEKVTINPF TLRIRVLGVK VPDASGEGVL FGFRELSIAP SPAALFRLAP
SLASARLVEP VVDITYFGEG RFSFSDIVPP SEATTDDKAT PVFPFVISDF ELVDGSFIFR
DEPRGVTHTI ADIDFIVPFT SSLDMFRDTP ITPSLNATVD GSRMTVAGRL LPFAETQRTE
FDIATEDVAL EQFKAYLAPF TPLRLEQGKA RLELDLLVER LPSGQVELGL GGALRLSDIL
LNTPDGKKAA ALREAELRLH KFTLAERRVE LESATVDGLY VKAVRDTDGT VDWQRWISPA
SGKAAPVTPA TSRTAMQNAT GAAMTQNATG AASMPASATA TDKASAKNPA AGTPPVAATS
GSSPEGHSTG KSAAPTADSK AFIVEGAALH LSDATLVWHD ASLSGTREIA VTGLDVQIPR
FSTGDNKTMP FALTFGLNGQ GRFHVDGEAT LSPLKVSAAI DSTGLPLAAA RPFAGGTPAS
DIAGSFGGRA KVVFQSSPAL QLTVSEGALM VDDLALAAQG KQGHALGVKH IGLKGLAVDY
GKQSIRAAVL ALTSPSVNLI LGDDGLPLLP ASTGDSQPDT GKKVKGDRQR RAQGKAGSST
KVESKARGTQ EKARRGDTKT ADRDWNLVLD SLELDGGTVN ITERGAKAPT LQVSDLRVRT
GALSPDLTQR LPFDASMRWQ KDGQLALKGN VRIRPLDLDL NVKATKVDLA PLDIPLAMST
AMQAGGRLSG DVRLGARERG DDIQMTASGR TQLDDARLRR RGDRRDLISL RRLAVRDFRY
GSSPLRVEIG DILLDRPQVF LVLHKDGTTN VLRALDPEGA ERRAAAIRTA EKAKAAEGAK
KQGTQTGAEA SSGLASKPVS PAPVAAGETT AEADASGADA SGAGADASSA AGARAEAPAS
LFDRFTLGEV TVRGGKIAFR DERFSPAFDT SLDKVDAAVT GFTMAPESRA EVSAGGTLEG
VPVKLTGTLN PVSTPPFADI VFSMEGLDLV PVSPYALQYI AYPVDKGRLT ARLQLQTSEW
VLSADSKFLL EDIELGDKDS RPDAPDYPVK LGLALLRGLD GNVSIDLPVR GRLDDPNFRL
GGVVVQAVMN LMVKVVTSPF ALVGSVVRLA GGGGQDMRNV PFEPGRETLS ERAEAQLASV
AEVLRQRPGL SLEVRGMVDP ATDGQGLREV ALLRRMQEAK YASLWRGERA KTTVEAITIE
DDEYDDLLES VYKDAPFDKP RNVLGLVKDQ PREVMEKAFY EHEDVTDDDL TALAQQRARA
VRDRLLEIDP ALGARLSLAA ATGKGKSAAE MLLR