Gene DvMF_0043 details

Gene Information

Locus tag: DvMF_0043
Symbol:
ID: 7171916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 57479
End bp: 59059
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 69%
IMG OID: 643538535
Product: hypothetical protein
Protein accession: YP_002434472
Protein GI: 218885151
COG category:
COG ID:
TIGRFAM ID:
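
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of this database.

```python
# Values copied from the gene record above.
START_BP = 57479
END_BP = 59059


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS: codon count minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the reported gene length (1581 bp) and protein length (526 aa).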


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCG AATTGCTGAC CCTCCGCGAA ATCGCCCGCC GCCTCGACGT GCCCCCCTCC 
AGTATCGCCT ATTACAAGGA CAGGTTCGCG CGCTTCCTGC CCGCCGGAGA AGGGCGGGGC
AGGCGGCTGC GCTATCCCGT GCATGTGCTG GACATCTTCA GGGAGATCAG GAGCATGTAC
ACGCGCAACG TGGCGGCGGA GCAGATCGAG GAGCGGCTGG AGGAACTGGT GGCCGCGCTG
TATGCCCCCG TGGGCCAGCC GGGGGACGCG GTCCGGTCCG GCCTTCATGG GGGGCTCTCG
GGTGGGCAAT CGGGCGGCAG GGCGGGCGCA CGGGCATCCG GCACCGGGGT GGAACCGCAC
CTGCGGGACA GGCTCCGCGC ATCCGATCCC ATGCCGGGCT CAGACTACTC GTCAGCGCAC
TCGCTGGCGC ACTCGCTGGC GCACTCGCCG GACCAACTGC CGGACGCCCC ACCGCACACT
GTGGCCTCTG CCCCTGCGGA CCCCGCCGTG GAAGGGCTGC CCGGCCTGCT CGACCGTATG
GCCGCGCTGC TTCAGGTGCA GGGGCACATG CTGGAAGAGC TTTCCAGCCT GCGTCGTCAG
GTGGAGGAAC TGCGTGCCGA CCGTGACATG CTGCTGACCG CCTTTGACGA GCGTGCTGCC
CGGCTGGACG AGGCGCTGGA ACTGCTGCGC CGCGAACGCG CCGAAGCGGT AACCCGCCTC
GTGGACGACT ATTATGCTGT GGTGTCTGGT TCTGCGGGGG CTGGCTCATC GGAGGCCGGC
TTATCGGAGG CCGGCTTATC GGAGGCCGGG CCATCGGAAG CCGGACGGGC GGAAGCCGGA
CGGAGGGATG CCGAACCAGC GGAAGCTGGA ACAAACGGCG CCAAGACCGA CGGGGGCAAC
GCAGGCTGGG CAGAAAACGC TGCTGGCGAC GGCACGCGCG TCGATGCTGC CGGGGCCAGC
CGGAACGCTT CCGGCCAAAC TGCTCCCGGC CAAGCAACTC CCGATCAGGA CGCTTCCGGC
CGGACGGCTT CCGGTCAAAG CACATCCCGT CAGGGCGCAT CCCATCACGA GACATCCGGT
CAGGCCGCAT CCGGTGCCGA CGCGCGGAAC GACGGCCCCT CGGCCCCCGG CGCTTCCGGT
TCCGCCTCGC AGGCGGATGC GGGCCATGCC CGGGCCACGT CCGGCAAGGA TTCCGCCACC
GCGCCGCCCG CAACCCTGTT CGACAGGCCG CTGGTCATCC GGCGCAACGG CGAATACCTG
GGCGTGTCAG GCCGCGACAA GGCCTTCACC CTGCGAGAAC TGCTGGCGCT GGTGGAGCGT
CACGCTGCCC GGCGGCATCG CGTGACCATG GGCTGGACCC GGTCCAGTTC TGCGTGGGTG
CTGCGCCTGA CCACCGACGA ACCTGCCAGC GGCAGGCAGG GCGGCAGCCG CACCCACGAA
TTGACCTTGC AGCCGGTCAC CACGCCCAGC GGCAACCACG TGACCCGGCT TTCGCGCCTT
GCCATCAACG ACGACACGGT TCCGGATCGC TTCCTGTTGG AATTGTTCCG GAATATCAAG
GAAAGCTACG AATTCCGGTA G
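
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is measured. The sketch below shows one way to do that and to compute the GC fraction for comparison with the reported GC content; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the rounding the database applies to its percentage is not known.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Usage on the first displayed line of the gene sequence.
first_line = "ATGATCCGCG AATTGCTGAC CCTCCGCGAA ATCGCCCGCC GCCTCGACGT GCCCCCCTCC"
print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 3))
```

Passing the full gene sequence instead of a single line gives the whole-gene GC fraction.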
 
Protein sequence
MIRELLTLRE IARRLDVPPS SIAYYKDRFA RFLPAGEGRG RRLRYPVHVL DIFREIRSMY 
TRNVAAEQIE ERLEELVAAL YAPVGQPGDA VRSGLHGGLS GGQSGGRAGA RASGTGVEPH
LRDRLRASDP MPGSDYSSAH SLAHSLAHSP DQLPDAPPHT VASAPADPAV EGLPGLLDRM
AALLQVQGHM LEELSSLRRQ VEELRADRDM LLTAFDERAA RLDEALELLR RERAEAVTRL
VDDYYAVVSG SAGAGSSEAG LSEAGLSEAG PSEAGRAEAG RRDAEPAEAG TNGAKTDGGN
AGWAENAAGD GTRVDAAGAS RNASGQTAPG QATPDQDASG RTASGQSTSR QGASHHETSG
QAASGADARN DGPSAPGASG SASQADAGHA RATSGKDSAT APPATLFDRP LVIRRNGEYL
GVSGRDKAFT LRELLALVER HAARRHRVTM GWTRSSSAWV LRLTTDEPAS GRQGGSRTHE
LTLQPVTTPS GNHVTRLSRL AINDDTVPDR FLLELFRNIK ESYEFR
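
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the gene sequence is given in the coding orientation; `translate` is a hypothetical helper, not the tool used to build this record. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard codon assignments are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a nucleotide sequence codon by codon.

    Whitespace is ignored and a trailing incomplete codon is dropped.
    Stop codons appear as "*" in the result.
    """
    seq = "".join(text.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# The first 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGATCCGCG AATTGCTGAC CCTCCGCGAA"))
```

Translating the full gene sequence yields one extra trailing `*` for the terminal TAG stop codon, which the protein listing omits.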