Gene Dvul_2042 details


Gene Information

Locus tag: Dvul_2042
Symbol:
ID: 4662396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2379214
End bp: 2380953
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 60%
IMG OID: 639820285
Product: hypothetical protein
Protein accession: YP_967485
Protein GI: 120603085
COG category: [S] Function unknown
COG ID: [COG2855] Predicted membrane protein
TIGRFAM ID: [TIGR00698] conserved hypothetical integral membrane protein
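
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2379214
end_bp = 2380953
reported_gene_length = 1740     # bp
reported_protein_length = 579   # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both derived values agree with the reported gene and protein lengths.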


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAA ATGAAAGCAA CGTCGTGGTC GACCACGGGC AGTCGCGTCT GTCCGACCTG 
TGGACCAAAG AGGACTACTG GGCCATCTGG CTTGGTTTCG TCATTCTCAT CGCAGGGATG
TGGTTGTTCC TCGCCAATCC TTCGCCCGAA TTCGCGCAGA AGGTGGACAA GGCCAACGCG
GTCATGGCGG CAGAGGCCGA ACGCGCGCCC TTCAAGACCC TTGCCTACTA CAAGGCACAG
GATGACAAGG GCAAGCTCAA GGCCATGGAC TCCGCCACCG GCAAGTCCAT CGGAGCCTTC
CTGAAGGCGC CCGGAGGCTG GACATCGAAT CCTCTCGAGT CCTTCGTGCT TTCCAAGGAA
GCCGCCGAGG AGCGCAACGC CGCGGCAAAG TCGAAGTTCG AGGCCGCAAA GGCCAAGTCC
GATGCCGCAT TCGCTGCCGC GCAGGTGGCT GAGGCAGCCG CCGCAGAAGC CGGTTTTGCC
GACACCGCAC TCAATGACGC CGCGCAGGGC AAGATCGCCG AATGGCGCGC AGACCTTGCC
AAGATGAAGT CCGCCGAGAA GAAGGTGAAG ACCAAGGCCT TCAACATCTC CACGTCGCTG
CCGATGCTCA TGGTGGTCAT GGGGCTGTTC TTCGCCATCG GCATGAAGTT CATGGGCCAT
GATGTTCCCA AGTTCCTTGT CGGCTTCATC GGCGTGTTCG TGGTTGCCGT CATCGCGCAG
ATGATGGGCC ACCAGAGCAC CATGAAGTAC TGGGGCATCG GCACCGAAGC CTGGGCCATC
ATCATCGGGA TGCTCATCGC CAACACCGTG GGCACGCCCA ATTTCATCAA GCCCGCCCTG
CAGGTCGAGT ACTACATCAA GACCGGTCTC GTGCTGCTGG GTGCCGAAGT GCTGTTCGAC
AAGATCATCG CCATCGGCAT CCCCGGTATC TTCGTGGCAT GGGTCGTCAC CCCCATCGTG
CTCATCTGCA CCTTCATCTT CGGTCAGAAG ATCCTGAAGA TGCCTTCGAA GACGCTGAAC
ATGGTCATCT CCGCCGACAT GTCGGTGTGC GGCACCTCTG CTGCCATCGC CACGGCTGCG
GCCTGCCGCG CCAAGAAGGA GGAGCTCACC CTGTCCATCG GCCTGTCGCT GGTGTTCACC
GCCATCATGA TGATCGTCAT GCCTGCCTTC ATCAAGTCTG TGGGCATCCC CCAGATTCTC
GGCGGTGCAT GGATGGGTGG TACCATCGAC GCCACGGGTG CCGTTGCCGC TGCCGGTGCG
TTCCTCGGCG AGAAGGCCCT GTACGTGGCT GCCACCATCA AGATGATCCA GAACGTGCTC
ATCGGTGTGG TCGCCTTCGG TGTGGCCGTG TACTGGTGCG CCCGCGTCGA ATGCACTTCG
GGCCGCAGCG TGGGCTGGAT CGAAATCTGG AACCGCTTCC CCAAGTTCGT CCTCGGCTTC
CTCACCGCGT CGATCATCTT CTCGATCATC TCCGGCAGCC TCGGCTCGGA CATGAGCCAG
ATCATGGTCA ACCAGGGCGT CCTGAAGGGG CTGTCGTCGC CGCTGCGTGG CTGGTTCTTC
TGCCTCGCCT TCACCGCCAT CGGTCTTGCC ACGAACTTCC GTGAACTGGC GCACTACTTC
AAGGGCGGCA AGCCGCTCAT CCTGTACGTG TGCGGTCAGA GCTTCAACCT TGTGTTGACG
CTGACCATGG CCTACATCAT GTTCTACATC GTCTTCCCGG AAATCACCGC GAAGATTTAA
 
Protein sequence
MAENESNVVV DHGQSRLSDL WTKEDYWAIW LGFVILIAGM WLFLANPSPE FAQKVDKANA 
VMAAEAERAP FKTLAYYKAQ DDKGKLKAMD SATGKSIGAF LKAPGGWTSN PLESFVLSKE
AAEERNAAAK SKFEAAKAKS DAAFAAAQVA EAAAAEAGFA DTALNDAAQG KIAEWRADLA
KMKSAEKKVK TKAFNISTSL PMLMVVMGLF FAIGMKFMGH DVPKFLVGFI GVFVVAVIAQ
MMGHQSTMKY WGIGTEAWAI IIGMLIANTV GTPNFIKPAL QVEYYIKTGL VLLGAEVLFD
KIIAIGIPGI FVAWVVTPIV LICTFIFGQK ILKMPSKTLN MVISADMSVC GTSAAIATAA
ACRAKKEELT LSIGLSLVFT AIMMIVMPAF IKSVGIPQIL GGAWMGGTID ATGAVAAAGA
FLGEKALYVA ATIKMIQNVL IGVVAFGVAV YWCARVECTS GRSVGWIEIW NRFPKFVLGF
LTASIIFSII SGSLGSDMSQ IMVNQGVLKG LSSPLRGWFF CLAFTAIGLA TNFRELAHYF
KGGKPLILYV CGQSFNLVLT LTMAYIMFYI VFPEITAKI
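
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). The helper names `CODON_TABLE`, `clean` and `translate` are hypothetical, and only the first and last 60-base rows of the gene sequence are used here rather than the whole gene.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Amino acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(raw.split()).upper()

def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First and last rows of the gene sequence, copied from this record.
first_row = "ATGGCTGAAA ATGAAAGCAA CGTCGTGGTC GACCACGGGC AGTCGCGTCT GTCCGACCTG"
last_row = "CTGACCATGG CCTACATCAT GTTCTACATC GTCTTCCCGG AAATCACCGC GAAGATTTAA"

print(translate(first_row))  # MAENESNVVVDHGQSRLSDL
print(translate(last_row))   # LTMAYIMFYIVFPEITAKI*
```

The first row yields the opening 20 residues of the protein sequence, and the last row yields its final 19 residues followed by the stop codon TAA, which is why the 1740 bp gene encodes 579 amino acids.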