Gene Dvul_1685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1685 
Symbol 
ID4663670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1998797 
End bp2000347 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content61% 
IMG OID639819924 
Producthypothetical protein 
Protein accessionYP_967129 
Protein GI120602729 
COG category[S] Function unknown 
COG ID[COG2898] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAC ATCTCCGCTC TCTCGGGTCG TTCGTCGTTC TCTGCATTTT CGCCGGTGCC 
GCATGGCTGC TCTACCACGA AGTTCGCAAA TACCATCTTG CCGACATCCG ACAAAGCATT
GAACTCATTC CCGACCTGAG GCTTCTTGCC TCTTTCGGGC TCATGATCGT CAACTACCTC
ATCCTCGTGG GCTATGACGC CCTTGCGCTC AAGGCCATAG GGAGACCGCT GCCTCTCGGG
AAGACGGCGC TGGTATCGTT CGTTGGCTGC GCATGCAGCT ACAATTTCGG CGCACTGCTG
GGCGGAAGTT CCGTCCGCTA CCGCTTCTAT TCGGCGTGGG GTTTCACCAT CCCCGATGTC
GTGCGGCTTG TGCTCATGCT GGCGGTCACC TTCTGGGTGG GCGCCCTCGG GCTGGCGGGG
CTATCATTCG TCATCGAACC ATTGCCACTC CCTCCGGGGC TTGGTCTGCC CATAGATGAC
GTGCGCCCTC TCGGTTTTGC CCTGCTTGCT GCGACGACAG GCTATCTTCT TCTGACATTC
TTCGTTCGCA AGCCCCTGCA TTTTTTTGGT AGGGAGTTCG CGCTCCCCTG TCCGAAAATC
GCTTTCGCGC AGACACTGAC GGCATGCGCC GACCTTGTGG CCGCCGCGGG CTGCCTCTAC
ATGCTGATGC CGAGCGACCT TGGACTCGAC TTTCTCACGT TTCTCGCCGT GTATCTGCTG
GCCACAGTCG TGGTGGTCCT CACCCACGTT CCCGGCGGGG CGGGAGTCTT CGAACTTGTC
ATCCTCAGCC TTTCACAGAC AACGCACCCG CAGGCCGTCA TCGCGGCACT TCTGGCGTTC
CGGGTCATCT ACTACCTGCT GCCGCTCCTG TTCGCGGCGC TTCTGCTGGC GGGCTACGAA
GTGCAGGTAC GACGCCATCA GGCTGAAAAG GCATTTCGTG ACGCAGGACG CTGGATGTGG
GTACTCTCCC ATATCCTGCT TTCGTATGTG ACATTCGCGG CAGGTGTCAT CCTGCTGCTT
TCCGGCAGCA TTCCCCCGAA CAAGCTGCTC ATCGCCCAGT CGCCCCTTGT CGTTCCGCCT
GCAGTGCAGG AGGCCGCCCA CATTCTTGGA AGTATGGCCG GGGCCGGACT ATTGCTGCTC
TCACGCGGTA TCGAGCGCCG CCTTGCGTCG GTATGGAAGG TCGTCGTCAC CCTGCTGCTT
ACGGGCATGG TGTGCGCCCT GCTCAAGGGC TTCGACTGGC ACGAGGCTGT ACTCCTGTCG
TTTGCCCTTG CAGGACTTCT GGGCTCACGC CGTCGGTTCT ACCGGAAGTC TTCGCTCATC
CGGGAAGAAT ATCCGCTTCG CTGGTTCTTC GCCTCCGCCG CCGTCATCGG TTGCGCTGGC
GCGGTCGCAC TCTTTGCCTT CGGCGATGCC GGTACAGGCA TGAGGGGCCT ATGGGAAGCC
ATCAGTGACG ATGCCGGTGC TGCGCGTGCG GTGCGTGGCA TCGCCGCCGC CTGTGCCGTC
ATGGTTGCCT TCACCCTGCG CCGACTGTTG CTGCCGCTGC ATAAGCAGTA G
 
Protein sequence
MNRHLRSLGS FVVLCIFAGA AWLLYHEVRK YHLADIRQSI ELIPDLRLLA SFGLMIVNYL 
ILVGYDALAL KAIGRPLPLG KTALVSFVGC ACSYNFGALL GGSSVRYRFY SAWGFTIPDV
VRLVLMLAVT FWVGALGLAG LSFVIEPLPL PPGLGLPIDD VRPLGFALLA ATTGYLLLTF
FVRKPLHFFG REFALPCPKI AFAQTLTACA DLVAAAGCLY MLMPSDLGLD FLTFLAVYLL
ATVVVVLTHV PGGAGVFELV ILSLSQTTHP QAVIAALLAF RVIYYLLPLL FAALLLAGYE
VQVRRHQAEK AFRDAGRWMW VLSHILLSYV TFAAGVILLL SGSIPPNKLL IAQSPLVVPP
AVQEAAHILG SMAGAGLLLL SRGIERRLAS VWKVVVTLLL TGMVCALLKG FDWHEAVLLS
FALAGLLGSR RRFYRKSSLI REEYPLRWFF ASAAVIGCAG AVALFAFGDA GTGMRGLWEA
ISDDAGAARA VRGIAAACAV MVAFTLRRLL LPLHKQ