Gene DvMF_2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_2041 
Symbol 
ID7173960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp2531921 
End bp2532985 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content68% 
IMG OID643540558 
Productprotein of unknown function DUF116 
Protein accessionYP_002436452 
Protein GI218887131 
COG category[S] Function unknown 
COG ID[COG1852] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATGC CCATCCGCAA GAACCCGGAT TCCCTCCCGC GAGAGGACTA CCACGGCGCC 
CGCAAGCGGC TGTTCATCGG GCTCATCAGC CTGACGTCCG CCGCCCTGTG CCTAGTGCTG
CTGGTGGGGT GGATCATCCC GTACATCGGG CTGGGCAACA TCCATCCGCT GGTGCCGGAC
ATCACCGGGG CCCTGCTGGT GGCGTGCATT GCGCTCATCG TGTGGGCCAC GCTGGGCCTT
GTGCTGCACA TCTACACCGG GCGGCCCTGG TTCGGCTCGC AACGGGTGCG CGGCGTGGCG
GTAAAGCTGT TCCTGCCGCT CATGGAGCTG CTGGGGCGGC TGTTCGGCAT CTCGCGCGAA
GAGGTGCGCC ACTCGTTCAT CAAGGTCAAC AACGAGCTGG TGCGCGGCGA GACGGGCAGC
TTTGCCCCGT CGGACGTGCT GATCCTGCTG CCGCACTGCC TGCAGTCCAG CAACTGCGCG
GTGCGCCTGA CCTACGGCGT GGACCACTGC AAGCGCTGCG GCCAGTGCCC CATAGAGCGG
CTGCTGGCCC TGCGCGATCG CTACGGCGTC AAGCTGGCCA TAGCCACCGG CGGCACCATC
GCCCGGCGTA TCGTGGTCAA GGAGCGCCCC CGGCTGATCA TTGCCGTGGC CTGCGAACGC
GACCTTGCCA GCGGCATCCA GGACACCCAC CCCATCCCGG TGTACGGCGT GCTCAACGAG
CGGCCCAACG GCCCCTGCCT GGACACGCTG GTCAGCCTGC TCAACGTGGA AAAGGCCCTG
CGCCACTTCC TGAACGTGCT GCCGCCCGAT GTTGCGATAG ATGATGCGGA AGCGGACACG
GCTTCTGCGG CACCCGGCTT TCACGATGCT CCCGCGACTG GCGTGCCGGG CTCCGTTGCC
GATGCCACCT TCGGGGCCGG CAGCGGACAT GCCACGGGCG TCGGGCATGC CGGGAGTCCC
CATGGTGCCG GTGTTGCCAC AGCCCGCGAC CAGCGCGCGC AGGAACGCAC GCCTGCGGCC
TCTTCCACCG CAGAGGCGCC TGCCGCGCCG TCGGAACCGC AATGA
 
Protein sequence
MSMPIRKNPD SLPREDYHGA RKRLFIGLIS LTSAALCLVL LVGWIIPYIG LGNIHPLVPD 
ITGALLVACI ALIVWATLGL VLHIYTGRPW FGSQRVRGVA VKLFLPLMEL LGRLFGISRE
EVRHSFIKVN NELVRGETGS FAPSDVLILL PHCLQSSNCA VRLTYGVDHC KRCGQCPIER
LLALRDRYGV KLAIATGGTI ARRIVVKERP RLIIAVACER DLASGIQDTH PIPVYGVLNE
RPNGPCLDTL VSLLNVEKAL RHFLNVLPPD VAIDDAEADT ASAAPGFHDA PATGVPGSVA
DATFGAGSGH ATGVGHAGSP HGAGVATARD QRAQERTPAA SSTAEAPAAP SEPQ