Gene DvMF_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_1034 
Symbol 
ID7172930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp1253821 
End bp1257135 
Gene Length3315 bp 
Protein Length1104 aa 
Translation table11 
GC content63% 
IMG OID643539541 
Producthypothetical protein 
Protein accessionYP_002435457 
Protein GI218886136 
COG category 
COG ID 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value0.881671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCA AGGGCTTTAC CCTCATCGAA GTTACCGTAG TGCTGGTACT GGCGGGCCTG 
CTGGCCGCCG TGGCCGGGGT GGGCATCGTC ACCGGGGCGC GCAGCTTCGT GACGGCGCGC
GAAGCCAGCG ACCTGGCGCA GGAAGCGCAA CTGGCCATGG ACCGCCTGAC CCGCGAGATT
GTGGAACTGG TGGACATACC TGCCAACAGT TCCTCCACCC TGCTGGTGCT TCGCAACGTG
GGCGCGCAAG GTACGGCGCA ATTCAACCGA TCCATCCAGT ACGTGGCCAA CGCGCAGGCG
ATACGCATCG CCGACGGCAC CGGCGGCGCG ACCAACGGCG ATACGCTCAT CGACAACGTG
ACTGCCTTCA GCCTGACCTA CTGGCAGGAA GACACTTCCA CCACCTCGTG GGCCTCTACC
AACGATGCCC GGCTGCTGTC CGCCATCGAC GTCAATTTCA CCCTTACGGC GCCGGGCGGC
ACCACGCGCC AGTTCGTCAA CCGTATCGTG CCCCGCAACA ACGAAAACCG GGGCGGGGCC
AGCCCCAACT CCGCGCCGCC ATCGCTGGCC CGGTACGACG TGTGCTTCGT GGCAACTGCG
GCGGCGGGCA ATGGCGATCA CCCGGTGGTG GTGCAACTGC GCCAGTTCCG CGACAGGCTG
CTGCTGCCAT GGTCCGGTGG GCGGGCACTG GTGCGAGCTT ATTACGCCGT GGGGCCGGAC
ATGGCCCGCG TGGTTGCCGC GCACGAGCCT TTGCGTGCCG CCACCCTGGC CGCCCTGACG
CCGGTTGCCG CGCTGGCGGG GCTGGCCGTG CATGCGCCAG GATGGCTGGC GGCCTGCGTG
GTGCTTTCCG CCCTGCTGGT GGCGGTGACG GGCCGTGCCC TGCGCCGGGG CAGCACCGCC
GCGCGCGCGG CCATGGACAC TGCCATGGGC ACCGCTCGCA ACGCCGGGCA GCGCGGGGCC
GTGCTGCTGA GCGCCATCGT GACCATTGTG GCCTTTTCCG TACTGGCCGC CACCATGGTG
CCCATGATGA CCGGGTCCAT GATGGGCGAA ATTTCAGCCG TGCAGGGTGA CCAGGCCTAC
TACCTTGCGG AATCGGGCTT TGGCGCTGCG GGCAGCATGT TTCTTGCGGC AGGCGACGAG
CAGGCCCGCA AGAACCTGCT GGAGACCATG GATGGTTCCA CCTACACCTT TGCCAACAAC
GCCGGGTCAT TCACCCTGGG GGTGGAGCCC TACTGGTACG AGGTCACCAA CAATACCGGC
ACCACCCTGA CCACGCGCGC CTACGGCACG CCGCCCACCC TGCCCACCAA CACGTCGGGG
CGCATCCGCG CGGGCACCGG GTCCACCTTC TATGCCTATA GCAACATTTC CGTTTCAGGC
GACACGGTAA CCTTCACCCT GACCAGTACG CCTTCACCGC TTATCGCATC GGGCACGGAC
ATCTTCTTCA GCGCGCGGCC CAGTGCCAGC CAGACCGTGA CGGAACGCGG CAACCTTACC
CTTACCGCCA GCAACGCCGC CGCCTTTCCC AACGTCAACG GCATGTTCAC CATACGCGAC
GGCGCCACCC ACACCGGACG CGTGGCCTAC GTGTACCGCC GCAAGAACGG CAGTACCCTG
GAAGACGTGC GGCTGGTGGA AGGACAGGGG GCAACCTGGT CCAACCTGGC GCTGACCACC
GACGCCAACA TCTCGCTGGA GGCGTACCTG CGTCTGCACT CCACCGGCAC TCCGTCCGGC
GGGCTGCCGC GTGAGGTGGT CTACAATGTG CCCATCGGGT GGATATTGGG CGGCGGCAAC
TTCAGCAAGG AACAATATCA GGATACCTTT GCCAACCTGG ACAAGTGGTA TACCGGCTCT
TCCAGCGAAG GGCATCTGGG GACGCACAGC ATTTCCAGCG ACTCCGTGCA CATCACCGCC
ATGCAGACGG CGGTAACCAC CGGCTTCGGT TTTTTCCGCT CATGGCTGAG CGGCGACAAT
CAGTGGAGTT CACTGTTCTT CAACTGGGGC GCCACCAACG TAGACCTTGC GCGAGGCTGG
CTGGATACAG AGGGCAACTC CAGCTACGAT CTGCAATTCA AAACACGGGT GGCTAGCCAG
AGCGGGAACA ACAAGGCATT CTTCGGCGGC ATGATGTTCC GTGCGCGCAA CAACGACCAG
GGCGATGCGG ACTCCAACAA CGACGAACTT GAATGCTACG GTATTTCCTT CGTCCGCTTC
AGGCAGTACC GCAGCCTGCT GAACAGCAAC TGGTACTGGC CAAACGACGT ACCCTCCACA
CTGGTGCCCG GCTATGACGC AGGACCAGAC CCGGATACGG GCGGTCCAAT GTTTGGCACG
GACGAATCGC AGAACGAAAT CGTCAACAAT TACTTTATCA TCTGGCTTAA CGCCAACAAG
TATTCATGGC CCGGCATTCT GCTGTGGGAA CGCCGCAACG GTGCGTTCCG CTGGCTGGCC
TACAAGAGGC TGGGACAGGC CTCGGGTATC GTCAACTACG GCAGTGGCCC CACCTACAGC
CTGAAAGACT GGCCCTCACT GCTGGTCCGG CTGGTGGAAG GACAGGAACT GTCCTTTACC
AACGGCGGCG GAACCAGCGC GGGAACCACC TTGCGCATCA ACTACGGCGA CGAAATAACC
ACCGCCACCG GGGCCAAGGC GCGCGTCATC GGCCCCCCCA TCGTCACCAG CGGCGACTGG
GCATCCGGAA CGGCGGCAGG CAAGCTGGTG CTGACAAACG TAGACACGGG CAGCAGCGGC
AGTTTCGGCA ACGGGCAGAT CATCACGGTA GACAATGTGC AACATGCGCG AGTCGGCTCT
GGGGGGTTGG GCAGCAAGGC CAACTACATT CGTGTGTATT ATTCTGATGC AAGCACCAAT
ACCACTGGCG ATGCAGTGCC GTACAACCCT ACGGCACCGG GCGGCAATAC GACGTATACC
TCCAGCCAGC GTCGCTCAAA CCAGCGCATC ACCACCAGCA GCGGCAAGCT AAGCTGGATA
CCCGACGACT ACGAACAGTG GACTGCTGCG ACAGATTACT TTTCGCTGGT ACAGTGGGAC
GATGTGAACA CCGCAGTTAC GGGCGTGACA ACGATGGGCG AGTTGCTGAA CGGAAACAGC
CTGAGCAACA GCATAGTGCG CAGTACCAAC CTGTTGTCGC CCACGTACAA TGACAGCAGC
CCGTCTTACT CGCCACCGGA AGCACTATCC ATCTTCACCA GCGGCCCCAC AGGTACGAAC
TTTTACTTTG ACGACTTCGG GTTGCAGCTT GACCTGCGTG GCGGCAAGGG CTTTCTGCCG
CCCATCCAGC AATAG
 
Protein sequence
MNAKGFTLIE VTVVLVLAGL LAAVAGVGIV TGARSFVTAR EASDLAQEAQ LAMDRLTREI 
VELVDIPANS SSTLLVLRNV GAQGTAQFNR SIQYVANAQA IRIADGTGGA TNGDTLIDNV
TAFSLTYWQE DTSTTSWAST NDARLLSAID VNFTLTAPGG TTRQFVNRIV PRNNENRGGA
SPNSAPPSLA RYDVCFVATA AAGNGDHPVV VQLRQFRDRL LLPWSGGRAL VRAYYAVGPD
MARVVAAHEP LRAATLAALT PVAALAGLAV HAPGWLAACV VLSALLVAVT GRALRRGSTA
ARAAMDTAMG TARNAGQRGA VLLSAIVTIV AFSVLAATMV PMMTGSMMGE ISAVQGDQAY
YLAESGFGAA GSMFLAAGDE QARKNLLETM DGSTYTFANN AGSFTLGVEP YWYEVTNNTG
TTLTTRAYGT PPTLPTNTSG RIRAGTGSTF YAYSNISVSG DTVTFTLTST PSPLIASGTD
IFFSARPSAS QTVTERGNLT LTASNAAAFP NVNGMFTIRD GATHTGRVAY VYRRKNGSTL
EDVRLVEGQG ATWSNLALTT DANISLEAYL RLHSTGTPSG GLPREVVYNV PIGWILGGGN
FSKEQYQDTF ANLDKWYTGS SSEGHLGTHS ISSDSVHITA MQTAVTTGFG FFRSWLSGDN
QWSSLFFNWG ATNVDLARGW LDTEGNSSYD LQFKTRVASQ SGNNKAFFGG MMFRARNNDQ
GDADSNNDEL ECYGISFVRF RQYRSLLNSN WYWPNDVPST LVPGYDAGPD PDTGGPMFGT
DESQNEIVNN YFIIWLNANK YSWPGILLWE RRNGAFRWLA YKRLGQASGI VNYGSGPTYS
LKDWPSLLVR LVEGQELSFT NGGGTSAGTT LRINYGDEIT TATGAKARVI GPPIVTSGDW
ASGTAAGKLV LTNVDTGSSG SFGNGQIITV DNVQHARVGS GGLGSKANYI RVYYSDASTN
TTGDAVPYNP TAPGGNTTYT SSQRRSNQRI TTSSGKLSWI PDDYEQWTAA TDYFSLVQWD
DVNTAVTGVT TMGELLNGNS LSNSIVRSTN LLSPTYNDSS PSYSPPEALS IFTSGPTGTN
FYFDDFGLQL DLRGGKGFLP PIQQ