Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_1034 |
Symbol | |
ID | 7172930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 1253821 |
End bp | 1257135 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643539541 |
Product | hypothetical protein |
Protein accession | YP_002435457 |
Protein GI | 218886136 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 0.881671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCA AGGGCTTTAC CCTCATCGAA GTTACCGTAG TGCTGGTACT GGCGGGCCTG CTGGCCGCCG TGGCCGGGGT GGGCATCGTC ACCGGGGCGC GCAGCTTCGT GACGGCGCGC GAAGCCAGCG ACCTGGCGCA GGAAGCGCAA CTGGCCATGG ACCGCCTGAC CCGCGAGATT GTGGAACTGG TGGACATACC TGCCAACAGT TCCTCCACCC TGCTGGTGCT TCGCAACGTG GGCGCGCAAG GTACGGCGCA ATTCAACCGA TCCATCCAGT ACGTGGCCAA CGCGCAGGCG ATACGCATCG CCGACGGCAC CGGCGGCGCG ACCAACGGCG ATACGCTCAT CGACAACGTG ACTGCCTTCA GCCTGACCTA CTGGCAGGAA GACACTTCCA CCACCTCGTG GGCCTCTACC AACGATGCCC GGCTGCTGTC CGCCATCGAC GTCAATTTCA CCCTTACGGC GCCGGGCGGC ACCACGCGCC AGTTCGTCAA CCGTATCGTG CCCCGCAACA ACGAAAACCG GGGCGGGGCC AGCCCCAACT CCGCGCCGCC ATCGCTGGCC CGGTACGACG TGTGCTTCGT GGCAACTGCG GCGGCGGGCA ATGGCGATCA CCCGGTGGTG GTGCAACTGC GCCAGTTCCG CGACAGGCTG CTGCTGCCAT GGTCCGGTGG GCGGGCACTG GTGCGAGCTT ATTACGCCGT GGGGCCGGAC ATGGCCCGCG TGGTTGCCGC GCACGAGCCT TTGCGTGCCG CCACCCTGGC CGCCCTGACG CCGGTTGCCG CGCTGGCGGG GCTGGCCGTG CATGCGCCAG GATGGCTGGC GGCCTGCGTG GTGCTTTCCG CCCTGCTGGT GGCGGTGACG GGCCGTGCCC TGCGCCGGGG CAGCACCGCC GCGCGCGCGG CCATGGACAC TGCCATGGGC ACCGCTCGCA ACGCCGGGCA GCGCGGGGCC GTGCTGCTGA GCGCCATCGT GACCATTGTG GCCTTTTCCG TACTGGCCGC CACCATGGTG CCCATGATGA CCGGGTCCAT GATGGGCGAA ATTTCAGCCG TGCAGGGTGA CCAGGCCTAC TACCTTGCGG AATCGGGCTT TGGCGCTGCG GGCAGCATGT TTCTTGCGGC AGGCGACGAG CAGGCCCGCA AGAACCTGCT GGAGACCATG GATGGTTCCA CCTACACCTT TGCCAACAAC GCCGGGTCAT TCACCCTGGG GGTGGAGCCC TACTGGTACG AGGTCACCAA CAATACCGGC ACCACCCTGA CCACGCGCGC CTACGGCACG CCGCCCACCC TGCCCACCAA CACGTCGGGG CGCATCCGCG CGGGCACCGG GTCCACCTTC TATGCCTATA GCAACATTTC CGTTTCAGGC GACACGGTAA CCTTCACCCT GACCAGTACG CCTTCACCGC TTATCGCATC GGGCACGGAC ATCTTCTTCA GCGCGCGGCC CAGTGCCAGC CAGACCGTGA CGGAACGCGG CAACCTTACC CTTACCGCCA GCAACGCCGC CGCCTTTCCC AACGTCAACG GCATGTTCAC CATACGCGAC GGCGCCACCC ACACCGGACG CGTGGCCTAC GTGTACCGCC GCAAGAACGG CAGTACCCTG GAAGACGTGC GGCTGGTGGA AGGACAGGGG GCAACCTGGT CCAACCTGGC GCTGACCACC GACGCCAACA TCTCGCTGGA GGCGTACCTG CGTCTGCACT CCACCGGCAC TCCGTCCGGC GGGCTGCCGC GTGAGGTGGT CTACAATGTG CCCATCGGGT GGATATTGGG CGGCGGCAAC TTCAGCAAGG AACAATATCA GGATACCTTT GCCAACCTGG ACAAGTGGTA TACCGGCTCT TCCAGCGAAG GGCATCTGGG GACGCACAGC ATTTCCAGCG ACTCCGTGCA CATCACCGCC ATGCAGACGG CGGTAACCAC CGGCTTCGGT TTTTTCCGCT CATGGCTGAG CGGCGACAAT CAGTGGAGTT CACTGTTCTT CAACTGGGGC GCCACCAACG TAGACCTTGC GCGAGGCTGG CTGGATACAG AGGGCAACTC CAGCTACGAT CTGCAATTCA AAACACGGGT GGCTAGCCAG AGCGGGAACA ACAAGGCATT CTTCGGCGGC ATGATGTTCC GTGCGCGCAA CAACGACCAG GGCGATGCGG ACTCCAACAA CGACGAACTT GAATGCTACG GTATTTCCTT CGTCCGCTTC AGGCAGTACC GCAGCCTGCT GAACAGCAAC TGGTACTGGC CAAACGACGT ACCCTCCACA CTGGTGCCCG GCTATGACGC AGGACCAGAC CCGGATACGG GCGGTCCAAT GTTTGGCACG GACGAATCGC AGAACGAAAT CGTCAACAAT TACTTTATCA TCTGGCTTAA CGCCAACAAG TATTCATGGC CCGGCATTCT GCTGTGGGAA CGCCGCAACG GTGCGTTCCG CTGGCTGGCC TACAAGAGGC TGGGACAGGC CTCGGGTATC GTCAACTACG GCAGTGGCCC CACCTACAGC CTGAAAGACT GGCCCTCACT GCTGGTCCGG CTGGTGGAAG GACAGGAACT GTCCTTTACC AACGGCGGCG GAACCAGCGC GGGAACCACC TTGCGCATCA ACTACGGCGA CGAAATAACC ACCGCCACCG GGGCCAAGGC GCGCGTCATC GGCCCCCCCA TCGTCACCAG CGGCGACTGG GCATCCGGAA CGGCGGCAGG CAAGCTGGTG CTGACAAACG TAGACACGGG CAGCAGCGGC AGTTTCGGCA ACGGGCAGAT CATCACGGTA GACAATGTGC AACATGCGCG AGTCGGCTCT GGGGGGTTGG GCAGCAAGGC CAACTACATT CGTGTGTATT ATTCTGATGC AAGCACCAAT ACCACTGGCG ATGCAGTGCC GTACAACCCT ACGGCACCGG GCGGCAATAC GACGTATACC TCCAGCCAGC GTCGCTCAAA CCAGCGCATC ACCACCAGCA GCGGCAAGCT AAGCTGGATA CCCGACGACT ACGAACAGTG GACTGCTGCG ACAGATTACT TTTCGCTGGT ACAGTGGGAC GATGTGAACA CCGCAGTTAC GGGCGTGACA ACGATGGGCG AGTTGCTGAA CGGAAACAGC CTGAGCAACA GCATAGTGCG CAGTACCAAC CTGTTGTCGC CCACGTACAA TGACAGCAGC CCGTCTTACT CGCCACCGGA AGCACTATCC ATCTTCACCA GCGGCCCCAC AGGTACGAAC TTTTACTTTG ACGACTTCGG GTTGCAGCTT GACCTGCGTG GCGGCAAGGG CTTTCTGCCG CCCATCCAGC AATAG
|
Protein sequence | MNAKGFTLIE VTVVLVLAGL LAAVAGVGIV TGARSFVTAR EASDLAQEAQ LAMDRLTREI VELVDIPANS SSTLLVLRNV GAQGTAQFNR SIQYVANAQA IRIADGTGGA TNGDTLIDNV TAFSLTYWQE DTSTTSWAST NDARLLSAID VNFTLTAPGG TTRQFVNRIV PRNNENRGGA SPNSAPPSLA RYDVCFVATA AAGNGDHPVV VQLRQFRDRL LLPWSGGRAL VRAYYAVGPD MARVVAAHEP LRAATLAALT PVAALAGLAV HAPGWLAACV VLSALLVAVT GRALRRGSTA ARAAMDTAMG TARNAGQRGA VLLSAIVTIV AFSVLAATMV PMMTGSMMGE ISAVQGDQAY YLAESGFGAA GSMFLAAGDE QARKNLLETM DGSTYTFANN AGSFTLGVEP YWYEVTNNTG TTLTTRAYGT PPTLPTNTSG RIRAGTGSTF YAYSNISVSG DTVTFTLTST PSPLIASGTD IFFSARPSAS QTVTERGNLT LTASNAAAFP NVNGMFTIRD GATHTGRVAY VYRRKNGSTL EDVRLVEGQG ATWSNLALTT DANISLEAYL RLHSTGTPSG GLPREVVYNV PIGWILGGGN FSKEQYQDTF ANLDKWYTGS SSEGHLGTHS ISSDSVHITA MQTAVTTGFG FFRSWLSGDN QWSSLFFNWG ATNVDLARGW LDTEGNSSYD LQFKTRVASQ SGNNKAFFGG MMFRARNNDQ GDADSNNDEL ECYGISFVRF RQYRSLLNSN WYWPNDVPST LVPGYDAGPD PDTGGPMFGT DESQNEIVNN YFIIWLNANK YSWPGILLWE RRNGAFRWLA YKRLGQASGI VNYGSGPTYS LKDWPSLLVR LVEGQELSFT NGGGTSAGTT LRINYGDEIT TATGAKARVI GPPIVTSGDW ASGTAAGKLV LTNVDTGSSG SFGNGQIITV DNVQHARVGS GGLGSKANYI RVYYSDASTN TTGDAVPYNP TAPGGNTTYT SSQRRSNQRI TTSSGKLSWI PDDYEQWTAA TDYFSLVQWD DVNTAVTGVT TMGELLNGNS LSNSIVRSTN LLSPTYNDSS PSYSPPEALS IFTSGPTGTN FYFDDFGLQL DLRGGKGFLP PIQQ
|
| |