Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0688 |
Symbol | |
ID | 7172575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 829719 |
End bp | 831161 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643539188 |
Product | polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
Protein accession | YP_002435113 |
Protein GI | 218885792 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 0.403997 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGTT ACCTCGTGCT GGTGAACGAG AGCGTCGTGT CGTTCTGCGT GGTGGCGGTG CTTGTGGCGG TGCTTGGGGT GGTGGTGAGC TATGTGCTTC CCAAGCGTTA TCAGGCCCAG TCGTCCGTCT CCATTGAGGA AAACGTGGTC AACGAACTGG TAAAGGGGAT TGCCGTCACC CCCTCGCTTG AGGCAAAGCT GCGGATTCTC AAGGTTTCCA TACTCAGTCG CAAGATGTTG TTGCGGGTGA TCAAGGACCT GGACATGGAT CTGGGCATAC AGGGTGAGCG GCTTGAGCGC TTGATAGAAA CAACAAGGTC GAATGTTGAA ATTACGCATG AGGAAAGAAA AGGCATTTTC TACATCAGAT ATGCGAACTC GTCTCCGGAA AGAGCACGCG ATTTCGTCAA TGCATTGACG CGCAGATACA TCGAAGAAAG TACGTCGTCG AAGCGTGAGG AATCATACGA AGCGACAAGA TTTCTTTTTG ATCAGATATC AGTATTCCAG AAGCGTATTG ATGCCGCACA ACAGGCAATT GATGCGTACA AGTCGGAGAA GGGTATGGTT CTCAGTCTGA ATGAGAACAT ACTTCGCGAG GAAATCAAGG AAACGGAACA TCGTCTGGAG GAGACGCGCA TTCGCAAGAA CGAACGCCTT GCCCAGCTCA ATATCCTTGA AAAGGGAAGC GGTGGGGGAC GGCTTGCGGA AAAGGAGGCG TCCTACAAGA CGTTGCTGCG AACCTACACC GATCAGCATC CGGACGTGAT CAAGGCCCGG GCGGAGCTTG ACGCCCTGCG CTCTGGCGGG GACGGGGCGG AGCGCCGCAA GGGTGGCGTG GACTACCAGC GCCTCAAGGT GGAGCTGGAG TCGCTGAGCG AGATAGAGCA GGTCCAGCAG GCGCTCATAG AGAAGGACAA GCGACTGCTG CAAGAGCTTC CCGCCGTGCA GACAGAGCTT CAGGTCCTGC AGCAGGCGCG CAAGAACGAG ACGTTGATCT ACGAGCAGCT CGTGTCGCGC TACGGGCAAT CCGAGGTCTC GAAGCAGATG GAATTGCAGG ACAAGGCCGT GAGCTTTCGC ATCATCGACC CCGCGATCCT TCCCTTGCGG CCCAGCAGCC CCAACAGGCC GTTGATCATG CTGGGTTCGC TGGTGGCGGG CGTGGTCGTG GCTGCGGGCG GGATCATTCT GTCCGACCAG TTTTTCCGCA GGGTGCGCTC GGTGGAAGAC CTGACATCCA GGGGGTTTCT GGTGCTTGGC GCGCTGCCCC GCATAGCCAC CGCCGCGGAC GCGCGCGTGG CGAGACGCAG GCGGACCGCC ATCGCCCTGG CCCTGTTGGC GATGCTGTGC ATCGTGGGGC TTGCAGGGTG GGAATACAGC GGTTTCGAGG GGCTGGACAC GCTGTTTGCC AGGGCTCGCT ACATCCTTTC CACGTGGTTG TGA
|
Protein sequence | MKRYLVLVNE SVVSFCVVAV LVAVLGVVVS YVLPKRYQAQ SSVSIEENVV NELVKGIAVT PSLEAKLRIL KVSILSRKML LRVIKDLDMD LGIQGERLER LIETTRSNVE ITHEERKGIF YIRYANSSPE RARDFVNALT RRYIEESTSS KREESYEATR FLFDQISVFQ KRIDAAQQAI DAYKSEKGMV LSLNENILRE EIKETEHRLE ETRIRKNERL AQLNILEKGS GGGRLAEKEA SYKTLLRTYT DQHPDVIKAR AELDALRSGG DGAERRKGGV DYQRLKVELE SLSEIEQVQQ ALIEKDKRLL QELPAVQTEL QVLQQARKNE TLIYEQLVSR YGQSEVSKQM ELQDKAVSFR IIDPAILPLR PSSPNRPLIM LGSLVAGVVV AAGGIILSDQ FFRRVRSVED LTSRGFLVLG ALPRIATAAD ARVARRRRTA IALALLAMLC IVGLAGWEYS GFEGLDTLFA RARYILSTWL
|
| |