Gene Information |
Locus tag | DvMF_0656 |
Symbol | |
ID | 7172543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 790871 |
End bp | 792166 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643539156 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002435081 |
Protein GI | 218885760 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
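The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; both assumptions agree with the values in this record, but neither is stated by the source. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is counted in the gene length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for DvMF_0656.
length = gene_length(790871, 792166)
print(length)                  # 1296
print(protein_length(length))  # 431
```

Both results match the Gene Length (1296 bp) and Protein Length (431 aa) fields listed above.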
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.31335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGCT TCAGAAAAGC GTGCACCCTG GCGGCGGCTG CCCTGCTGTC GCTGACCCTG CTGGCGGGCA CCGCACTGGC GGAAAAGGTG AACCTGACCT TCTACTTCCC TGTTTCCGTC GGCGGACCCA TCACCAAGAT CGTGGAGGGC ATGACCGAGC AGTTCATGAA GGAACACCCG GACATCAAGG TCACTCCGGT GTACGCGGGC ATCTACCGCG AAACCCTCAC CAAGGCCCTT ACCGCCCTGC GCGGGGGCGA GCCGCCGCAC GTGGCGGTGC TGCTGTCCAC CGACATGTAC ACCCTCATCG ACGAAGACGC GGTGGTGGCC TACGACGACA TCCTGAAGCC GGAGGAAATG GGCTTCACCA AGGCGTTCTT CCCCGGTTTC ATGCGCAACA GCCAGACCGG CGGCAAGACC TGGGGCATTC CGTTCCAGCG TTCCACCATC GTGATGTACT GGAACAAGGA GGCCTTCAAG GCCGCCGGGC TTGATCCCGA AAAGGGCCCC GCCAACTGGA ACGAACTGGT GGAAATGGGC AAGAAGCTGA CCGTGCGCGA TGCCTCCGGC AAGGTCACCC AGTGGGGCGT GGCCATTCCC TCCACCGGGT ACGCCTACTG GATGTTCCAG GCCCTGGCCA TCCAGAACGG CGTGGAACTG ATGAACGCCG AAGGCACCAG GACCGACTTC GACAACCCCA AGGCCATCGA GGCCCTGCAA TTCCTGGTGG ATCTGGCCTA CAAGCACGAG GTGTCGCCCA AGGGCACCAT CGACTGGGCC ACCACCCCGC GCGATTTCTT CGAGCGCAAG TCCGCCATCA TGTGGACCAC CACCGGCAAC CTGACCAACG TGCGCACCAA CGCGCCCTTC CCCTTCGGCG TGGGCATGCT GCCCGCCAGC GCCCGCCCCG GTTCGCCCAC GGGCGGCGGC AACTTCTACA TCTTCAAGAA GGCCACCCCC GCAGAGCGCA AGGCTGCGGT CGACTTCGTG CAGTGGATGA CCAGCGCGGA ACGCGCCGCC CAGTGGGGCA TCGACACCGG CTACGTGGCC GTGCGCCCCG ACGCGTGGGA AACCCCGGCC ATGAAGGACT ACGTGGCCAA GTTCCCCGTG GCCGCCGTGG CCCGCGACCA GCTGGCCCAC GCCGTGCCCG AACTGTCCAC CCATGACAAC CAGCGCGTGA CCAAGGCGCT GGACGACGCC ATCCAGGCCG CCGTGACCGG CTCCAAGAAG CCCGCCGACG CCCTGAAGGA CGCCCAGAAG GAAGCAGAAC GCATCCTGCG CCGTTACGGC AAGTAG
|
Protein sequence | MTGFRKACTL AAAALLSLTL LAGTALAEKV NLTFYFPVSV GGPITKIVEG MTEQFMKEHP DIKVTPVYAG IYRETLTKAL TALRGGEPPH VAVLLSTDMY TLIDEDAVVA YDDILKPEEM GFTKAFFPGF MRNSQTGGKT WGIPFQRSTI VMYWNKEAFK AAGLDPEKGP ANWNELVEMG KKLTVRDASG KVTQWGVAIP STGYAYWMFQ ALAIQNGVEL MNAEGTRTDF DNPKAIEALQ FLVDLAYKHE VSPKGTIDWA TTPRDFFERK SAIMWTTTGN LTNVRTNAPF PFGVGMLPAS ARPGSPTGGG NFYIFKKATP AERKAAVDFV QWMTSAERAA QWGIDTGYVA VRPDAWETPA MKDYVAKFPV AAVARDQLAH AVPELSTHDN QRVTKALDDA IQAAVTGSKK PADALKDAQK EAERILRRYG K
|
| |
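The relationship between the Gene sequence, Protein sequence, and GC content fields can be checked with a short script. This is a minimal sketch, not the database's own method: the helper names are hypothetical, the sequence is expected to be pasted in as a string (the 10-base spacing used above is stripped), and the codon table is the standard one. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base cycling through T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (such as the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCGGCT TCAGAAAAGC GTGCACCCTG"))  # MTGFRKACTL
```

Passing the full Gene sequence to `translate` and comparing the result with the Protein sequence (after removing its spaces) is a quick consistency check, and `gc_content` on the same string can be compared with the GC content field.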