Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2799 |
Symbol | |
ID | 4663493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 3255105 |
End bp | 3256667 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639821054 |
Product | extracellular solute-binding protein |
Protein accession | YP_968237 |
Protein GI | 120603837 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.741108 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGTC TCATCGCGTT GTTGTCCCTG GCTCTTGCGG CCATGCTCGC CCTCGGCGGC ATCGCCAACG CTGCCGAAGG CAAGACCTTC CGCATCGCCA TGGCGTCCGA CCCCGAGTCC CTCGACCCCC ACATGCAGCT TTCCGGCCCC ATCCTGGCAT ACTCGCACTG GGTCTTCGAC CCGCTGGTAC GCTGGACCCC GGACATGAAG TTCGAAGCCC GCCTTGCCGA GAAGTGGGAA CAGATCAACC CCACCACCAT GCGTTTCCAC CTGCGCAAGG GCGTGAAGTT CCATAGCGGC AACCCCTTCA CCGCGAAGGA CGTGGCATGG ACGCTCGACC GCCTGAAGAA GTCCCCCGAC TTCAAGGGCC TGTTCATCAA GTTCGCCGAG CCCAAGGTCA TCGACGACAA CACCATCGAC ATCATCACCA CCGAGCCCTA CGGCCTCGTG ATGAACCTCG CCACCTACAT CTTCCCGATG GACAGCAAGT TCTACACTGG CACCGACGCC AAGGGTCAGC CCAAGGACGC CATCGTGAAG AGCGGCTACT CGTTCGCCAA CGACAACGCC TCCGGCACCG GTCCCTACAG CGTGGCCGAG CGTGAACAGG GCGTGAAGCT CATCCTCAAG GCCAACAAGG GCTACTGGGG CAAGCGCGGC AACGTCGACA CCATCGAACT CACCCCCATC AAGAACGAAG CCACCCGCGT GGCCGCCATC CTGAAGGGCG ACGTCGACTT CATCTCGCCC GTGCCCGTGC AGGACTACGA CCAGCTTTCC AAGAATGCCG ATGTGGAACT GATCACCATG CCCAGCGCCC GCATCATCAC CATTCAGCTC AACCAGAAGA AGTTCCCCCA GTTCGCGGAC AAGCGCGTGC GTGAAGCCAT CATCGCCGCC ACCGATACCG CCGGTATCGT GGCCAAGGTC ATGAAGGGCT ACACCACCAC CACGCAGCAG CAGGCCCCCA AGGGCTTCGC TGGCTACATC GCCGACCTCA AGCCCCGCTA CAATCTCGAC AACGCCAAGA AGCTCATGAA GGACGCCGGC TTCGAGAAGG GCTTCGAAGT GTCGATGATC GCCCCCAACA ACCGCTACGT GAACGACGAG AAGATCGCGC AGGCCTTCGT TTCGATGATG GCTCGCATCA ACATCAAGGT GAACCTCAAG ACCATGCCCA AGGCGCAGTA CTGGGACCAG TTCGACGCCC AGGTTGCCGA CATCCAGATG ATCGGCTGGC ATCCTGACAC CGAGGACACC GCCAACTACT CCGAGTACCT GCTCATGACC CCCAACAAGG ACACGGGCAT GGGCCAGTAC AACAGCGGCA ACTACGCCAA CCCCAAGTTC GACGCCCTCA TCGACGCTGC CAACCGCGAG ACCGACCCGG CCAAGCGTGA CGCGCTCCTG AAGGAGTCGG AGCAGATGGC TTACAACGAC GCCGCCTTCG TGCCCCTGCA CTGGGAACCG CTGTCGTGGG CAGCTCGCAA GACCGTCAAG AACGCCAAGC AGGTCGTCAA CGCGCAGGAC TTCCCCTACT TCGGCGATCT GATGATGCAG TAA
|
Protein sequence | MKRLIALLSL ALAAMLALGG IANAAEGKTF RIAMASDPES LDPHMQLSGP ILAYSHWVFD PLVRWTPDMK FEARLAEKWE QINPTTMRFH LRKGVKFHSG NPFTAKDVAW TLDRLKKSPD FKGLFIKFAE PKVIDDNTID IITTEPYGLV MNLATYIFPM DSKFYTGTDA KGQPKDAIVK SGYSFANDNA SGTGPYSVAE REQGVKLILK ANKGYWGKRG NVDTIELTPI KNEATRVAAI LKGDVDFISP VPVQDYDQLS KNADVELITM PSARIITIQL NQKKFPQFAD KRVREAIIAA TDTAGIVAKV MKGYTTTTQQ QAPKGFAGYI ADLKPRYNLD NAKKLMKDAG FEKGFEVSMI APNNRYVNDE KIAQAFVSMM ARINIKVNLK TMPKAQYWDQ FDAQVADIQM IGWHPDTEDT ANYSEYLLMT PNKDTGMGQY NSGNYANPKF DALIDAANRE TDPAKRDALL KESEQMAYND AAFVPLHWEP LSWAARKTVK NAKQVVNAQD FPYFGDLMMQ
|
| |