Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0788 |
Symbol | |
ID | 7172677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 947079 |
End bp | 948809 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643539290 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002435213 |
Protein GI | 218885892 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.301156 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGCA CCGCTTCACC CGCGCCGCGC GCCACGCGCG CAGGCCTGTA CCTTCTGGCG TTTCTGCTGT GCCTGCTGCT GGCCGCCTGC GGCGACTCCG CCCCCCGGGG CGGCAAGGAC GGCGGCCAGA ACGGCACCAC GGGTGCCGGG GGCAACGCCA CCTCGGCCCA GGGGGGGCAG GCCGCCGCGC CAACGCCCGG AGAACCGGTC ACCGGCGACA GGATCATTCT CGGCACCATC GGAGAACCGT CCAACCTGCT GCCGTTCATC GCGTCCGACG CGTCGTCGCA CGAGGTCAGC GATCAGATTT TCGTGGCCGC GCTGCGCTAC GACAAGGACC TGAACATCGA GAAGTGGGCC GCCGCCTCGT GCGAGGTGCT GGAAAACGGC ACCCTGCTGC GCTTCACCCT GCGCGACGAC ATCCGCTGGG AGGACGGCAA GCCCCTGACC GCCGACGACG TGGAGTTCAC CTACAAGCTG ATGGTGGACC CCAAGACCCC CACCCCCTAC GCCGAGGACT ACCTGGCCAT CCAGGAATTC CGCAAGACGG GGCCGCTGTC GTTCGAGGTG CGCTACGCCA AGCCCTTTGC CCGCTCGCTG ATCACCTGGA TGCACGGCAT CATGCCGAAG CACCTGCTGG AAGGGCAGGA CCTGATGAAC ACGCCCTTCT CGCGCAAGCC CGTGGGGGCA GGACCGTACC GGCTGAAGGA ATGGGAGGCG GGCACCCGCC TGACCCTGGA GGCCAGCCCC AGCTATTTCC TGGGCCGCCC CTACATCGAC GAGGTGGTCT ACCGGATCAT CCCCGACCTC TCCACCATGT TCCTGGAACT GAAGGCCCAG CGCCTGGACA TGATGAACCT GAGCCCGCAG CAGTACCTGC ACCAGACCAA CGGGGCCGAA TGGGACATGG CCTGGCGCAA GTTCCAGTAC CTGTCCTTCG GCTACAGCTA TCTTGGCTAC AACCTGTCGC TGCCCATGTT CAGGGATGTG AAGGTGCGCC AGGCGCTTAC CTGCGCCATA GACCGCAAGG CGCTGGTGGC CGGGGTGCTG CTGGGGCAGG GCATGCCCAC GGTGGGCCCG TACAAGCCGG GCACCTGGGT ATACAATGAC AAAATTGATG ATTATCCGTA TGATCCTGCC AAGGCCAGGG AACTGCTGGC CCAGGCAGGC TGGGTCGACA CCAACGGCGA CGGCCTTCTG GACAAGGACG GCGCGCCCTT CGCGTTCACC ATCCTGACCA ACCAGGGCAA CGACCAGCGC ATCAAGGCCG CCACCATCAT CCAGAGCCAG CTCAAGTCCG TGGGCATCGA CGTGAAGATA CGCACGGTGG AGTGGGCGGC CTTCATCAAG GAATTCGTGG ACAAGGGGCG CTTCGACGCC GTGCTGCTGG GCTGGAACAT CCTGCAAGAC CCCGACCTGT ACGACGTGTG GCACTCGTCC AAAGCCGTGC CCGGCGGGCT GAACTTCGTG GGTTACAAGA ACCCGGAGGT GGACGACCTG CTGGAACGCG CCCGTTCCAC CTTCGACCAG GCCGAGCGCA AGAAGCTGTA CGACAGGTTC CAGGAAATCC TGCACCGCGA CCAGCCGTAC TGCTTCCTGT ACACGCCGTA TTCGCTGCCC ATCGTCAGTT CGCGTTTCCA GGGGCTGGAA CCCGCTCCGG CGGGGCTGAC CCACAATTTC ACCCGCTGGT GGACCCCGAG GGAGCAGCAG CGGTACCGCA TGCAACAGTA G
|
Protein sequence | MNRTASPAPR ATRAGLYLLA FLLCLLLAAC GDSAPRGGKD GGQNGTTGAG GNATSAQGGQ AAAPTPGEPV TGDRIILGTI GEPSNLLPFI ASDASSHEVS DQIFVAALRY DKDLNIEKWA AASCEVLENG TLLRFTLRDD IRWEDGKPLT ADDVEFTYKL MVDPKTPTPY AEDYLAIQEF RKTGPLSFEV RYAKPFARSL ITWMHGIMPK HLLEGQDLMN TPFSRKPVGA GPYRLKEWEA GTRLTLEASP SYFLGRPYID EVVYRIIPDL STMFLELKAQ RLDMMNLSPQ QYLHQTNGAE WDMAWRKFQY LSFGYSYLGY NLSLPMFRDV KVRQALTCAI DRKALVAGVL LGQGMPTVGP YKPGTWVYND KIDDYPYDPA KARELLAQAG WVDTNGDGLL DKDGAPFAFT ILTNQGNDQR IKAATIIQSQ LKSVGIDVKI RTVEWAAFIK EFVDKGRFDA VLLGWNILQD PDLYDVWHSS KAVPGGLNFV GYKNPEVDDL LERARSTFDQ AERKKLYDRF QEILHRDQPY CFLYTPYSLP IVSSRFQGLE PAPAGLTHNF TRWWTPREQQ RYRMQQ
|
| |