Gene DvMF_0788 details


Gene Information

Locus tag: DvMF_0788
Symbol:
ID: 7172677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 947079
End bp: 948809
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 65%
IMG OID: 643539290
Product: extracellular solute-binding protein family 5
Protein accession: YP_002435213
Protein GI: 218885892
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
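
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
gene_length = span_length(947079, 948809)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1731 576
```

Both results agree with the Gene Length and Protein Length fields listed in the record.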


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value: 0.301156
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGCA CCGCTTCACC CGCGCCGCGC GCCACGCGCG CAGGCCTGTA CCTTCTGGCG 
TTTCTGCTGT GCCTGCTGCT GGCCGCCTGC GGCGACTCCG CCCCCCGGGG CGGCAAGGAC
GGCGGCCAGA ACGGCACCAC GGGTGCCGGG GGCAACGCCA CCTCGGCCCA GGGGGGGCAG
GCCGCCGCGC CAACGCCCGG AGAACCGGTC ACCGGCGACA GGATCATTCT CGGCACCATC
GGAGAACCGT CCAACCTGCT GCCGTTCATC GCGTCCGACG CGTCGTCGCA CGAGGTCAGC
GATCAGATTT TCGTGGCCGC GCTGCGCTAC GACAAGGACC TGAACATCGA GAAGTGGGCC
GCCGCCTCGT GCGAGGTGCT GGAAAACGGC ACCCTGCTGC GCTTCACCCT GCGCGACGAC
ATCCGCTGGG AGGACGGCAA GCCCCTGACC GCCGACGACG TGGAGTTCAC CTACAAGCTG
ATGGTGGACC CCAAGACCCC CACCCCCTAC GCCGAGGACT ACCTGGCCAT CCAGGAATTC
CGCAAGACGG GGCCGCTGTC GTTCGAGGTG CGCTACGCCA AGCCCTTTGC CCGCTCGCTG
ATCACCTGGA TGCACGGCAT CATGCCGAAG CACCTGCTGG AAGGGCAGGA CCTGATGAAC
ACGCCCTTCT CGCGCAAGCC CGTGGGGGCA GGACCGTACC GGCTGAAGGA ATGGGAGGCG
GGCACCCGCC TGACCCTGGA GGCCAGCCCC AGCTATTTCC TGGGCCGCCC CTACATCGAC
GAGGTGGTCT ACCGGATCAT CCCCGACCTC TCCACCATGT TCCTGGAACT GAAGGCCCAG
CGCCTGGACA TGATGAACCT GAGCCCGCAG CAGTACCTGC ACCAGACCAA CGGGGCCGAA
TGGGACATGG CCTGGCGCAA GTTCCAGTAC CTGTCCTTCG GCTACAGCTA TCTTGGCTAC
AACCTGTCGC TGCCCATGTT CAGGGATGTG AAGGTGCGCC AGGCGCTTAC CTGCGCCATA
GACCGCAAGG CGCTGGTGGC CGGGGTGCTG CTGGGGCAGG GCATGCCCAC GGTGGGCCCG
TACAAGCCGG GCACCTGGGT ATACAATGAC AAAATTGATG ATTATCCGTA TGATCCTGCC
AAGGCCAGGG AACTGCTGGC CCAGGCAGGC TGGGTCGACA CCAACGGCGA CGGCCTTCTG
GACAAGGACG GCGCGCCCTT CGCGTTCACC ATCCTGACCA ACCAGGGCAA CGACCAGCGC
ATCAAGGCCG CCACCATCAT CCAGAGCCAG CTCAAGTCCG TGGGCATCGA CGTGAAGATA
CGCACGGTGG AGTGGGCGGC CTTCATCAAG GAATTCGTGG ACAAGGGGCG CTTCGACGCC
GTGCTGCTGG GCTGGAACAT CCTGCAAGAC CCCGACCTGT ACGACGTGTG GCACTCGTCC
AAAGCCGTGC CCGGCGGGCT GAACTTCGTG GGTTACAAGA ACCCGGAGGT GGACGACCTG
CTGGAACGCG CCCGTTCCAC CTTCGACCAG GCCGAGCGCA AGAAGCTGTA CGACAGGTTC
CAGGAAATCC TGCACCGCGA CCAGCCGTAC TGCTTCCTGT ACACGCCGTA TTCGCTGCCC
ATCGTCAGTT CGCGTTTCCA GGGGCTGGAA CCCGCTCCGG CGGGGCTGAC CCACAATTTC
ACCCGCTGGT GGACCCCGAG GGAGCAGCAG CGGTACCGCA TGCAACAGTA G
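
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any composition statistic such as the GC content field can be recomputed. The sketch below shows one way to do that, assuming the sequence has been pasted into a string. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input in the same block layout as the record; not the real gene.
example = "GGCC AATT\nGG"
print(gc_content(example))
```

Applying `gc_content` to the full gene sequence gives a value that can be compared with the GC content field of the record.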
 
Protein sequence
MNRTASPAPR ATRAGLYLLA FLLCLLLAAC GDSAPRGGKD GGQNGTTGAG GNATSAQGGQ 
AAAPTPGEPV TGDRIILGTI GEPSNLLPFI ASDASSHEVS DQIFVAALRY DKDLNIEKWA
AASCEVLENG TLLRFTLRDD IRWEDGKPLT ADDVEFTYKL MVDPKTPTPY AEDYLAIQEF
RKTGPLSFEV RYAKPFARSL ITWMHGIMPK HLLEGQDLMN TPFSRKPVGA GPYRLKEWEA
GTRLTLEASP SYFLGRPYID EVVYRIIPDL STMFLELKAQ RLDMMNLSPQ QYLHQTNGAE
WDMAWRKFQY LSFGYSYLGY NLSLPMFRDV KVRQALTCAI DRKALVAGVL LGQGMPTVGP
YKPGTWVYND KIDDYPYDPA KARELLAQAG WVDTNGDGLL DKDGAPFAFT ILTNQGNDQR
IKAATIIQSQ LKSVGIDVKI RTVEWAAFIK EFVDKGRFDA VLLGWNILQD PDLYDVWHSS
KAVPGGLNFV GYKNPEVDDL LERARSTFDQ AERKKLYDRF QEILHRDQPY CFLYTPYSLP
IVSSRFQGLE PAPAGLTHNF TRWWTPREQQ RYRMQQ