Gene Dvul_1145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1145 
Symbol 
ID4662483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1394304 
End bp1396001 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content62% 
IMG OID639819374 
Productextracellular solute-binding protein 
Protein accessionYP_966592 
Protein GI120602192 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00838015 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.546746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTTT TGCGTTGTCT GACCATCGTA TCCCTTGTGG CTGTGTGCCT TTGTGCGGGA 
TGTGGCGATG ATACTGCCCC CACGGTATCC AAAGAGGCAC CGCAGGTGCC TTCCGGCAGT
ACCGCCGGCG AGGGCTTCCC GCCAGCAGGG CGACCCACCG CACAGGAAAT GGCGGAAGAT
GGCGGACGCA TCATCATGGG GTCCATCGGG GAACCGTCGA ACCTCATCTC GTTCCTTTCC
AGCGATGCGT CCTCGCACGA GGTCGCAAGC CAGCTTTACG TGGGGCTCTT GCGGTATAAC
CGTGACCTCG AGATCGAGCC TTGGGCGGCA GAGTCCTACG AGGTGCTCGA CGGGGGCAGG
CTGCTGCGGT TCAAGCTGCG CAAGGGTATC CGGTGGCAGG ACGGCGTGGA ACTCACGGCG
GATGACGTCG AGTTCACCTA CAAGCTTATC ATCGCGCCCA CGACACCGAC ACCGTACGCG
GGCGACTTCC TTGCCGTGCA GGAGTTCCGC AAGACGGGCC GCTACAGCTT CGAGGTGCGC
TACGACAAGC CCTTTGCCCG GTCGCTCATA AGCTGGATGC AGGACATCAT GCCGAAGCAC
CTGCTGGAAG GACAGGACGT GCGGACGACC CCCTTCGCGC GCAAGCCGGT GGGGGCGGGG
CCGTACATGC TCGAATCGTG GGAACCCGGA ACCCGTCTGG TGCTGCGTGC CAACCCAGAC
TACTTCGAGG GCAGACCGCA TATCGACGAG GTCGTGTACC GCATCATTCC CGACAACGCG
ACCATGTTCC TCGAACTCAA GGCGGGCAAG CTCGACATGA TGGGGCTCTC GCCGCAGCAG
TACCTGCGGC AGACCGATGG CTACACATGG GAGCGTGACT GGCGCAAATA CCGCTATCTT
TCGTTCGGGT ACACCTATCT CGGCTACAAC CTGAAGCATC CGTTCTTCGC CGATGCGCGT
GTCCGCCGTG CCATCGCCCA TGCCATCGAC CGCGAGGGCA TCATCAAGGG CGTGCTTCTG
GGACAGGGGG TGCCCACGGT CGGTCCCTAC AAGCCCGGCA CATGGGTATA CAACGACAGG
TTAACGGCGT ACTCCTATGA TCCTGCCCTC GCAGCCGAGA TGCTGCGCGA AGCGGGGTGG
CAGGACACGG ACGGGGATGG CATCCTCGAC CGTGAGGGAA GGCCCTTCGC CTTCACCATC
CTGACCAACC AGGGCAACGA CCAGCGTATC AAGACGGCAA CCATCATCCA GAGCCAGCTC
AAGGATGTGG GAATACGTGT ACAGATACGG ACGGTGGAGT GGGCCGCGTT CATCAAGGAG
TTCGTGAACA CGGGCAGGTT CGATGCCGTC ATTCTCGGCT GGAACATCAC GCAGGACCCG
GATGCCTATG ACGTGTGGCA TTCCTCCAAG GCTGAACCCG GCGGGCTCAA TTTCGTGGGC
TATCGCAATG CAGAGGTGGA CGCCGTACTA GAAAAGGCCC GTCGCACTTT CGACCAGGAC
GAGCGCAAGC AGTACTACGA CCGTTTCCAG GAGATCGTCC ACCGCGACCA GCCCTACTGC
TTTCTGTACG TGCCGTATGC CCTGCCGGTG GTGGCAGCGC GTTTTCGAGG CATCGACCCC
GCCCCGGCGG GCCTCATGCA CAACTTCAAC CGCTGGTGGG TTCCCGTCGA CCAGCAGCGC
TCAACGGTGC AGCAGTAA
 
Protein sequence
MSVLRCLTIV SLVAVCLCAG CGDDTAPTVS KEAPQVPSGS TAGEGFPPAG RPTAQEMAED 
GGRIIMGSIG EPSNLISFLS SDASSHEVAS QLYVGLLRYN RDLEIEPWAA ESYEVLDGGR
LLRFKLRKGI RWQDGVELTA DDVEFTYKLI IAPTTPTPYA GDFLAVQEFR KTGRYSFEVR
YDKPFARSLI SWMQDIMPKH LLEGQDVRTT PFARKPVGAG PYMLESWEPG TRLVLRANPD
YFEGRPHIDE VVYRIIPDNA TMFLELKAGK LDMMGLSPQQ YLRQTDGYTW ERDWRKYRYL
SFGYTYLGYN LKHPFFADAR VRRAIAHAID REGIIKGVLL GQGVPTVGPY KPGTWVYNDR
LTAYSYDPAL AAEMLREAGW QDTDGDGILD REGRPFAFTI LTNQGNDQRI KTATIIQSQL
KDVGIRVQIR TVEWAAFIKE FVNTGRFDAV ILGWNITQDP DAYDVWHSSK AEPGGLNFVG
YRNAEVDAVL EKARRTFDQD ERKQYYDRFQ EIVHRDQPYC FLYVPYALPV VAARFRGIDP
APAGLMHNFN RWWVPVDQQR STVQQ