Gene Dvul_1034 details


Gene Information

Locus tag: Dvul_1034
Symbol:
ID: 4664097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 1267852
End bp: 1269369
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 62%
IMG OID: 639819258
Product: extracellular solute-binding protein
Protein accession: YP_966481
Protein GI: 120602081
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein
TIGRFAM ID:
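
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1267852
end_bp = 1269369
gene_length_bp = 1518
protein_length_aa = 505

# Assumption: coordinates are inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon that does not contribute an amino acid.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # True
print(span_bp % 3 == 0)                              # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1518 bp is 506 codons, of which 505 encode amino acids.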


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.121119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAAAT CCTCCGGGTG GTCACGCGTT GTCGTGCTGT TCTGTTTCAT GTTCGCCCTG 
ACAGCGCCCT GTCTGGTTGT CGTGTCGGGT ACAGCCGCCG CCATGGTCGC TGCCCAGGAG
GATGACCCGG CGACGTTGCC GGAAGACCGC ATCCAGGACC TCATCCTCCC TGCGGAGAAG
GCCTTCTCGG GTGATGTGAA GGAGATAAGG CAACGTGGAG TGCTTCGTGT TCTCGTCACC
TACCGCAAGG GAGATTTCTT CATCGCTGAC GGCGAACTGC GCGGCGTGGA AGTCGAACTC
GCCCGTGCAT TCGCCGCATG GCTTGGCAAG AAGGGGGGCA AGAAGGCCCT GCCTGTCCGC
GCCATCTTCA TTCCGGTCGC GTTCGATGAA CTGTTGACGG CGCTTGAGCA GGGCAAGGGG
GACATTGCCG CTGCCGGGTT CACCGTCACG GAGGCACGTA GCGCACGCGT CCCGTTCGCC
ACGCCGTACC TGCGCGGCAT CGACGAGGTC TTTGCCATCC GCAAGGGTGC ACCCGTGCCA
ACGAGCCTGG ATGAACTCGC GGGCAGGACG GTGCATGTCG TCCGGGGCTC AAGCCACGAG
CAGCATCTGG GCGAACTCAA CGTCCGTCTT GCCGCACAGG GCCTTGCGCC CTTGAACATC
GTGACGCCAT CGGCCGACCT GCAACCGGAG GACCTCTTCG ACCTTCTCGG GACAGGTGCC
ATAGACCTGA TGGTCGCCGA CAGTCATCGT GCCCGGCTGT GGCGACGCGC CATGCCCGAT
GTGCAGGTCG TGCCGTCACT GCAGTTGAAG ACGGGGCAGG ACATCGCATG GGCGGTACGC
CCCGATGCCC CGGGGTTGCT GCATGAAGTG AACGCCTTCT TCGCTGCCGA TGGGGGGAGG
GCCGTGAAGA AGGCTGCTGG ACTGCTGGAA CGCTATTACG CAGACAGGTC GTGGCACGTC
GAAGGGCTCA ACCGCAAGTT CGCTGCCCGG GCAAAGCGTC TCTATCCCCA TTTCATCCGC
TATGGCGACA CCTATGCCTT CGACCCGTTG CTCTTGCTTG CGCAGGGCTA CCGGGAGTCG
CGTCTCAACC AGAAGCTGCG CAGTCCACGC GGTGCCGTCG GAGTGATGCA GGTGCTACCT
TCGACAGCCC GGACCATGGG TTTTCCCGAT GTCGTGAAGG AGGCGGTGAC GAACATCCAT
GCCGGGGTGC GCTATCTTGA ATATGTGCGG TCGGACTACT TTTCCGATGC CGATATCCGC
GAACCGGACA GGACGCTGTT CAGCCTTGCC GCCTACAATA TGGGCCCCAA CCGCATGGCC
CGTGTCCGCG AACGTGCCAT ACGGATGGGA CTCGACCCCA ATCGCTGGTT CGGCAATGTG
GAATATGCGG CGTTACGATA CGTCGGGCGT GAACCTGTGA CCTACGTCGC TCAGATTTCA
TCGTATTACA TCGCCTATCA GGGCAGTCAT GCCGTGACCG GTGCCCGTCG CCCGGTGCTG
GAGGCACTGC AGAAGTGA
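
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any length or composition check. The following is a minimal sketch of that cleanup and of a GC fraction calculation that could be compared with the Gene Length and GC content fields; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool associated with this record.

```python
def clean_sequence(text):
    """Remove the block spacing and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence in this record.
first_line = "ATGCCGAAAT CCTCCGGGTG GTCACGCGTT GTCGTGCTGT TCTGTTTCAT GTTCGCCCTG"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq) * 100))
```

Passing the full sequence block instead of `first_line` gives values for the whole gene.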
 
Protein sequence
MPKSSGWSRV VVLFCFMFAL TAPCLVVVSG TAAAMVAAQE DDPATLPEDR IQDLILPAEK 
AFSGDVKEIR QRGVLRVLVT YRKGDFFIAD GELRGVEVEL ARAFAAWLGK KGGKKALPVR
AIFIPVAFDE LLTALEQGKG DIAAAGFTVT EARSARVPFA TPYLRGIDEV FAIRKGAPVP
TSLDELAGRT VHVVRGSSHE QHLGELNVRL AAQGLAPLNI VTPSADLQPE DLFDLLGTGA
IDLMVADSHR ARLWRRAMPD VQVVPSLQLK TGQDIAWAVR PDAPGLLHEV NAFFAADGGR
AVKKAAGLLE RYYADRSWHV EGLNRKFAAR AKRLYPHFIR YGDTYAFDPL LLLAQGYRES
RLNQKLRSPR GAVGVMQVLP STARTMGFPD VVKEAVTNIH AGVRYLEYVR SDYFSDADIR
EPDRTLFSLA AYNMGPNRMA RVRERAIRMG LDPNRWFGNV EYAALRYVGR EPVTYVAQIS
SYYIAYQGSH AVTGARRPVL EALQK
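
The protein sequence can be related to the gene sequence by codon translation. The following is a minimal sketch, assuming the amino acid assignments of translation table 11, which match those of the standard genetic code (the two tables differ in the set of permitted start codons, which this sketch does not model). `translate` is a hypothetical helper name.

```python
# Amino acid assignments in the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence in this record.
print(translate("ATGCCGAAAT CCTCCGGGTG G"))  # MPKSSGW
```

The output matches the first seven residues of the protein sequence shown above.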