Gene Dvul_1989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1989 
Symbol 
ID4663328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2318945 
End bp2320168 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content63% 
IMG OID639820230 
Productpeptidase M24 
Protein accessionYP_967432 
Protein GI120603032 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.758645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.208351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCACTG CCGCAGAACG CATTCCCGAC GATGAAGTCC GCCGCCGCCA CAGCCGCTGC 
CGTGCGGCCC TCGCAGACGT CGCCCCCGAG GCCTCCGGTC TGCTGGTCTT TGCCAGACTC
TCCATCTACT ACCTCACCGG TTCGCTCGGT AACGGCGTGC TGTGGCTACC CCGCGAAGGC
GAAGCCATGC TCTTCGTCCG CAAGGGCATT GAACGCGTTC TGCTTGAAAG CCCCATTGAA
CTTGTGCACC CCTTCCGTTC CTACGGCGAC ATCGTCGAAC TCGCACGCGA AGCAGGTTCC
CCATTGGGCG GGGTGGTTGC TGCCGAGATG GGAGGACTGC CATGGTCTCT CGCCAACCTG
CTGCAACAGC GCCTTCAGGG CGTTTCTTTC GTACCCGGCG ACATGGCGGT AACCCTCGCG
CGGGCCGTCA AGTCACCATG GGAACTGAAC AAGATGCGCC TTGCCGGGGC AAGGCATCAC
GAAAGCCTGC ACGAAGCCCT TCCGCAGCGC ATACGCCCCG GCATGACCGA ACGAGAGGTC
TCGCACCTCG CGTGGCAGGT CTTCTTCGAG CGCGGGCATT CGGGCATGAT GCGCATGTCC
GCCAATGGTG AGGAGATATT CCTCGGCCAT GTGGCCGCCG GAGAGAACGG GAACTACCCC
AGCCACTTCA ACGGCCCGCT GGGGCTCAAA GGCGAACATC CCGCCGTCCC CTACATGGGC
TATGCGGGGT CTGTATGGCG CAGGGGAACG CCGTTGGCCG TGGACATCGG CTTCACCCTT
GAAGGCTATC ATACCGACAA GACGCAAGTG TACTGGGCGG GTCCGCGTGC CTCCATCCCT
GACGCCGTGC TGCGCGCCCA CGAGACGTGC ATGGAAGTGC AGGCCCGCGC AGCCGCAGCC
CTGCGCCCCG GCGCCATCCC CTCCGCCATC TATCAGGACG CCCTGCAACT CGTCGGGGAG
TACGGACTCT CTGAAGGATT CATGGGAATT GGAAGCAACA AGGTACCGTT CCTCGGGCAC
GGCATCGGCC TTGCCGTGGA TGAACACCCG GTACTGGCCC GGCGTTTCGA TGCGCCTCTC
CAGACCGGCA TGGTCATCGC CATCGAACCC AAGATGGGCA TCCCCGGAGT GGGGATGGTG
GGAGTGGAGA ACACCTTTGA AGTGACGGAA GACGGTGGCC GCTGCCTCAC GGGTGACGAG
TACGACATCG TCTGCATCGA ATGA
 
Protein sequence
MFTAAERIPD DEVRRRHSRC RAALADVAPE ASGLLVFARL SIYYLTGSLG NGVLWLPREG 
EAMLFVRKGI ERVLLESPIE LVHPFRSYGD IVELAREAGS PLGGVVAAEM GGLPWSLANL
LQQRLQGVSF VPGDMAVTLA RAVKSPWELN KMRLAGARHH ESLHEALPQR IRPGMTEREV
SHLAWQVFFE RGHSGMMRMS ANGEEIFLGH VAAGENGNYP SHFNGPLGLK GEHPAVPYMG
YAGSVWRRGT PLAVDIGFTL EGYHTDKTQV YWAGPRASIP DAVLRAHETC MEVQARAAAA
LRPGAIPSAI YQDALQLVGE YGLSEGFMGI GSNKVPFLGH GIGLAVDEHP VLARRFDAPL
QTGMVIAIEP KMGIPGVGMV GVENTFEVTE DGGRCLTGDE YDIVCIE