Gene Dvul_2684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2684 
Symbol 
ID4662842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp3125657 
End bp3126727 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content66% 
IMG OID639820931 
Productpeptidase M24 
Protein accessionYP_968123 
Protein GI120603723 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.045343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0582018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAACA TCCGCTTCGA GGCCCGGCGC GAAAAGCTGC GGGCCGCCAT GCGCGAACGC 
GGGCTGGCGG CCCTGTTCGT CAGCCACGAC GCCAACCGCT ACTACCTTTC CGGTTTCGAA
CTGCACGACC CGCAGACCAA CGAGAGCGCG GGCTATGTGC TCGTCACCGC CGACGGGCGC
GACTGGATAT GCACCGACTC GCGGTATCTC GATGCCGCAC GGCGCATCTG GGACAACGAG
CGCATCTTCA TCTACGGTGC CGATGCCCCC GCGCAGATGA ACACCCTCAT CCGCGACCAT
GTACGCGGGA CGGTGGGCTT CGAAGCCCGT TCGGTGAGCC TCGAATTCTT CGAGAAGCTC
TCGCCCGGCC TAGCCATGGA GCGCGTCGAC GGTCTCGTGG AAGCGCAGCG CATCATCAAG
GAACCTGAAG AGATCGAGGT GATGGAGCGT TCATGCGCTC TCAACCATCG ACTCATGGAG
TGGGTGCCCT CCATCCTGCG GCCCGGTCGC ACCGAGGCCG AAGTGGCGTG GGACATCGAA
TCGTTCTTCC GTTCCAACGG CGCGTCGGAA CTCTCGTTCG CCAGCATCGT GGCGGTGGGC
CCCAACGGCG CGCTGCCGCA CCACCGTGGC GGGCGCGACG TCATCACCGA CAACTGTTCG
GTGCTGGTGG ATGTGGGCGC ACGTCTCGAC GAATACTGTT CCGACCAGAC CCGCACCTTC
TGGGTGGGTG ACAAGCCCGC CGACCATTTC GTGCGCGCAC TGGAACAGAC GCAGACGGCG
CAGGCCAAGG CCATCGCCGC CATGCACCCC GGCATGCGCG CCTGCGACGC CTACAAGGTG
GCGCGTGACC ACTTCGAGAG CGTCGGCGTG GCGGCGCACT TCACCCACGC ACTGGGGCAC
GGCATCGGGC TCGAGACGCA TGAACCGCCA AGCCTCAACC CCCGCAACGA GATGGTGCTC
AAGCCCGGCA TGGTGGTGAC CGTTGAGCCG GGGCTGTACT ATCCCGAGTG GGGCGGCATC
CGCTGGGAGT ACATGGTGCT GGTGACCGAA GACGGCGTCC GCGCCCTGTA G
 
Protein sequence
MDNIRFEARR EKLRAAMRER GLAALFVSHD ANRYYLSGFE LHDPQTNESA GYVLVTADGR 
DWICTDSRYL DAARRIWDNE RIFIYGADAP AQMNTLIRDH VRGTVGFEAR SVSLEFFEKL
SPGLAMERVD GLVEAQRIIK EPEEIEVMER SCALNHRLME WVPSILRPGR TEAEVAWDIE
SFFRSNGASE LSFASIVAVG PNGALPHHRG GRDVITDNCS VLVDVGARLD EYCSDQTRTF
WVGDKPADHF VRALEQTQTA QAKAIAAMHP GMRACDAYKV ARDHFESVGV AAHFTHALGH
GIGLETHEPP SLNPRNEMVL KPGMVVTVEP GLYYPEWGGI RWEYMVLVTE DGVRAL