Gene Dvul_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1787 
Symbol 
ID4662515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2085687 
End bp2087657 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content62% 
IMG OID639820027 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_967231 
Protein GI120602831 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.124146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACCAGT TTTCGCGGAA CCTGGTGCTC TGGGCGACCA TATCGCTGTT GATGGTCGTT 
CTGTTCAACC TGTTCAATCA ACCACAGGGC ACGCAGCAGA GGGTCACCTA TAGCGAGTTC
CTGCAAAGGG TCGAGAAGGG CGAGGTCGTC GAGGTCACCA TCCAAGGGCA GAAGCTCTCT
GGCAAGACCA CCGAAGGCAA ACCCTTCCAG ACATTTGCGC CCGAAGATCC CTCGCTGGTA
AGCCGCCTCC TCGACAAGAA GATCGAAGTC AAGGCAGAAC CGCAGGAAGA GGCCGCGTGG
TACATGACGC TGCTCGTTTC GTGGTTCCCC ATGCTTCTTC TTATTGGTGT GTGGATTTTC
TTCATGCGTC AGATGCAGGG CGGCGGTGGC AAGGCCATGT CGTTCGGGCG TTCTCGTGCG
CGCATGATCA CGCAGGAGTC GGCACGCGTC ACTTTCGAGG ACGTTGCCGG TGTCGATGAG
GCAAAAGAGG AACTCAGCGA GGTCGTCGAG TTCCTGTCCA ACCCCCGCAA GTTCACCCGC
CTCGGCGGGC GCATCCCCAA GGGTGTGCTG CTTGTCGGTC CTCCCGGTAC GGGCAAGACG
CTGCTGGCGC GGGCTGTTGC CGGTGAGGCA GGGGTGCCCT TCTTCTCCAT CTCCGGTTCG
GACTTCGTGG AGATGTTCGT GGGCGTGGGC GCCTCGCGCG TGCGCGACCT GTTCATGCAG
GGCAAGAAGA ACGCGCCGTG CCTCATCTTC ATCGATGAAA TCGACGCAGT CGGTCGTCAG
CGTGGTGCTG GCCTCGGCGG CGGGCATGAC GAACGTGAGC AGACTCTCAA CCAGCTGCTG
GTGGAGATGG ACGGCTTCGA ATCCAACGAG GGCGTCATAC TCATCGCAGC CACCAACCGC
CCCGACGTCC TCGACCCGGC GTTGCTGCGC CCCGGTCGTT TCGACAGGCA GGTGGTCGTG
CCCACACCGG ACGTGCGCGG TCGCAAGCGC ATCCTTGAAG TGCACGGCAG GCGCACCCCG
CTGTCCAGCG GGGTCAACCT CGAGATCATC GCCAAGGGCA CCCCGGGCTT TTCCGGTGCC
GACCTTGAGA ATCTCGTCAA CGAAGCCGCG TTGCAGGCTG CAAAGCTCAA CAAGGACGTC
GTGGACATGG GGGATTTCGA GTACGCCAAG GACAAGGTCC TGATGGGCAA AGAGCGTCGC
AGCCTCATCC TGAGCGATGA AGAGAAGCGC ATCACGGCCT ACCATGAAGC CGGTCACGCC
CTTGCTGCCA AGCTCATTCC CGGTTCAGAC CCCATCCACA AGGTCACCAT CATCCCCCGC
GGCAGGGCTC TTGGCGTTAC CATGCAGTTG CCGGAAGGCG ACAGGCACGG CTATTCGCGC
AACTACCTGC TGGGGAACCT CGTGGTACTG CTTGGCGGAC GTGTCGCGGA GGAGATCATC
TTCTCAGACG TGACCACAGG TGCGGGCAAC GATATCGATC GGGCCACCAA GATGGCTCGC
AAGATGGTCT GCGAATGGGG CATGAGCGAG GCCATCGGCC CGCTGGCCAT CGGTGAACAG
GGTGAAGAGG TGTTCATCGG GCGTGAATGG GCCCACTCGC GCAATTTCAG CGAGGAGACG
GCACGCCTGG TCGACGCCGA GGTCAAGCGC ATCATCGAAG AGGCTCGCCA GCGTTGCCAC
ACCCTGCTTG AAGAGAACCT GACTGCCCTG CACGACATCG CCAATGCCTT GCTGGAACGC
GAGACCATCA GTGGTGATGA CATCGACATC CTCATGCGTG GCGAGAAGCT GCCTCCCGAA
AGGGGAAACG GCGGGGCCCC GGCAAACTCC GGGACGACAC CGCCAGCAGG GAACGCCACG
GACGGGGCAA AGGCCGCGCA GTCGCCGCAG GCTTCGGAAG ACGCCGCTAC CCCGGCCGAG
GAGTTCACCC TCGAAGCTGA AGAACCCGAA AAGAGGGACG ACCGCGCATG A
 
Protein sequence
MNQFSRNLVL WATISLLMVV LFNLFNQPQG TQQRVTYSEF LQRVEKGEVV EVTIQGQKLS 
GKTTEGKPFQ TFAPEDPSLV SRLLDKKIEV KAEPQEEAAW YMTLLVSWFP MLLLIGVWIF
FMRQMQGGGG KAMSFGRSRA RMITQESARV TFEDVAGVDE AKEELSEVVE FLSNPRKFTR
LGGRIPKGVL LVGPPGTGKT LLARAVAGEA GVPFFSISGS DFVEMFVGVG ASRVRDLFMQ
GKKNAPCLIF IDEIDAVGRQ RGAGLGGGHD EREQTLNQLL VEMDGFESNE GVILIAATNR
PDVLDPALLR PGRFDRQVVV PTPDVRGRKR ILEVHGRRTP LSSGVNLEII AKGTPGFSGA
DLENLVNEAA LQAAKLNKDV VDMGDFEYAK DKVLMGKERR SLILSDEEKR ITAYHEAGHA
LAAKLIPGSD PIHKVTIIPR GRALGVTMQL PEGDRHGYSR NYLLGNLVVL LGGRVAEEII
FSDVTTGAGN DIDRATKMAR KMVCEWGMSE AIGPLAIGEQ GEEVFIGREW AHSRNFSEET
ARLVDAEVKR IIEEARQRCH TLLEENLTAL HDIANALLER ETISGDDIDI LMRGEKLPPE
RGNGGAPANS GTTPPAGNAT DGAKAAQSPQ ASEDAATPAE EFTLEAEEPE KRDDRA