Gene Dvul_0798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0798 
Symbol 
ID4664117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp981404 
End bp982624 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content67% 
IMG OID639819019 
Producthypothetical protein 
Protein accessionYP_966246 
Protein GI120601846 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGGTC TGGGCATCAT CCTCAAGTCG CTGGCGGCGC ACCGTACCCG CGCGTTGCTG 
GCCATGCTGG GCGTCTTTCT GGGGGCGCTG GCCCTCACCG CCGTGATGCA CGTGGCCGGG
GCCATGGTGC TGAAGGCTGA CCTCGAGACG CAGAAGCTGG GGCCCAATCT GCTTCAGGCC
CTTTCTGGGC AGGTGCGTTT CCGGCGTGAC GGCCCTTCGG GTGTTAGCGG CGTGAACCGT
ACATTCACCC TTCAGGACGC CGAGGCCATT CTCGGGGGCG TGGCGCAGGT GCGCGACGGC
GTACCCTACT GCAACGCCCC CATGCCGGTG CGCTACGGCA GCATCAAGAC CACGTCGCAA
CTGGTGGCGA CCCTGCCCGG ATTCGCTCGC GTCAGGGCCT ACAGCCCCGC ATACGGGCGA
TTCATCTCCG ACGAGGACGA AGCAGGCCGG GCGCTGGTGT GCGTGCTGGG CACGGCCATC
GCCACTCGCC TGTTCGAAAG ACCCGACAAT GCCGTGGGGC GCACCGTGTT CTTCTTCCGC
GCCCCGGTGC AGGTCGTGGG CGTCATGGAG GAGAAGGGGC AGGACGTGTC GGGCACGAAC
CTTGACGAAC AGGTCTATGT CCCTCTTTCC ACGTACATGC GGCGCATGGC CAATCAGGAC
TGGATAAGCG GCGTCTACAT GAACCTGCAT GACGGGGCCG ACGAGGAGGC CGCAAGGGCG
GCGGTGACAG CCATCCTGCG TTCGCGGCAT CTCATCGCCG CAGGGCAGAA GGACGACTTC
TCCGTCCTTT CCGCACGCGA CGCCAACAAG CTGCGCAAGG AGGCCCTCGA CCTCGTGCAG
ACGCTCGGGG TGCTCAGTTC ATCCATCTCG TTCGCGGTGG GCAGCCTCGG CATCCTGTCC
ATCATGACCC TGCTGGTGCG TGCACGCAGG CTGGAGATAG GTGTGCGCCG TGCCGTGGGG
GCCAGCCGCA ACGTCATCGT GCGGCAATTC CTCGCAGAGG CGGGACTCAT GGCAGGGGTG
GGCGGCACGC TTGGTGTCGT CGTGGCCCTT GCACTGGTGA CCGTGGTCTA TGCCGTGGGC
GACTTCCCCT ATACCTACGA CCCGCTGCTG GCCGCCGGGG CCTGCATCGC CTCGGTGGTG
CTGGGCGTCG CGGCCGGGGC CTATCCCGCA TGGCAGGCGT CACGGGTCGA CGTCCTCGAC
GTATTGCGGC ACCCCGAATA G
 
Protein sequence
MRGLGIILKS LAAHRTRALL AMLGVFLGAL ALTAVMHVAG AMVLKADLET QKLGPNLLQA 
LSGQVRFRRD GPSGVSGVNR TFTLQDAEAI LGGVAQVRDG VPYCNAPMPV RYGSIKTTSQ
LVATLPGFAR VRAYSPAYGR FISDEDEAGR ALVCVLGTAI ATRLFERPDN AVGRTVFFFR
APVQVVGVME EKGQDVSGTN LDEQVYVPLS TYMRRMANQD WISGVYMNLH DGADEEAARA
AVTAILRSRH LIAAGQKDDF SVLSARDANK LRKEALDLVQ TLGVLSSSIS FAVGSLGILS
IMTLLVRARR LEIGVRRAVG ASRNVIVRQF LAEAGLMAGV GGTLGVVVAL ALVTVVYAVG
DFPYTYDPLL AAGACIASVV LGVAAGAYPA WQASRVDVLD VLRHPE