Gene Dvul_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1109 
Symbol 
ID4662534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1347971 
End bp1349797 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content66% 
IMG OID639819338 
Producttype II secretion system protein E 
Protein accessionYP_966556 
Protein GI120602156 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.280352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0921606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTCG CCGAACGCCT CATGCGCGCG ACACGACGCC AGCAGGAAAC GGACGACGTA 
CCCGCCGCCC CGGTTCATGC GAGACGGGCC ACCGACCGCA TCGCCATGCC CGACACCCTG
CGTGGTGACA CCGGGACACA CGGGCCCGAG AACGCCCCTT CTTCGCACCT CGACGTCACC
ACGGCATCGG CCGGGCAACC GGACACGCAC AGCCAGACCG GCATGCCGTA CGATGTGGCG
GTCGATGGCC TGCACGTCCG GCCTCCGCAA GACGGCCCTG CATCCGGACT TTTCCAGACG
ACCAGCGACC AGCCCCCCAC CATTGCGGAC GGCGAGAACG ACGAGAACCC GATTCTGCCC
CCTGCGGCCA CCACGGCGGC ACCGCGACGC CCTGTGGTCG CGCCGTCGTC ATCAAGGCGC
ACGGACGACG CCACCGTGTC CCCCGCCGCT GCCGAAACAC AGGCAACCGC CCCGGCACAG
GCCGAGCAAC CCCTGCGGGC GGCCCGCAGC ACCCCCCGGG CCGCGGAACA GGTCGTGGAC
GTGAGCAAGC TCACCCTGCA CGGTGACCAC TATTACGAAA TCAAGGAACA TCTGCTCGAC
CGACTGCTTG AGCTTCTCGA CCTCGCCGCC GTGGAGTCGC TTCCCCCCGA ACGACTTGGC
GACGAGATTG GCCGCCTCGT CGAGAGGCTC ATGCGCGACG AGTTCAGGCA AGCCCCCCTC
AACGCCAGTG AACGCCGCCA GATAACCGGT GACATCCGCG ACGAGATCCT CGGCCTCGGC
CCGCTGGAAC CGCTGCTGCA CGACCCCACC GTCAACGACA TCCTCGTCAA CAACTACAGG
CAAATCTACG TCGAACGCCG CGGCAAGCTC ATCAAGGTCA ACACGCGCTT CCAGGACGAC
GACCACCTGC GCAAGATCAT CGACCGCATC GTCTCGCGCA TAGGCCGCCG AGTGGACGAA
TCGTCGCCCA TGGTCGACGC CCGCCTTGCC GACGGGTCGC GCGTCAACGC CATCATCCCG
CCTCTGGCGC TCGACGGACC CAGCCTGTCC ATCCGCCGCT TCTCGAAGGA CCCTCTGGAG
TTGCACGACC TCATCGGCTT CGGCGCCCTG ACGCCGGAGA TGGGCGAAGT GCTGCAGGGC
ATCGTCAAGG CGCGGCTGAA CATCATCGTC TCGGGCGGAA CAGGGTCGGG CAAGACCACC
ATGCTCAACT GCCTTTCGCG TTTCGTGCCG CACGACGAAC GCATCGTGAC CATCGAGGAC
GCCGCAGAAC TCCAGCTCAA ACAGGAGCAT GTGGTGCGCC TTGAGACACG GCCCGCCAAC
ATCGAGGGAC ACGGCGAGGT CACGGCCCGC GACCTTGTGA AGAACTGCCT GCGTATGCGC
CCCGACCGCA TCATCGTCGG CGAAGTCCGT AGCGGTGAAG TGCTCGACAT GCTGCAAGCC
ATGAACACCG GTCACGACGG GTCGCTGACG ACCATCCACG CCAACACCCC GCGAGACTGC
CTGATGCGCC TTGAGACCAT GGTCGCCATG GCAGGGCTGA ACATCAGCAC CCTTTCGCTC
AAACGCTACA TATCCTCCGC CGTGGACGTG ATCATACAGG TCTCACGCCT CTCGGACGGT
TCACGCAAGC TCACCAGCCT GATGGAACTG ACCGGCATGG AAGGCGAGGC CATCACCATG
CAGGAAATCT ACAGCTTCGA GCAGACCGGA GTGGACGACA AGGGCAAGGT TCAGGGGCAC
TTCCGCAGCG GCGGCATCAG GCCGAACTTC GCCCCGCGCC TAGCGGCCAT GGGCATCCAT
CTCGGCGGCA GCCTCTTCGA CGTCTGA
 
Protein sequence
MRLAERLMRA TRRQQETDDV PAAPVHARRA TDRIAMPDTL RGDTGTHGPE NAPSSHLDVT 
TASAGQPDTH SQTGMPYDVA VDGLHVRPPQ DGPASGLFQT TSDQPPTIAD GENDENPILP
PAATTAAPRR PVVAPSSSRR TDDATVSPAA AETQATAPAQ AEQPLRAARS TPRAAEQVVD
VSKLTLHGDH YYEIKEHLLD RLLELLDLAA VESLPPERLG DEIGRLVERL MRDEFRQAPL
NASERRQITG DIRDEILGLG PLEPLLHDPT VNDILVNNYR QIYVERRGKL IKVNTRFQDD
DHLRKIIDRI VSRIGRRVDE SSPMVDARLA DGSRVNAIIP PLALDGPSLS IRRFSKDPLE
LHDLIGFGAL TPEMGEVLQG IVKARLNIIV SGGTGSGKTT MLNCLSRFVP HDERIVTIED
AAELQLKQEH VVRLETRPAN IEGHGEVTAR DLVKNCLRMR PDRIIVGEVR SGEVLDMLQA
MNTGHDGSLT TIHANTPRDC LMRLETMVAM AGLNISTLSL KRYISSAVDV IIQVSRLSDG
SRKLTSLMEL TGMEGEAITM QEIYSFEQTG VDDKGKVQGH FRSGGIRPNF APRLAAMGIH
LGGSLFDV