Gene Dvul_1798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1798 
Symbol 
ID4662671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2099830 
End bp2103267 
Gene Length3438 bp 
Protein Length1145 aa 
Translation table11 
GC content61% 
IMG OID639820039 
Producthypothetical protein 
Protein accessionYP_967242 
Protein GI120602842 
COG category 
COG ID 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.314403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCCA AGGGCTTCAC ACTCATAGAG ACCGTGGCGG TACTGCTCAT CGTCGGAGTC 
ATCGCTGCCG TGGCAGGTGT GGGCATCGTC AGCGGCGTGC GCGGCTTCGT GCAGGCACGC
GAAGCCGGTG CGATGGCCAT GGAGGCACAA CTTGCGCTCG ACCGCATCAC CCGCGAAGTG
ATTGAATTGG TCGCCGTCCC CGCAGACAGC AGCGCCACCC GTCTTGTGGT GCGTAACGTG
GGGGCGGGCG GCGCGGGGCA ATTCGACAGG TCGATAGAAT ATGTTGCCAA CGCCCGCGAG
ATTCGCATCG CCAACGGCGC ACAGGGCGCA CAGAATGGTG ACACGCTCAT CGACAACGTC
ACGGCCTTCA GCCTCAACTA TTGGCAGGAA GATGTCTCCA CGGCCACATG GTCGGCTGGT
ACAGACCCGC GACTCCTCTC CGGCGTGGAC GTGACCTTCA CCCTCACCGC GCCCGGCGGC
ACCACACGCA CCTTCAGCAA CCGCATCGTA CCGCGCAACA ACGAGAATCG CGGCGGCGCA
CTGCCAGAAG CCGTGCCGCC TTCTCTGGAA CGCTTCGACA TCTGCTTCGT GGCTACCGCT
GCCAGCGGTG ACACAAGCCA CCCGGTGGTG GTGCAGCTAC GAGAATTCCG TGACAGGCTT
CTGCTCACAT GGGGCGGGGG GCGCATGCTG GTACGCGCCT ATTACACCGT GGGGCCGCGA
CTCGCCGACA TGCTGCAGGG GCATGATACG GTACGGGCCG CTGTCATGGG CATACTGACG
CCCCTTGCCG CGCTGTGCGG CATGGCCGTG CACGCCCCGA TAGCCCTTGG TACCCTGTTC
ATGCTGTCTC TCGTGCTTGG GCACATGCTG GCGGCGGCAT TGCACGGCGG CACCCTGCAC
AGAGAGACCA CGCCGTTACA GACGGCTGCC GCCGACAACA ACAGCCCCAC CACCGCAAGG
CATCTCTCCG GCAACGACAC CGGCAAGTCT GACGGCACCT GTATTGCGCG CCACGACCCG
ACCGCCGCTG ACCATGGTCG CAACGGCGGG CGTACGCGCG GTGCTGTGCT CATCAGCGTC
ATCGTCACCA TCGTGGCCTT TTCGGTCATC GGCGCAGCCA TGGTGCCCAT GATGACGGCT
TCGACGCAGA ACAGCTATTT CGCCGTGCAG GGCGACCAGG CCTATTATCT CGCGGAGTCG
GGCTTCGGCG CTGCGGGCAG CATGTTCCTT GCCGCCGGAG ACGAACAGGC CCGCAAGAAC
CTTCTCCAGA CCATGGACGG TTCGACCTAT ACCTTCGCCA ACAATGCAGG GGCATTCCGG
CTTGGGATAG AGCCCTACTG GTTCGAGGCG ACCAGCAACA CCGGAACGCG GCTGGTCACG
CGCGTATACG GCACCGCCCC CACACTGCCC ACCAACACCA CGGGGCGGCT ACGCATCGGC
AACGCCGCAG TCTTCTACAC CTACAGCAAC ATTCAGGCAT CGGGCAACAC GGTCACCTTC
ACCCTGACGC AGACGCCTTC CCCCCCTATC GATACGGGTG CGGACATCTT CCTGAGTGCC
ACGCCCACAA GCACATCCGT GGGCAATAAC GGAGACCTGA CGCTCAACTC GGCACATGCC
GCAGCCTTTC CCAACGTGAA CGGCATGTTC ACCGTCCGCG GCGGGCCCGT CGCCAACAAT
GGACGCATCG CCTACGTCTA CCGGCGCAAG AACGGAAACA CCCTTGAAGA CGTGCGCCTC
GTCGAAGGTC AGGGCGTGAC GTGGTCCGAC ATCTCGCTGT CTTCCGCCTC TGACGTGACC
CTTGAATCGT ACCTGCGGTT GCACTCCACG GGCATCCCCG CAGGGGGCAT GGCGCGCGAG
GTCATCTACA ACGTGCCCAT CGGCTGGATT CTTGGGGGGG GCAGTTTCAG TAAGGAACAG
TATCAGGACA ATTTCGCAAG CCTCGCCAAC TGGTTCACGG GCGATTCCAG TCAAGGCCAT
CTGGGGACGC ACGCCCTGTC CGGGGGGGCG CTACGCGTTA CCGGGATGCA GACGGCTGCC
ACCTCCGGCT TTGGCGGTTT TCTGTCATGG CTTTCGGGTG ATAACCAATG GAGTTCGCTG
TTCTTCAACT GGGGCAGGAC GAACGTCAAC CTCGCACGGG GATGGGCCGA CACTTACGGC
AACTCAAGTT ACGACCTCCA GTTCAAGGTC AATGTCAATC AGGATTCGGG CAAGGCTTTC
TTCGGTGGCC TCCTCTTCAG AGGACGTAAC AGCGGTTCAG ACGACCTTGA CGGCTACGGG
ATATCGTTCG TCCGTTTTCG ACAGAGGCGT ACATGGCTCA ATGACAGATG GTACTGGCCC
AACGACGTTC CCAGTTCACT CGTACCGGGC TACAACGCCA CGCCCAACCC CGATGTGGGC
GGCCCCCTTT TCGGCGATAA TGAAGACCTG AACCAGATTG TCGATGAGGG ATGGTGGATT
TTCTATGAAG AGAGCCGCTA TTCGTGGCCT GCCATCATGC TCTGGGAACG CAGGAATGGC
CAATTTCGCT GGCTTGCCTA CAAGAAGCTT GGAAGTTCAT CGGGTATTGT CACCTATAGT
AAGAATGGCC CCACATACAG ACTCGACGCG TGGCCGACCC TGCTGGTGCG CCTTGTCGAA
GGACAGGAAT TGCAGTTCAC CAACGGCGGC GGGTCTGACG GGGCAGGCGG TATACTGCGT
ATCAACTACG GCGACGAAAT CATCACCCAG ACAGGCGCCA AGGCACGGGT CATCGGTCAG
CCCATCGTCG AAAGCGGCGA CTGGACATCC GGCACGGCTT CAGGGCGGCT AGTGCTGACC
AACGTTGATA CAGGGGCGAC GGGGAACTTC AGCAACGGCC AGACCCTGAA AGTCAACAAC
GTGAGTCACG CGACCATAGG CACCGGCGGA CTGGGGGCGA AGACCAACTT CATTCGCGTC
TACTACTCAG ACGCCGAACG CAACGGCACG GGAGACGCGA ACCCCTGCAC TCCAGAAGCC
CCCGGCGGCA ACGGAGGCTT CTCGTCCGCC GACAGGCGCT CTAACCCGCG CCTTGGCGAC
AATGACAAAC TGCGCTGGAT ACCCGACGAC TACGAGCAGT GGAAAGCTGC CACCGACTAT
TTCTCTCTGG TAGAATGGGA TGTCGTGAAC ACGTCAGGTG GGACGAACTC CGGGAACGCG
ACGCTGGTGG GTTCCGGGGT GGCGGGCGTT TCAGCCCTTA ACGAGTACAT CAACGGTTCG
CAACGCACAC GAACGATTGT CCGGAGTACC AACCTGCTTT CACCGACCTA TGACCCTGAC
AACCCAGTCT ACAGCCCGTC AGAAGGTATC TCCATCGTCA CCAGCGGACC GACGGGAACC
AACTTCTATT TCGACGACTT CGGGCTGCAA CTCGACCTGC GTGGCGGCAA GGGATTTCTG
CCCCCCATAC AGCAGTGA
 
Protein sequence
MKAKGFTLIE TVAVLLIVGV IAAVAGVGIV SGVRGFVQAR EAGAMAMEAQ LALDRITREV 
IELVAVPADS SATRLVVRNV GAGGAGQFDR SIEYVANARE IRIANGAQGA QNGDTLIDNV
TAFSLNYWQE DVSTATWSAG TDPRLLSGVD VTFTLTAPGG TTRTFSNRIV PRNNENRGGA
LPEAVPPSLE RFDICFVATA ASGDTSHPVV VQLREFRDRL LLTWGGGRML VRAYYTVGPR
LADMLQGHDT VRAAVMGILT PLAALCGMAV HAPIALGTLF MLSLVLGHML AAALHGGTLH
RETTPLQTAA ADNNSPTTAR HLSGNDTGKS DGTCIARHDP TAADHGRNGG RTRGAVLISV
IVTIVAFSVI GAAMVPMMTA STQNSYFAVQ GDQAYYLAES GFGAAGSMFL AAGDEQARKN
LLQTMDGSTY TFANNAGAFR LGIEPYWFEA TSNTGTRLVT RVYGTAPTLP TNTTGRLRIG
NAAVFYTYSN IQASGNTVTF TLTQTPSPPI DTGADIFLSA TPTSTSVGNN GDLTLNSAHA
AAFPNVNGMF TVRGGPVANN GRIAYVYRRK NGNTLEDVRL VEGQGVTWSD ISLSSASDVT
LESYLRLHST GIPAGGMARE VIYNVPIGWI LGGGSFSKEQ YQDNFASLAN WFTGDSSQGH
LGTHALSGGA LRVTGMQTAA TSGFGGFLSW LSGDNQWSSL FFNWGRTNVN LARGWADTYG
NSSYDLQFKV NVNQDSGKAF FGGLLFRGRN SGSDDLDGYG ISFVRFRQRR TWLNDRWYWP
NDVPSSLVPG YNATPNPDVG GPLFGDNEDL NQIVDEGWWI FYEESRYSWP AIMLWERRNG
QFRWLAYKKL GSSSGIVTYS KNGPTYRLDA WPTLLVRLVE GQELQFTNGG GSDGAGGILR
INYGDEIITQ TGAKARVIGQ PIVESGDWTS GTASGRLVLT NVDTGATGNF SNGQTLKVNN
VSHATIGTGG LGAKTNFIRV YYSDAERNGT GDANPCTPEA PGGNGGFSSA DRRSNPRLGD
NDKLRWIPDD YEQWKAATDY FSLVEWDVVN TSGGTNSGNA TLVGSGVAGV SALNEYINGS
QRTRTIVRST NLLSPTYDPD NPVYSPSEGI SIVTSGPTGT NFYFDDFGLQ LDLRGGKGFL
PPIQQ