Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_1798 |
Symbol | |
ID | 4662671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 2099830 |
End bp | 2103267 |
Gene Length | 3438 bp |
Protein Length | 1145 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639820039 |
Product | hypothetical protein |
Protein accession | YP_967242 |
Protein GI | 120602842 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.314403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCCA AGGGCTTCAC ACTCATAGAG ACCGTGGCGG TACTGCTCAT CGTCGGAGTC ATCGCTGCCG TGGCAGGTGT GGGCATCGTC AGCGGCGTGC GCGGCTTCGT GCAGGCACGC GAAGCCGGTG CGATGGCCAT GGAGGCACAA CTTGCGCTCG ACCGCATCAC CCGCGAAGTG ATTGAATTGG TCGCCGTCCC CGCAGACAGC AGCGCCACCC GTCTTGTGGT GCGTAACGTG GGGGCGGGCG GCGCGGGGCA ATTCGACAGG TCGATAGAAT ATGTTGCCAA CGCCCGCGAG ATTCGCATCG CCAACGGCGC ACAGGGCGCA CAGAATGGTG ACACGCTCAT CGACAACGTC ACGGCCTTCA GCCTCAACTA TTGGCAGGAA GATGTCTCCA CGGCCACATG GTCGGCTGGT ACAGACCCGC GACTCCTCTC CGGCGTGGAC GTGACCTTCA CCCTCACCGC GCCCGGCGGC ACCACACGCA CCTTCAGCAA CCGCATCGTA CCGCGCAACA ACGAGAATCG CGGCGGCGCA CTGCCAGAAG CCGTGCCGCC TTCTCTGGAA CGCTTCGACA TCTGCTTCGT GGCTACCGCT GCCAGCGGTG ACACAAGCCA CCCGGTGGTG GTGCAGCTAC GAGAATTCCG TGACAGGCTT CTGCTCACAT GGGGCGGGGG GCGCATGCTG GTACGCGCCT ATTACACCGT GGGGCCGCGA CTCGCCGACA TGCTGCAGGG GCATGATACG GTACGGGCCG CTGTCATGGG CATACTGACG CCCCTTGCCG CGCTGTGCGG CATGGCCGTG CACGCCCCGA TAGCCCTTGG TACCCTGTTC ATGCTGTCTC TCGTGCTTGG GCACATGCTG GCGGCGGCAT TGCACGGCGG CACCCTGCAC AGAGAGACCA CGCCGTTACA GACGGCTGCC GCCGACAACA ACAGCCCCAC CACCGCAAGG CATCTCTCCG GCAACGACAC CGGCAAGTCT GACGGCACCT GTATTGCGCG CCACGACCCG ACCGCCGCTG ACCATGGTCG CAACGGCGGG CGTACGCGCG GTGCTGTGCT CATCAGCGTC ATCGTCACCA TCGTGGCCTT TTCGGTCATC GGCGCAGCCA TGGTGCCCAT GATGACGGCT TCGACGCAGA ACAGCTATTT CGCCGTGCAG GGCGACCAGG CCTATTATCT CGCGGAGTCG GGCTTCGGCG CTGCGGGCAG CATGTTCCTT GCCGCCGGAG ACGAACAGGC CCGCAAGAAC CTTCTCCAGA CCATGGACGG TTCGACCTAT ACCTTCGCCA ACAATGCAGG GGCATTCCGG CTTGGGATAG AGCCCTACTG GTTCGAGGCG ACCAGCAACA CCGGAACGCG GCTGGTCACG CGCGTATACG GCACCGCCCC CACACTGCCC ACCAACACCA CGGGGCGGCT ACGCATCGGC AACGCCGCAG TCTTCTACAC CTACAGCAAC ATTCAGGCAT CGGGCAACAC GGTCACCTTC ACCCTGACGC AGACGCCTTC CCCCCCTATC GATACGGGTG CGGACATCTT CCTGAGTGCC ACGCCCACAA GCACATCCGT GGGCAATAAC GGAGACCTGA CGCTCAACTC GGCACATGCC GCAGCCTTTC CCAACGTGAA CGGCATGTTC ACCGTCCGCG GCGGGCCCGT CGCCAACAAT GGACGCATCG CCTACGTCTA CCGGCGCAAG AACGGAAACA CCCTTGAAGA CGTGCGCCTC GTCGAAGGTC AGGGCGTGAC GTGGTCCGAC ATCTCGCTGT CTTCCGCCTC TGACGTGACC CTTGAATCGT ACCTGCGGTT GCACTCCACG GGCATCCCCG CAGGGGGCAT GGCGCGCGAG GTCATCTACA ACGTGCCCAT CGGCTGGATT CTTGGGGGGG GCAGTTTCAG TAAGGAACAG TATCAGGACA ATTTCGCAAG CCTCGCCAAC TGGTTCACGG GCGATTCCAG TCAAGGCCAT CTGGGGACGC ACGCCCTGTC CGGGGGGGCG CTACGCGTTA CCGGGATGCA GACGGCTGCC ACCTCCGGCT TTGGCGGTTT TCTGTCATGG CTTTCGGGTG ATAACCAATG GAGTTCGCTG TTCTTCAACT GGGGCAGGAC GAACGTCAAC CTCGCACGGG GATGGGCCGA CACTTACGGC AACTCAAGTT ACGACCTCCA GTTCAAGGTC AATGTCAATC AGGATTCGGG CAAGGCTTTC TTCGGTGGCC TCCTCTTCAG AGGACGTAAC AGCGGTTCAG ACGACCTTGA CGGCTACGGG ATATCGTTCG TCCGTTTTCG ACAGAGGCGT ACATGGCTCA ATGACAGATG GTACTGGCCC AACGACGTTC CCAGTTCACT CGTACCGGGC TACAACGCCA CGCCCAACCC CGATGTGGGC GGCCCCCTTT TCGGCGATAA TGAAGACCTG AACCAGATTG TCGATGAGGG ATGGTGGATT TTCTATGAAG AGAGCCGCTA TTCGTGGCCT GCCATCATGC TCTGGGAACG CAGGAATGGC CAATTTCGCT GGCTTGCCTA CAAGAAGCTT GGAAGTTCAT CGGGTATTGT CACCTATAGT AAGAATGGCC CCACATACAG ACTCGACGCG TGGCCGACCC TGCTGGTGCG CCTTGTCGAA GGACAGGAAT TGCAGTTCAC CAACGGCGGC GGGTCTGACG GGGCAGGCGG TATACTGCGT ATCAACTACG GCGACGAAAT CATCACCCAG ACAGGCGCCA AGGCACGGGT CATCGGTCAG CCCATCGTCG AAAGCGGCGA CTGGACATCC GGCACGGCTT CAGGGCGGCT AGTGCTGACC AACGTTGATA CAGGGGCGAC GGGGAACTTC AGCAACGGCC AGACCCTGAA AGTCAACAAC GTGAGTCACG CGACCATAGG CACCGGCGGA CTGGGGGCGA AGACCAACTT CATTCGCGTC TACTACTCAG ACGCCGAACG CAACGGCACG GGAGACGCGA ACCCCTGCAC TCCAGAAGCC CCCGGCGGCA ACGGAGGCTT CTCGTCCGCC GACAGGCGCT CTAACCCGCG CCTTGGCGAC AATGACAAAC TGCGCTGGAT ACCCGACGAC TACGAGCAGT GGAAAGCTGC CACCGACTAT TTCTCTCTGG TAGAATGGGA TGTCGTGAAC ACGTCAGGTG GGACGAACTC CGGGAACGCG ACGCTGGTGG GTTCCGGGGT GGCGGGCGTT TCAGCCCTTA ACGAGTACAT CAACGGTTCG CAACGCACAC GAACGATTGT CCGGAGTACC AACCTGCTTT CACCGACCTA TGACCCTGAC AACCCAGTCT ACAGCCCGTC AGAAGGTATC TCCATCGTCA CCAGCGGACC GACGGGAACC AACTTCTATT TCGACGACTT CGGGCTGCAA CTCGACCTGC GTGGCGGCAA GGGATTTCTG CCCCCCATAC AGCAGTGA
|
Protein sequence | MKAKGFTLIE TVAVLLIVGV IAAVAGVGIV SGVRGFVQAR EAGAMAMEAQ LALDRITREV IELVAVPADS SATRLVVRNV GAGGAGQFDR SIEYVANARE IRIANGAQGA QNGDTLIDNV TAFSLNYWQE DVSTATWSAG TDPRLLSGVD VTFTLTAPGG TTRTFSNRIV PRNNENRGGA LPEAVPPSLE RFDICFVATA ASGDTSHPVV VQLREFRDRL LLTWGGGRML VRAYYTVGPR LADMLQGHDT VRAAVMGILT PLAALCGMAV HAPIALGTLF MLSLVLGHML AAALHGGTLH RETTPLQTAA ADNNSPTTAR HLSGNDTGKS DGTCIARHDP TAADHGRNGG RTRGAVLISV IVTIVAFSVI GAAMVPMMTA STQNSYFAVQ GDQAYYLAES GFGAAGSMFL AAGDEQARKN LLQTMDGSTY TFANNAGAFR LGIEPYWFEA TSNTGTRLVT RVYGTAPTLP TNTTGRLRIG NAAVFYTYSN IQASGNTVTF TLTQTPSPPI DTGADIFLSA TPTSTSVGNN GDLTLNSAHA AAFPNVNGMF TVRGGPVANN GRIAYVYRRK NGNTLEDVRL VEGQGVTWSD ISLSSASDVT LESYLRLHST GIPAGGMARE VIYNVPIGWI LGGGSFSKEQ YQDNFASLAN WFTGDSSQGH LGTHALSGGA LRVTGMQTAA TSGFGGFLSW LSGDNQWSSL FFNWGRTNVN LARGWADTYG NSSYDLQFKV NVNQDSGKAF FGGLLFRGRN SGSDDLDGYG ISFVRFRQRR TWLNDRWYWP NDVPSSLVPG YNATPNPDVG GPLFGDNEDL NQIVDEGWWI FYEESRYSWP AIMLWERRNG QFRWLAYKKL GSSSGIVTYS KNGPTYRLDA WPTLLVRLVE GQELQFTNGG GSDGAGGILR INYGDEIITQ TGAKARVIGQ PIVESGDWTS GTASGRLVLT NVDTGATGNF SNGQTLKVNN VSHATIGTGG LGAKTNFIRV YYSDAERNGT GDANPCTPEA PGGNGGFSSA DRRSNPRLGD NDKLRWIPDD YEQWKAATDY FSLVEWDVVN TSGGTNSGNA TLVGSGVAGV SALNEYINGS QRTRTIVRST NLLSPTYDPD NPVYSPSEGI SIVTSGPTGT NFYFDDFGLQ LDLRGGKGFL PPIQQ
|
| |