Gene Information |
Locus tag | Dvul_3075 |
Symbol | |
ID | 4661952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008741 |
Strand | + |
Start bp | 163499 |
End bp | 166468 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639813995 |
Product | hypothetical protein |
Protein accession | YP_961274 |
Protein GI | 120586929 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
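The coordinates and lengths listed above are consistent with each other, and the check can be sketched in a few lines. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends with a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above (Start bp, End bp).
print(cds_lengths(163499, 166468))  # (2970, 989)
```

The result matches the Gene Length (2970 bp) and Protein Length (989 aa) fields.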
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.697159 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGCCA TCGCCGACAA GCGCCGTACC ATCCGCATCA AGCGTTCCTC GCTCCTGCAG AACAAGCTGG GGGCGCTGGA CAGGCCCGCC ATCAAGATCG CGGAGACGCC CGACGAGTAT ACACGGGCCT TCCGCCTCGT GTACGAAGAG TACATCCGCT CGGGCTATCT CAAGCCGCAT CCATCAAGAA TGTATTACAA TGTCTGGAGC ATTCTGCCCG CCACCTCGGT CTTCATCTTC AAAAGCTACC ACGACGTGCT GTGCACGCTG ACCCACATCC CCGACTCCGG GCTGTTCGGC CTGCCCATGG ACACCCTCTA CAAGCCCGAG GTGGACGCCC TGCGCGCCCA GGGGCGCAAC GTGGTGGAGG TGGGTGCGCT TGCCACGCAG TACAGCCGCC GGTGGACGAA CCTCATGGTC TTTCTCGCCA AGGCCATGTT CCAGTACTCC ATGATGTCGG AGGTGGACGA CATCCTCGTC ACCGTGAACC CGAAGCATGT GAAGTTCTAC ACCGACATCT TCCTGTTCAA GCCCTTCGGC GAGGTGCGCC ACTACGACAC GGTGGACGCC CCGGCGGTGG CCCTGCGCAT CGACCTGCAC GAGGCCATGG ACGAGTTGCA GGAGAAGTAC GGCGACGAGG AGGACTTCGA GACCAACCTC TTCGCCTTCT TCGCCCGCAT GAACAGCGAC GAGACCGAGA CCTCCGAGAA CCCCGTGCGC CGCGACAGGC CACTGGAACC GTACACCGCC TATTATCTGT TGCAGCAGCG CCCCGAACTG CTCGACCATC TGGCCGAACC CCAGCGCGAC CACATCGAGA CCATCTACCA CCGCGCGGTG TTCAACCACT TCTCGACCAA CCCGGTGCAC CCCGAGACCC CCGGCGGCGT ACCGCTGGAC ATGCTCAAAC TCGAGACGCG CGACGCCTAT ACCGACATCG CCTTCAGCCG CAACCTCGGC CTCGTGGACT ACGCGGGACA ACGCAGACTC TTGCGCTCGC GGGTCGCCAT CGCGGGGCTT GGCGGCGTGG GTGGCATCCA TCTGATGACG CTGGCGCGCA CGGGCATCGG CAACTTCAAC CTCGCCGACT TCGACGCCTA TTCGCCGGTG AACCTCAACC GGCAGTACGG GGCCAGCATC GCCAGCTTCG GGCGCGGCAA GCTCGACGTC ATGACCGAAC GCGCCCTCAG CGTGAACCCG TTCCTCGACA TCCGCTCCTT CCCCGAGGGC GTCACCGCCG AGACCATCGA CGCCTTCCTC AAGGATGTCG ACCTGCTGGT GGACGGCATC GACTTCTTCG CCCTCGACAT CCGCCGCAGA CTCTTCAACC GCGCGCTCGA ACTGGGCATT CCGGTCATAA CCGCCGGGCC GCTCGGCTAT TCGTGCGCCC TGCTCGTCTT CATGCCGGGG GGCATGAACT TCGACAGCTA CTTCGGCATC GACGACGACA CGCCCCCCAT GGAGGGCTAC CTGCGCTTCG GCATGGGGCT GGCCCCGCGC CCGGCGCACC TCGGCTACAT GGACAGGCGC TTCGTCAGCC TGCACGACAG GCGGGGGCCA TCGCTGGACA TCGCCTGCCA TCTCTGTGCG GGCATGGCAG CCACCGAAGC CGTGCGCATC CTGCTGCACC GCCGGGGCAT ACGCCCCGTG CCGTACTTCA GGCAGTTCGA CCCGCTCACG GGCCGTCATG TGCGGGGCAG GCTCCGCAAG GGACTGCGCT CGCCGCTGCA ACGGCTGAAG CTCGCCATCG CCCGCCGGTT CTTCATGGGG TCGCCGCGCA CCGGCACGCC GCCCCCGCCA 
GAACCCGACG TGGTGGCACT GCGGCAGGAC ATCCCGCTCG AGACCCTGCA CTACGTGGCC CGCGCCGCCA TGCAGGCACC CTCCGGCGAC AACGTGCAGC CGTGGCGCTT CGTCCCCCAT GCCACGGGGC TGCACATCCA TCTCGACAGG CAGGCCGACG GCTCGTTCTT CGACTACCGC CATGTGGCCT CGCTGCTGGC GTGCGGGGCC GCGGTGCAGA ACGCCGTCTA CGCCTCGGGC AGCGCGGGAC TCGATGCCAC GCTCTCGCTC TTTCCCGACG CCACGAAGCC CGACCGTGTG GCCTCGCTGC ACTGCACGCC GCTGGGCGTG CCCTCGCACG AGATCATGAC CGCCGCCCTG TGGCGCAGGC ACACCAACCG CCGCATGTAC GCCGCGCGCC CCCTGCCCGT GGAGGTGTGC GAGCGCATCG AGCACCTCGT CAGCGAGGAA CAGGACGCCA CCCTCCTGTG GGCGACGGAC GCGGCGCAGC GCAAGACCCT TGCGCGGGCC GTCTACCTCG CCGACAGGGT GCGCGTCGAA CGGCGCGACA TGCATGAACA CCTCATGCGC TTCATCCGCT TCGACGGGCA GGCCGGGCAA CAGGGAGAGG TCGGGCAACA GGGGCACACC GGGCAGGCAG GGCATCACAG CCACGCCGGG AACGGGCCAT ACGGCGACGG ACTGCCGCTG GCGAACCTCG AGGCGGGACT CGCCGGTGAA CTGTACCTGC GTGCCGTGAG GCCGTGGCGC AACATGAGCA TCGCCAACGC CCTCGGCCTC GGGCGCCTCA TGCCCCTGCA CGGGGCGATG GGCGTGCTGC GAAGCGGCGG CATCGGGCTG CTGCTGGCCA ACGGCGAGAC GGAGACGGAC ATCGCCCGCG CGGGCATGGC GTGGCAACGG GCGTGGTGCG CCCTCGAACA CATGGGCTAT GCCATGCAAC CGCTGGCGGC CCTGCCCCTG CTGCACCTGC GCCTCGCCAT GGGCGACCCG CAGACGCTGG ACACCGCACA TCAGCGCCTG CTGGAAGAGG CGTGGACGCT TCTCGAACAG GTGCTGCCCC ACCCCAAGGG ACGACTGCCC GTCATGATGT TCCGTGCCGG TGTCGCCGCA CCCATCCGGC ATGGCACGTT CCGTCGTCCG CTGTCGGACT TCCTGCTGCC TGCCGAGTGA
|
Protein sequence | MLAIADKRRT IRIKRSSLLQ NKLGALDRPA IKIAETPDEY TRAFRLVYEE YIRSGYLKPH PSRMYYNVWS ILPATSVFIF KSYHDVLCTL THIPDSGLFG LPMDTLYKPE VDALRAQGRN VVEVGALATQ YSRRWTNLMV FLAKAMFQYS MMSEVDDILV TVNPKHVKFY TDIFLFKPFG EVRHYDTVDA PAVALRIDLH EAMDELQEKY GDEEDFETNL FAFFARMNSD ETETSENPVR RDRPLEPYTA YYLLQQRPEL LDHLAEPQRD HIETIYHRAV FNHFSTNPVH PETPGGVPLD MLKLETRDAY TDIAFSRNLG LVDYAGQRRL LRSRVAIAGL GGVGGIHLMT LARTGIGNFN LADFDAYSPV NLNRQYGASI ASFGRGKLDV MTERALSVNP FLDIRSFPEG VTAETIDAFL KDVDLLVDGI DFFALDIRRR LFNRALELGI PVITAGPLGY SCALLVFMPG GMNFDSYFGI DDDTPPMEGY LRFGMGLAPR PAHLGYMDRR FVSLHDRRGP SLDIACHLCA GMAATEAVRI LLHRRGIRPV PYFRQFDPLT GRHVRGRLRK GLRSPLQRLK LAIARRFFMG SPRTGTPPPP EPDVVALRQD IPLETLHYVA RAAMQAPSGD NVQPWRFVPH ATGLHIHLDR QADGSFFDYR HVASLLACGA AVQNAVYASG SAGLDATLSL FPDATKPDRV ASLHCTPLGV PSHEIMTAAL WRRHTNRRMY AARPLPVEVC ERIEHLVSEE QDATLLWATD AAQRKTLARA VYLADRVRVE RRDMHEHLMR FIRFDGQAGQ QGEVGQQGHT GQAGHHSHAG NGPYGDGLPL ANLEAGLAGE LYLRAVRPWR NMSIANALGL GRLMPLHGAM GVLRSGGIGL LLANGETETD IARAGMAWQR AWCALEHMGY AMQPLAALPL LHLRLAMGDP QTLDTAHQRL LEEAWTLLEQ VLPHPKGRLP VMMFRAGVAA PIRHGTFRRP LSDFLLPAE
|
| |
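The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below shows one way to strip the spacing and recompute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the calculation assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage with the first two blocks of the gene sequence above.
first_blocks = "ATGCTCGCCA TCGCCGACAA"
print(clean_sequence(first_blocks))
print(gc_percent(first_blocks))
```

Applied to the full gene sequence, the rounded result would be expected to be close to the 67% reported in the record, although the database may compute its value differently.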