Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_1829 |
Symbol | |
ID | 4662314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 2134611 |
End bp | 2137331 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639820070 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_967273 |
Protein GI | 120602873 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2844] UTP:GlnB (protein PII) uridylyltransferase |
TIGRFAM ID | [TIGR01693] [Protein-PII] uridylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.570932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAGGA AGAACACTCC GGACATCCCG GGAACCGATT GCAAGAACGA CGCCCGGAAG ACACTGCGTG AAGGCCGGTC CGCCCTAGCT GAAGGGCTGG CACAGGCGTC CTCCGATGCT CTTCCGTTCG AACACGCCTA CAGCCGTCTG CTGGATACGT ATTTCAGGCT GCGGCTTGCA GAGTTGGGCA CGGGGCATGA CCGGGTGGCG CTGGTGGCTG TCGGGGGTTA CGGGAGGCAA GAGCTTTGCC CCGCCTCGGA CATCGACATC CTGGTCCTTT GCAACCGTAG TATTCCACCG CAAGCCATCG ACCTTGCGCA AGCACTCTTC CTGCCCCTGT GGGATGAGGG GTTCACCCTC GGGCATGGTT TCCGTACCGT AAGCGATTGC GTGAAGCTGG CTGCCCACGA CCACAAGGTA TTGGCAAGCC TCCTCGATGC CCGGCTTGTG GCCGGGAGTC CTGCGCTGCT GGAAGACATG GTCAGAAGGC TGGACGAGCG CGTGCTGCCA CGGCGCAGCC ACGCGTTTCT CTCATGGCTT GACACCGAAC ATGAACGCAG GCTGGCGGCC CATGGCGATG GGACGGTGCT GCTTGAACCC AATCTGAAAG AGGGGTTGGG GGGCTTGCGT GATTATCACA GGATACTCTG GCTGGCGCGG ATTCGCGGAA AGGCGGGTAC GGTCGAGGAA CTCATGGCGC AGGCCGGTTT CTCTTCCGGA GACGCCACCC TGTTGCGTCA GGCCGTGACT TTTCTGCATG GGGTGCGCAA CAGGCTGCAT CTGCTCTCTG GTCGCAAGAA CGATACCCTG TTCCTCGACC TGCAACCGGG CATAGCGCAG ACCATGGGCT TCCATGACGA CGGCGGGTTG CTGGGAGTGG AGCGTTTTCT CGGCGCGCTG CATCGCTGCA TGTCGGACAT CAAGGGGCTA TCTGATGCCT TCCGGGGGGT CCCGGGTGGC CCTGTGCTCC CGGTCAGTCC GTGCCCGGAA GTCGATGCGG GAGTCGTCAT CGCCGGGGGG GCCGTGCACC TGTGCCTGCC TGATGGTGTC GAGGCGAGTC CCCGCCTGAC CCTCGAGACG TTCGTCCGTG CAGCCGCATC TCCCTCCGGG CCGACGCCCC GACTCGACTG GAACACGAGG CGACGCATGG CGGCGGCCAT CGCAGGCAGG GCATCTGAAC TGTCCGGTGT GGAAGTGGGG CGTGCCCTCA TCGCCATACT CACCTCGGCC AGGGCGTTCG ACGTGCTGGA ACAGATGGAC GCCGTGGGGT TGCTCCCTGC CTTGTTGCCA GCCTACGGCG CTGCCCGTGA CAGGGTGCAG TTCGATGGCT TTCATTCCTA CCCTGCCGGT ATGCATACCC TTTTCACCAT ACGGCAACTG GAGCAGCTCG CGACGAATGG CCCCGAACCC TTTGCGACCC TATGGCGTGA ATGCCCCGAA GGCGCAGGTG GCGGTGCTGA ACCGGGCAGG CTGTCGGTGT TGCTGGCGGC GTTGTTCCAT GACCTCGGCA AGGGGCTCGA AGATGCTTCG CACCATGAGG AGAGCGGTGC AGGCATCGCC CGTGACGTGC TCTCCGCATG GGGACTTCCT GCACCGGTCG TGGACGATGT GGCGTTTCTG GTCAGGGAAC ATCTGCTTCT CATGCGCACC GCGCAACGCC GCGACCTCAA CGACGAGGCT GTCGTGGGAC AGGTGGCGGG CATTGTCGGT GACCGGGCAC GGCTTGAGCG TCTGCTTCTC CTTTCCTACG CGGATGCCAG CGCGACCGGC CCCAAGGCAT GGAACAGGTG GGCCGCCAGC CTCCTTGACG AACTGCGCGG CAAGACCCTC AACATGTTGC GGGATGTCGA GGCGGGGGGC ATTCATACCG CCGGAAGTGT GGGCGATGTG CTCGCACGTG TACGGACGCT TGCGACGCAC CAGCCGACGA CGCCTCCCCT AGGTGAGGAC ATCGCCGCGG CGTTCTTGGC AGCCATGCCG CCGCGTTATG TGGTGGCCAA TGCCCCTGAA CGGATACTGC GGCATATGGA GATGGCCCGC CAGCTCAATC TCGATGTGGA AGAGGCTCGC AAGCGCCTCG AACCGGGCAG GGCCGAACGG GGGCTGGTGG TCATGGAAGG CCGACCTGTA CACGGGGGGC GTGAAAGCGA CCTGTGGGAG GTCACGATTC TGGCGCGCGA CCAGCAGGGG CTGTTCGCTA CGCTCGCTGG AGTGTTCGCC CTGCATGGCC TGAACGTCTA CGCTGCCGAC GCCTTCGTGT GGCGCGACGG GACAGCGCTC GACGTCTTCC ACGTCACGGC TCCCCCCGAC CCCCTGTATG CCCGTGAATT CTGGGGCAAG GTACGCAGCT CGGTGCAGTA CGCCATGACA GGCAAACTCG CCCTCGACTA TCGCCTTGAA GAGGCACGCG CCAGCCGCAT CATCCCCGAT GCCCTGCGCG AGGCGTTGCG GCGTCCGGCT GAAGTGAGGG TGGACAACGG CCTCTCGGAC TTCTATACCG TCATCGACGT CTTCGCACCG GATCGTCCGG CACTGCTCTA TGACGTGGCG CGCACGCTGC AATCCCTGCA TCTCGACGTG CTGTTCGCCA AGGTATCAAC ACTGGGAAAT CGCACTGCAG ATACCTTTTC CGTGCGGACG GCCCAGGGCC AGAAACTCAC GGATGAGGAA CATCTCGCAG AGGTGCGGGC CGCCCTGCTG CACGCGGTGG CTTCCCGATA G
|
Protein sequence | MPRKNTPDIP GTDCKNDARK TLREGRSALA EGLAQASSDA LPFEHAYSRL LDTYFRLRLA ELGTGHDRVA LVAVGGYGRQ ELCPASDIDI LVLCNRSIPP QAIDLAQALF LPLWDEGFTL GHGFRTVSDC VKLAAHDHKV LASLLDARLV AGSPALLEDM VRRLDERVLP RRSHAFLSWL DTEHERRLAA HGDGTVLLEP NLKEGLGGLR DYHRILWLAR IRGKAGTVEE LMAQAGFSSG DATLLRQAVT FLHGVRNRLH LLSGRKNDTL FLDLQPGIAQ TMGFHDDGGL LGVERFLGAL HRCMSDIKGL SDAFRGVPGG PVLPVSPCPE VDAGVVIAGG AVHLCLPDGV EASPRLTLET FVRAAASPSG PTPRLDWNTR RRMAAAIAGR ASELSGVEVG RALIAILTSA RAFDVLEQMD AVGLLPALLP AYGAARDRVQ FDGFHSYPAG MHTLFTIRQL EQLATNGPEP FATLWRECPE GAGGGAEPGR LSVLLAALFH DLGKGLEDAS HHEESGAGIA RDVLSAWGLP APVVDDVAFL VREHLLLMRT AQRRDLNDEA VVGQVAGIVG DRARLERLLL LSYADASATG PKAWNRWAAS LLDELRGKTL NMLRDVEAGG IHTAGSVGDV LARVRTLATH QPTTPPLGED IAAAFLAAMP PRYVVANAPE RILRHMEMAR QLNLDVEEAR KRLEPGRAER GLVVMEGRPV HGGRESDLWE VTILARDQQG LFATLAGVFA LHGLNVYAAD AFVWRDGTAL DVFHVTAPPD PLYAREFWGK VRSSVQYAMT GKLALDYRLE EARASRIIPD ALREALRRPA EVRVDNGLSD FYTVIDVFAP DRPALLYDVA RTLQSLHLDV LFAKVSTLGN RTADTFSVRT AQGQKLTDEE HLAEVRAALL HAVASR
|
| |