Gene Information |
Locus tag | Dvul_0206 |
Symbol | |
ID | 4662172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 251215 |
End bp | 254196 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639818402 |
Product | phosphoribosylformylglycinamidine synthase |
Protein accession | YP_965657 |
Protein GI | 120601257 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain |
TIGRFAM ID | [TIGR01736] phosphoribosylformylglycinamidine synthase II |
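
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above (Dvul_0206).
length_bp = gene_length_bp(251215, 254196)
print(length_bp)                     # 2982
print(protein_length_aa(length_bp))  # 993
```

Both results agree with the Gene Length (2982 bp) and Protein Length (993 aa) fields. The strand does not affect this check, since the length depends only on the interval.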
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACGGC GTATTGAAGT TGGACTGCGG CCGCAGGTGA CCGACACCGT GGGCCGCAAG GTCGCTGCCA AGATTCACGA ATCCCTCGGC CTCGACGCAG GCGATGTGCG GCTGGTCAAG GTATTCACCA TCGACGGCCT TGATGCGGCG CAACTCGAGA CCGTCGTGCG TGAGGCCGTC CTGTTCGACC CCGTGTTGCA GCACGCCTCG CTCGACCCGC TGGACAGCGA TGCCGACTGG GTGCTCGAAG TGGGCTTCCG CCCCGGCGTC ACCGACAACG AGGGACGCAC CGCCCGCGAC ACGCTCGCAC TCGTGCTCGG CATCGCCGAC CGCCGCTCCA TCGCCGTGTA CACGGCCAAC CAGTACCACC TGCATTGCGG TCTCGACCGC GCCGCCGTCG AGCGCATCGC GCGCGACCTG CTCGCCAACG AACTCATCCA GCGCTACGCG CTCAAGAGCC GCGAAGAATG GGCGGCAAGC CCCGGCTTCC CCGCGCAGGC GGCACAGGTC ACGGGCGCAC GCAACGACGA GGTGGCCGTC ATCCCCCTGC TTTCCATGAG CGACGATGAA CTCATGGCCT TCAGCCGCGC CAACACCCTC GCCCTCAGCC TCGAAGAGAT GCACGCCATC CGCGCCTACT ACCAGCGCGA CGACGTGCGC GCCGCCCGTG CCGCCGAAGG GTTGCCCGCC GACCCCACCG ACGCCGAAGT CGAGGCACTG GCGCAGACAT GGTCGGAGCA CTGCAAGCAC AAGATATTCT CTTCGCGCAT CGACTACGAG AACCGCGAGA CCGGACGCCG CGAGACCATC GACAGTCTCT TCAAGTCCTG CATCCAGGAC ACCACGAAGA CCATCCGCGC CCGTCTCGGC GACAAGGACT TCTGCCGTTC GGTGTTCAAG GACAACGCTG GCGTCATCGC CTTCAACGAC ACGCATGACA TCTGCATCAA GGTCGAGACG CACAACAGCC CCTCGGCCCT CGACCCCTAC GGCGGTGCGC TCACCGGCAT CGTGGGTGTC AACCGCGACC CCATGGGCAC GGGCATGGGT GCCAACCTCG TCTGCAACAC CGACGTGTTC TGCTTCGCCT CGCCCTTCTG GGAAGGTGAA CTGCCCCCCC GCCTGCTGCA CCCCCGCCGC GTGCTTGAAG GCGTCCGCGA AGGCGTGGAA CATGGCGGCA ACAAGTCGGG CATTCCCACC GTCAACGGTT CCATCGTGTT CGAGGACCGC TACCTCGGCA AGCCGCTTGT CTATTGCGGC ACGGTGGGCA TGATTCCCGC GCAGGTCGCA GGCAAGCCCG GCTACACCAA GGAGGCGCGT CCCGGCGACG CCATCGTCAT GGTGGGCGGC CGCATCGGCA AGGACGGCAT CCACGGCGCG ACCTTCTCCT CCGAAGAACT GCATGAGGGC TCCCCCGCCA CGGCAGTGCA GATTGGCGAC CCCATCACCC AGCGCAAGAT GTACGACTGC ATCATGCGCG CCCGTGACAT GGGCCTCTAC ACCGCCATCA CCGACAACGG TGCGGGCGGC CTCTCGTCGT CCGTGGGCGA GATGGCGCAG GACACGGGCG GCTGCCGCCT CGACCTCGCC CGCGCGCCCC TGAAGTACGA CGGTCTGCGC CCGTGGGAGA TTCTGCTCTC CGAAGCGCAG GAGCGCATGA CCCTCGCCGT GCCGCAGGAC AAGCTCGAAG CCTTCATGCG GCTCGCCTCC GAGATGGACG TGGAGGCTAC GGTACTCGGC GAATTCACCG ACTCCGGCTA CTTCCACATC ACCTTCGGTG ACAGGCAGGT GGCCTATCTC 
GACATGGATT TCCTGCACGA CGGCGTGCCG CAGTTGCAGC TGAAGGCCGT GTGGGAACGC CCCGCCCACC CCGAAGGGCG CATCGACCTG CCCGAAGAGG AACAGGGCGC GTTCCTGCGG CGCATGATGG GTTCGCTCAA CATCTGCAGC AAAGAATATG TCATCCGCCA GTACGACCAT GAGGTGAAGG GCGGCAGCGT GGTGAAGCCC CTCGTGGGCG TGAAACGCGA TGGCCCCGCC GACGCAGCCG TGGTGCGCCC CCTGCTGGAC AGCGAGTCGG GCATCGTGCT CTCGCACGGC ATCTGCCCCA AGTTCAGCGA CTACGACGCC TACTGGATGA TGGCCAACGC CATCGACGAG GCCGTGCGCA ACGCGGTTGC CGTAGGCGGC GACCCCGACT TCATGTCCGG CGTCGACAAC TTCTGCTGGT GCGACCCCGT GCAGTCCGAT AAGACCCCCG ACGGGCACTA CAAGCTGGCG CAACTGGTGC GCGCCAACCG CGCCCTAGAG CACTTCTGTC TCGCCTACGG TGTGCCCTGC GTCTCGGGCA AGGACTCCAT GAAGAACGAC TACACCGGCG GCGGCACCAA GATATCCATA CCGCCCACGG TGCTGTTCTC GGTCATGGGC GTCATCGACG ACGTGAACCG CACCGTAACC TCCGACTTCA AGCGCGCAGG CGAGCGCATC TACCTGCTTG GCCTCACCCG CCGCGAGATG GCCGGCAGCG AAGCGGCACA GGTTCTTGGC ATCTCCTGCG CCGACGTGCC GCAGGTCGAC GCCCCCGCCG CCCTGGCCCG GTATCGTGCC CTTTACGGTG CCATCCGCGC CGGGCTTGTC ACCGCCTGCC ACGACCTCTC CGACGGCGGT CTTGCCGTGG CCCTTGCCGA GATGTGCCTC GGCGGGCGTC TCGGGGCCCG ATGCGACCTC GCCCGCGTGC CTGTCTGCGG CGACATGACA ACCACCGAAC TCCTGTACAG CGAGTCGGCC AGCCGCCTGC TGGTCTCCGT GCGTCCTGCC GACGCCGACG CCTTCGAAGC GGCCTTCGCC GGTCAGCATT ACGCCTGCGT CGGCGAAGTG ACAGCCGACG GCAGACTGAC CCTCGAGACC AAGGGCACCG CCATCGTCTC TGAAGAGGTG GAGGCACTCG CCACCGCCTT CAAGGCGACC CTCGACTGGT AG
|
Protein sequence | MLRRIEVGLR PQVTDTVGRK VAAKIHESLG LDAGDVRLVK VFTIDGLDAA QLETVVREAV LFDPVLQHAS LDPLDSDADW VLEVGFRPGV TDNEGRTARD TLALVLGIAD RRSIAVYTAN QYHLHCGLDR AAVERIARDL LANELIQRYA LKSREEWAAS PGFPAQAAQV TGARNDEVAV IPLLSMSDDE LMAFSRANTL ALSLEEMHAI RAYYQRDDVR AARAAEGLPA DPTDAEVEAL AQTWSEHCKH KIFSSRIDYE NRETGRRETI DSLFKSCIQD TTKTIRARLG DKDFCRSVFK DNAGVIAFND THDICIKVET HNSPSALDPY GGALTGIVGV NRDPMGTGMG ANLVCNTDVF CFASPFWEGE LPPRLLHPRR VLEGVREGVE HGGNKSGIPT VNGSIVFEDR YLGKPLVYCG TVGMIPAQVA GKPGYTKEAR PGDAIVMVGG RIGKDGIHGA TFSSEELHEG SPATAVQIGD PITQRKMYDC IMRARDMGLY TAITDNGAGG LSSSVGEMAQ DTGGCRLDLA RAPLKYDGLR PWEILLSEAQ ERMTLAVPQD KLEAFMRLAS EMDVEATVLG EFTDSGYFHI TFGDRQVAYL DMDFLHDGVP QLQLKAVWER PAHPEGRIDL PEEEQGAFLR RMMGSLNICS KEYVIRQYDH EVKGGSVVKP LVGVKRDGPA DAAVVRPLLD SESGIVLSHG ICPKFSDYDA YWMMANAIDE AVRNAVAVGG DPDFMSGVDN FCWCDPVQSD KTPDGHYKLA QLVRANRALE HFCLAYGVPC VSGKDSMKND YTGGGTKISI PPTVLFSVMG VIDDVNRTVT SDFKRAGERI YLLGLTRREM AGSEAAQVLG ISCADVPQVD APAALARYRA LYGAIRAGLV TACHDLSDGG LAVALAEMCL GGRLGARCDL ARVPVCGDMT TTELLYSESA SRLLVSVRPA DADAFEAAFA GQHYACVGEV TADGRLTLET KGTAIVSEEV EALATAFKAT LDW