Gene Information |
Locus tag | DvMF_0692 |
Symbol | |
ID | 7172579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 837369 |
End bp | 840275 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643539192 |
Product | hypothetical protein |
Protein accession | YP_002435117 |
Protein GI | 218885796 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
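The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; both assumptions are consistent with the values listed in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(837369, 840275)
print(length)                  # 2907
print(protein_length(length))  # 968
```

Under these assumptions the computed values agree with the Gene Length (2907 bp) and Protein Length (968 aa) fields.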
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 0.233309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGCAA TCGCCGACAA AAGACGCACC ATTCGCATCA AGCGGTCTTC CCTGCTCCAG TGCAAACTGG GCTCCCTGGA CAGGCCAGCC ATAAAAATAG CGGAAACGCC CGACGAATAT ACGAGGGCGT TCCGCCTTGT GTATGAAGAG TACCTGCGCT CTGGCTACAC CAGGCCGCAC CCGTCTCTCA TGCACTACAC CATCTGGAGC ATGCTGCCGC AAACTTCCGT TTTCGTCTTC AAAAGCTACA ACGACGTGCT CTGCACGCTA TCGCACATAC CGGACTCGGA CTTGTTCGGG CTTCCCATGG ACACCTTGTA CAAGCCCGAA CTCGACACGC TTCGGGACAA GGGAAGGACC ATTGCCGAAG TAGGATCACT TGCGACGCAG TACACACGCA GATGGACAAA TCTCATGGTC TACCTTGCGA AGGCCATGTT CCAGTACTCC ATCATGTCCA ATTTCGACGA CATCGTCATT ACCGTAAACC CCAAGCATGT GAATTTCTAT ACGCAGATAT TCCTGTTCAA ACCGTTTGGA GAAGTGCGCC ATTACGATTC CGTGAATGCC CCTGCGGTGG CATTGCGCAT AAACCTTTCC GAGACCATGG ATGAACTCAA GGAAAAGTAC GGCAACTCCG ACGACTTCGA CACCAACCTT TTCACGTTCT TCGTCCGCAT GAACAGTGGA GAGGCGGACA CCAAGGACAA CCCCGTCAAG CGGGACCAGC CCCTTGATCC GTACACCGCC TATCATCTGT TGCGCCAGCG CCCGGAACTT CTGGACCAGT TGGCCGAAGA GCAGCGCGAC TTCATCGAAA CCATCTACCA CCGGGCCCTG TTCAACCACT TCTCGACACA CCCCGTCCAC CCCGAAACAC CCAGCGGCGT CCCGCTGGAC ATGCTCAAGC TCGAAACCCG GGACGCGTAC TCCGATGTGG CCTTTTGCAG AAACCTGGGG TTGGTGGACT ACGCCGGACA GCGCAAGCTG CTCGGCTCAC GCGTGGCCAT TGCGGGACTG GGCGGCGTAG GCGGCGTGCA CCTGATGACG CTTGCCCGCA CCGGAATCGG CAATTTCAAT CTGGCCGACT TCGACGCCTA CTCGCCCGTC AACATCAACA GACAGTACGG GGCGAGCATC GCCAGCTTCG GGAGAAACAA GCTCGACGTC ATGACCGAGC GCGCGCTCAG CGTGAATCCG TTCATGGACA TACGCGCCTT TCCCGGAGGC ATTTCCGCAA CCTCGCTCGA CGATTTCCTC AAGGATGTCG ACTTGGTGGT GGACGGCATA GACTTTTTTG CGCTCGACAT ACGCCGCCAG CTGTTCAACC GCGCCCTTGC GCTGGGCATC CCCGTCATTA CCGCCGCGCC GCTCGGGTTC TCTTGCGCAC TGCTCGTATT CACGCCGGGC GCCATGAGCT TTGACGACTA CTTCGACATC ACGGAACACA CCGAAAAAAT GGAGGGATAT CTGCGCTTCG GCATGGGGCT GGCGCCTCGC CCGGCCCACC TTGGCTACAT GGACAGGCGG TTCGTCAGCC TGCACGATCG CCGGGGCCCC TCGCTGGACA TCGCCTGCCA CATATGCGCG GGCATGGCCG GCACGGAGGC GGTCCGCCTG CTGTTGGGCA AGAAAGGGGT CAGACCGGCG CCGTACTTTC GCCAATTCGA CCCACTCACG GGGCGATTCA CCACCGGAAA GCTGCGCCGG GGCCTGCGGT CCCCCTTGCA ACGGCTGAAG CTTGCCATCG CCCGACGGTT CTTTCTCGAC ACGCCGCGCA CCGGCGCCCT GCGCCCCCCT 
GAGCCGGAAA TGGTGGGGCT GCGCCAGGAC ATCCCGCCCG CAACGCTGGA ATACATCGCG CAGGCGGCCA CCCGGGCGCC CTCAGGCGAC AACGTGCAGC CATGGCGCAT CGCCCTGCAC GAGACCGGCA TACACATCCA CGCGGCAAGA CATGCGGACG ACTCGTTCTT CAATTACCGG CAGGTGGCCA CCCTGCTTGC CTGCGGGGCG GCCGTGCAGA ATGCCGTTTT CGCCGCCGGC AGCGTCGGGC TGGACGCCGA TCTTTCACTG TTCCCGGACG AACAGGACCA CAACCGCGTG GCGTCGCTTC ACTGCACACC CGTCGGGGTG CAAAGCCACG AAATCATGGC CGCGGCCCTC TGGCGGCGGC ATACCAACCG GCGCATGTAT TCGGCCAGCC CCATACCGCC CGCCGTGCGC GACCGGATTG ACCACATCGT GGACGAACAG CAGGATGCGA CACTGGCCTG GGCTGCCGAC CCGGCGCAGC GCAAGGCTCT GGCCAAGGCC GTCTACCTTG CCGACAGGGT GCGCGTGGAA CGGCCCGATC TGCACGAGCA CCTGATGCGC TTCATCCGCT TCGAACCGCA GAAGGGGCCA TACGGCGACG GGCTGCCCCT GGGCAATCTG GAGGCGGGCC CACTGGGCGA GTTGTACCTG CGCAGCCTGC GGCCATGGTC CGCCATGCAT GCGGCAAACC AGGCCGGGAT CGGCAGGCTC ATGCCGCTGC ACGGTGCGCT CAGCGTTCTG CGGAGTGGTG GCGTGGCGCT GTTGCTGGCC AACGGAGAGG CCGAGACGGA CATTGTCCGC GCGGGCATGG CCTGGCAACG CGCATGGTGC GCCCTGGAGC ACATGGGCTA CGCGTTGCAG CCTCTTGCCG CCCTGCCGCT GCTGCACCTG CGCATTCGCC TGGGGGACGC GGAAACACTT TCGCCTTGCC ATGTCTCCCT GCTGGAAAAG GCGTGGCGTC TGCTTGCCGA GGCATTGCCG CATCCTTCGG ACAAACTGCC GGTCATGCTG TTCCGGACCG GCATCGGGCC CGCCATCCGG CACGGCACCT ATCGGCTGGC CCTGTCGGAA ATCCTGCTTC CCGACAGCAG GGCCTGA
|
Protein sequence | MLAIADKRRT IRIKRSSLLQ CKLGSLDRPA IKIAETPDEY TRAFRLVYEE YLRSGYTRPH PSLMHYTIWS MLPQTSVFVF KSYNDVLCTL SHIPDSDLFG LPMDTLYKPE LDTLRDKGRT IAEVGSLATQ YTRRWTNLMV YLAKAMFQYS IMSNFDDIVI TVNPKHVNFY TQIFLFKPFG EVRHYDSVNA PAVALRINLS ETMDELKEKY GNSDDFDTNL FTFFVRMNSG EADTKDNPVK RDQPLDPYTA YHLLRQRPEL LDQLAEEQRD FIETIYHRAL FNHFSTHPVH PETPSGVPLD MLKLETRDAY SDVAFCRNLG LVDYAGQRKL LGSRVAIAGL GGVGGVHLMT LARTGIGNFN LADFDAYSPV NINRQYGASI ASFGRNKLDV MTERALSVNP FMDIRAFPGG ISATSLDDFL KDVDLVVDGI DFFALDIRRQ LFNRALALGI PVITAAPLGF SCALLVFTPG AMSFDDYFDI TEHTEKMEGY LRFGMGLAPR PAHLGYMDRR FVSLHDRRGP SLDIACHICA GMAGTEAVRL LLGKKGVRPA PYFRQFDPLT GRFTTGKLRR GLRSPLQRLK LAIARRFFLD TPRTGALRPP EPEMVGLRQD IPPATLEYIA QAATRAPSGD NVQPWRIALH ETGIHIHAAR HADDSFFNYR QVATLLACGA AVQNAVFAAG SVGLDADLSL FPDEQDHNRV ASLHCTPVGV QSHEIMAAAL WRRHTNRRMY SASPIPPAVR DRIDHIVDEQ QDATLAWAAD PAQRKALAKA VYLADRVRVE RPDLHEHLMR FIRFEPQKGP YGDGLPLGNL EAGPLGELYL RSLRPWSAMH AANQAGIGRL MPLHGALSVL RSGGVALLLA NGEAETDIVR AGMAWQRAWC ALEHMGYALQ PLAALPLLHL RIRLGDAETL SPCHVSLLEK AWRLLAEALP HPSDKLPVML FRTGIGPAIR HGTYRLALSE ILLPDSRA
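The sequence fields can be checked against each other with a short script. This is a sketch under stated assumptions, not the database's own method: it strips the whitespace that groups the sequence into blocks of ten, computes the GC fraction, and translates with the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch ignores). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, base order T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group the sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGCTCGCAA TCGCCGACAA AAGACGCACC"
print(translate(first_blocks))  # MLAIADKRRT
```

Pasting the full gene sequence into `gc_fraction` and `translate` allows comparison with the GC content and Protein sequence fields of this record.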