Gene Information |
Locus tag | Dvul_0550 |
Symbol | |
ID | 4664108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 691634 |
End bp | 692761 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639818760 |
Product | radical SAM domain-containing protein |
Protein accession | YP_966000 |
Protein GI | 120601600 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
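The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(691634, 692761)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Strand does not affect these values; it only determines which end of the interval holds the start codon.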
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0244585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCAGC ACCCGGTAGC CGGAACCCAG TCACCCTTCA TGGAAGCCGA CGCCGTGCGC ACCATCGCCG CACACGTGCT TTCCGGCGGG CGCATCGACC GCGCCGGGGC CGAGACGCTC TACCATGAGG CATCGCTGCA CACCCTTGCC CATCTGGCCC ACGCCGTGCG GCTGCGGCGT CACCCTGAAC CGGTGGTGAC CTATGTGGCC GACCGCAACA TCAACTACTC CAACATCTGC GTGTGCGCCT GCCGCTTCTG TGCCTTCTAC CGCGCCCCCG GCGCGGAAGG CGGCTATGTG CTCTCCCGCG AGGAACTCGC CCGCAAGATA GACGAGACGC TGGTCCTCGG CGGCACGCAG ATACTGTTGC AGGGGGGCCA TCACCCCGAC CTGCCGCTGC ACTTCTACGA GGACATGATA GGCTGGATAC GCGCCACCTA CCCCGCCATC CATATCCATG CCTTCTCGCC GCCCGAAATC GTCTTCTTTG CCGAGAAGGA GCACCTCACC ATCGGCGAGG TCATCGAACG CCTCCGGGCT GCGGGGCTCG ACTCCATCCC CGGCGGCGGT GCGGAGATAC TGGTGGACGA GGTGCGCACG AAGGTCTCGC CCAACAAGTG CTCGGCCGAA CTGTGGCTCG CCGTCATGGA AGAGGCGCAC TATCAGGGGC TGCGCACCAC GGCGACCATG ATGTTCGGCC ATGAGGAGAC CCACGCCCAC CGCCTCGACC ACCTCTTCGC CGTGCGCGAT GTGCAGGACC GTACCGGAGG CTTCACCGCC TTCATCCCGT GGATGTTCCA GCCCGCCAAC ACCGCCATCG ACCGCGACCC CGAACCCGCG CCCGCCTACC TTCGACTGCT GGCCCTCTCG CGCATCGTGC TCGACAACAT CGACAACATC CAGGCCTCGT GGGTGACCAT GGGCCCGCAC GTGGCGCAGC TTGCGCTCTT CTACGGCGCC AACGACTTCG GTTCGCTGAT GATAGAGGAG AACGTCGTGG CCGCAGCCGG TGTGAGCTTC AGCCTTTCGC GCGGCGAGAT ACACAAGATC ATCCGGGCAG CGGGCTTCAC CCCCGTGCAA CGCACCATGG ACTACACCCC CGTGGTGCCC CAACCCGTCG AAGCATAG
Protein sequence | MSQHPVAGTQ SPFMEADAVR TIAAHVLSGG RIDRAGAETL YHEASLHTLA HLAHAVRLRR HPEPVVTYVA DRNINYSNIC VCACRFCAFY RAPGAEGGYV LSREELARKI DETLVLGGTQ ILLQGGHHPD LPLHFYEDMI GWIRATYPAI HIHAFSPPEI VFFAEKEHLT IGEVIERLRA AGLDSIPGGG AEILVDEVRT KVSPNKCSAE LWLAVMEEAH YQGLRTTATM MFGHEETHAH RLDHLFAVRD VQDRTGGFTA FIPWMFQPAN TAIDRDPEPA PAYLRLLALS RIVLDNIDNI QASWVTMGPH VAQLALFYGA NDFGSLMIEE NVVAAAGVSF SLSRGEIHKI IRAAGFTPVQ RTMDYTPVVP QPVEA
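The gene and protein sequences above are given in space-separated blocks of ten characters. As a minimal sketch, the following shows how such a string could be cleaned, translated, and checked for GC content using only the Python standard library. The helper names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as starts; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

# Amino acid assignments in TCAG codon order (standard code, shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGCCAGC ACCCGGTAGC CGGAACCCAG"))  # MSQHPVAGTQ
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the reported GC content.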