Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0551 |
Symbol | |
ID | 4664127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 692758 |
End bp | 693876 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639818761 |
Product | radical SAM domain-containing protein |
Protein accession | YP_966001 |
Protein GI | 120601601 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0260021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGATA CGGACCACTA CGCCCGCCTC GGACTGGGCG AGATACGCGA CAAGGTCATG GCGGGCCACC GCCTGTCATA CGACGACGGG CTTCGCCTGT TCGCCTGCCC AGACATCACC GCCATCGGCG CACTGGCGCA CCATGTGCGC ACACGCCTGC ATGGCGACAG GACGTATTAC GTCGTCAACA GGCAGGTCAA CTACACCAAC ATCTGCGTCA ACGGTTGCCT CTTCTGCGCC TTCCAGCGCG AACGCGGGCA GCAGGGGGCC TTTGCCCTCA CCCCTGACGA GATGGTGGCC CGCGTCATGG ACGACGGCGG TGTACCCTTC AGCGAGGTGC ACATCGTGGG CGGCTGCCAC CCCGACCTGC CGCTGGAGTG GTTCGAGGAT GTCATCCGCC GCATCCACGC AGCGCGTCCG TCCATCGTGG TGAAGGCGTT CACCGCTGTG GAAATCGCCC ATTTCGCCAA ACTCGCAGGC ATCGACACCG AAGAGGTTCT GCGTCGCCTC AAGGCCGCTG GCCTCTCGCA ACTGCCCGGT GGCGGGGCCG AGATATTCGC GCCCGATGTC CGGCAGCGCA TCTGCCCCCG CAAGATAGAC GCAGACGCAT GGCTGCGTGT GGCGGGCGAG GCGCACCGCC TCGGCATCGG CACCAACTGC ACCATGCTCT TCGGGCACCT CGAAAGCGAG GCCGACAGGG TCGACCATCT GTGCCGTCTG CGTGCCCAGC AGGACGAGAC GGGCGGCTTC ACCTGCTTCA TCCCCCTGCC CTTCCTCACC GAGAACAGCC TCCTCAAGCT TCCACCTGAA CGCGTGGGGC AACATGTGGG TCTCGACAGG CTGCGCACCG TGGCGGTGAG CCGCCTGATG CTCGACAACA TCGCGCACAT CAAGGCCTAC TGGGTGATGA TGGGCGTCAA GCTGGCGCAG GTCGCGCTGC ACTACGGCGC CAACGACCTC GACGGCACCA TCGTCGAAGA GAAGATAGGC CACATGGCAG GTTCCGACGC GGTACAGGCC ATGACCATCG CCCAGCTTGA AGACATGATA CGCCGTTCCG GTTTCACGCC GGTGCGCCGC GACACCCACT TCAACCCCGT CGAGGAGGCT CGGGCATGA
|
Protein sequence | MLDTDHYARL GLGEIRDKVM AGHRLSYDDG LRLFACPDIT AIGALAHHVR TRLHGDRTYY VVNRQVNYTN ICVNGCLFCA FQRERGQQGA FALTPDEMVA RVMDDGGVPF SEVHIVGGCH PDLPLEWFED VIRRIHAARP SIVVKAFTAV EIAHFAKLAG IDTEEVLRRL KAAGLSQLPG GGAEIFAPDV RQRICPRKID ADAWLRVAGE AHRLGIGTNC TMLFGHLESE ADRVDHLCRL RAQQDETGGF TCFIPLPFLT ENSLLKLPPE RVGQHVGLDR LRTVAVSRLM LDNIAHIKAY WVMMGVKLAQ VALHYGANDL DGTIVEEKIG HMAGSDAVQA MTIAQLEDMI RRSGFTPVRR DTHFNPVEEA RA
|
| |