Gene Dvul_0550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0550 
Symbol 
ID4664108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp691634 
End bp692761 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content66% 
IMG OID639818760 
Productradical SAM domain-containing protein 
Protein accessionYP_966000 
Protein GI120601600 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0244585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC ACCCGGTAGC CGGAACCCAG TCACCCTTCA TGGAAGCCGA CGCCGTGCGC 
ACCATCGCCG CACACGTGCT TTCCGGCGGG CGCATCGACC GCGCCGGGGC CGAGACGCTC
TACCATGAGG CATCGCTGCA CACCCTTGCC CATCTGGCCC ACGCCGTGCG GCTGCGGCGT
CACCCTGAAC CGGTGGTGAC CTATGTGGCC GACCGCAACA TCAACTACTC CAACATCTGC
GTGTGCGCCT GCCGCTTCTG TGCCTTCTAC CGCGCCCCCG GCGCGGAAGG CGGCTATGTG
CTCTCCCGCG AGGAACTCGC CCGCAAGATA GACGAGACGC TGGTCCTCGG CGGCACGCAG
ATACTGTTGC AGGGGGGCCA TCACCCCGAC CTGCCGCTGC ACTTCTACGA GGACATGATA
GGCTGGATAC GCGCCACCTA CCCCGCCATC CATATCCATG CCTTCTCGCC GCCCGAAATC
GTCTTCTTTG CCGAGAAGGA GCACCTCACC ATCGGCGAGG TCATCGAACG CCTCCGGGCT
GCGGGGCTCG ACTCCATCCC CGGCGGCGGT GCGGAGATAC TGGTGGACGA GGTGCGCACG
AAGGTCTCGC CCAACAAGTG CTCGGCCGAA CTGTGGCTCG CCGTCATGGA AGAGGCGCAC
TATCAGGGGC TGCGCACCAC GGCGACCATG ATGTTCGGCC ATGAGGAGAC CCACGCCCAC
CGCCTCGACC ACCTCTTCGC CGTGCGCGAT GTGCAGGACC GTACCGGAGG CTTCACCGCC
TTCATCCCGT GGATGTTCCA GCCCGCCAAC ACCGCCATCG ACCGCGACCC CGAACCCGCG
CCCGCCTACC TTCGACTGCT GGCCCTCTCG CGCATCGTGC TCGACAACAT CGACAACATC
CAGGCCTCGT GGGTGACCAT GGGCCCGCAC GTGGCGCAGC TTGCGCTCTT CTACGGCGCC
AACGACTTCG GTTCGCTGAT GATAGAGGAG AACGTCGTGG CCGCAGCCGG TGTGAGCTTC
AGCCTTTCGC GCGGCGAGAT ACACAAGATC ATCCGGGCAG CGGGCTTCAC CCCCGTGCAA
CGCACCATGG ACTACACCCC CGTGGTGCCC CAACCCGTCG AAGCATAG
 
Protein sequence
MSQHPVAGTQ SPFMEADAVR TIAAHVLSGG RIDRAGAETL YHEASLHTLA HLAHAVRLRR 
HPEPVVTYVA DRNINYSNIC VCACRFCAFY RAPGAEGGYV LSREELARKI DETLVLGGTQ
ILLQGGHHPD LPLHFYEDMI GWIRATYPAI HIHAFSPPEI VFFAEKEHLT IGEVIERLRA
AGLDSIPGGG AEILVDEVRT KVSPNKCSAE LWLAVMEEAH YQGLRTTATM MFGHEETHAH
RLDHLFAVRD VQDRTGGFTA FIPWMFQPAN TAIDRDPEPA PAYLRLLALS RIVLDNIDNI
QASWVTMGPH VAQLALFYGA NDFGSLMIEE NVVAAAGVSF SLSRGEIHKI IRAAGFTPVQ
RTMDYTPVVP QPVEA