Gene Dvul_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1904 
Symbol 
ID4664032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2219199 
End bp2220305 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content65% 
IMG OID639820145 
Productthiamine biosynthesis protein 
Protein accessionYP_967347 
Protein GI120602947 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0482] Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.661194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACGC AACACTACCA TGCCGTAGCG CTGCTCTCGG GTGGGCTGGA CAGCATCCTC 
GCCGTCAAGC TCGTCGAAGA GCAGGGACTG CGCGTCAAAT GCCTGCACTT CGTCTCTCCC
TTCTTCGGCA AGCCTTCACA GGTGCGCCGC TGGCGTTCGA TATATGCCCT GGACATCACC
ACGGTCGACG TGAGCGACGA TTTCGCCCGT ATGCTCGCCG AACGCCCGCA GCACGGGTTC
GGCAAGGTCA TGAACCCGTG CGTCGACTGC AAGATTCTCA TGCTGCGCCG TGCGCGTGAA
CTGATGACCG AATACGGTGC CACGTTCATC ATCACGGGCG AGGTGCTCGG ACAGCGCCCC
ATGTCGCAGC GGCGTGACAC GCTCAACGTC ATCAGGCGCG ATGCCGAGGT GCGCGACCTG
TTGCTGCGCC CCCTCAGCGC AAAGCTGCTC GACCCCACCC CCTTCGAGCT TTCCGGGATG
GTCGACCGCG AACGTCTGCT TGCCATCTCC GGGCGGGGAC GCAAGGAACA GATGGCACTC
GCCGAGCGTT TCGCACTTGA GGAGATTCCG ACCCCCGCAG GTGGCTGCAA GCTCGCCGAA
CGCGAGAATG CCCGCCGCTA CTGGCCTGTG CTCGTTCATG CACCGGTCGT CACCGCCGCC
GAGTTCCGGC TTTCGAACAT CGGAAGGCAA TACTGGCAGG GGGCGCACTG GCTCTCCATC
GGGCGCCATC AGAAGGACAA CGAGGCGCTG GAGCGCTTCG CCTTTCCGGG CGACCTGCGT
TTCAAGGTCG TGGGGTACCC GGGCCCCTTG GCCGTGGGAC GTCAGTTTGA CGGACAACCG
TGGTCGGAAG AGGTCGTGTG CGACGCCGCG TCGTTCGTGG CGTCGTTCTC ACCCAAGGCC
GTGCGGGACG GCATCGCTGT CGCAGTTCGC GTGACGTGCG GCGAGACGGT GCGCGAGGTG
CAGGTGATGC CCGCCCGAGC AACGGTTCTC GGCTGGGCCG AGGATGAGTG GCCTGTGGTA
CGGGAGGCCA TCAGGGCCGA TGCGCGAGCG CGCGCCCTGC CCGTACATGC GACCCCGGAC
GATGGTCGCG ACGGCGAGGC GGAATAG
 
Protein sequence
METQHYHAVA LLSGGLDSIL AVKLVEEQGL RVKCLHFVSP FFGKPSQVRR WRSIYALDIT 
TVDVSDDFAR MLAERPQHGF GKVMNPCVDC KILMLRRARE LMTEYGATFI ITGEVLGQRP
MSQRRDTLNV IRRDAEVRDL LLRPLSAKLL DPTPFELSGM VDRERLLAIS GRGRKEQMAL
AERFALEEIP TPAGGCKLAE RENARRYWPV LVHAPVVTAA EFRLSNIGRQ YWQGAHWLSI
GRHQKDNEAL ERFAFPGDLR FKVVGYPGPL AVGRQFDGQP WSEEVVCDAA SFVASFSPKA
VRDGIAVAVR VTCGETVREV QVMPARATVL GWAEDEWPVV REAIRADARA RALPVHATPD
DGRDGEAE