Gene DvMF_1950 details

Gene Information

Locus tag: DvMF_1950
Symbol:
ID: 7173868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 2411162
End bp: 2412178
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 72%
IMG OID: 643540466
Product: transcriptional regulator, AraC family
Protein accession: YP_002436361
Protein GI: 218887040
COG category: [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG0350] Methylated DNA-protein cysteine methyltransferase
        [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID: [TIGR00589] O-6-methylguanine DNA methyltransferase
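
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2411162
END_BP = 2412178
GENE_LENGTH_BP = 1017
PROTEIN_LENGTH_AA = 338


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1017
print(expected_protein_length(GENE_LENGTH_BP))  # 338
```

Under these assumptions both results agree with the reported Gene Length and Protein Length.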


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 109
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGGGC GGGAAGGCAA AGGAATTTTT TTGCTCGGCG GGGAGCAGCG CAAGATTCGG 
ATGGAAAAGC GCACGCGGCG GGCCTATGTT GCCGTCATGG ACACGCCCAC CCGCCATATA
GCCGCCGATC CGGTGACTGC CGCATCTGGA ACGACCACTG CGCCGGGCAC GGCCCTTCGG
CCCGACGACC CGCGTCTGGC GTTGGTCCGC GCCGCCTGCG ACACCCTGCG CCAGCGTGCC
GAACCGGTGC CCCTGGCGGA ACTGGCCGAG GCGGCGGGCC TGAGCCCCAG CCATTTTCAA
CGTGTGTTCA CCCGATTGGT GGGCGTTTCG CCGCGCGCCT ACCAGCAGGC CTGCCGCGAA
CAGCGCCTGC GCGGCGCGCT GGAGCGGGGC GTGCCCGTGG CCGAGGCCAT CTACGAGGCC
GGGTTCGGTT CACCGAGCCG GGTCTACGAG GATGCGCACG GCATGCTGGG CATGACCCCG
GCCCGCTACC GCAAGGGTGC GCCCGGCAGG CAACTGGCGG TGGCCGCCGC GCAAACGTCG
CTGGGCTGGC TGGTCATGGC CGCCACGGAG GACGGGGTGT GCGCCATCGA CATCGGCGAC
GACCGCGAGG CCCTGCTTGC CGACCTGCAA CGCCGTTTTC CCGGCGCGGA ACTGCACACC
CCCAGCGATG CGGTGCGGCA GTGGCTGGGC ACGGTGGTGG CCTTCGTCGA GCACGGCGGC
GCACACCCCG CGCTGCCGCT GGACGTGCGC GGCACCGCCT TCCAGCATGC GGTATGGAGC
GCCCTGCGCG AACTGCCACC CGGCACTACC CTGGGCTATG CCCAGCTTGC CGCCCGCATC
GGCAGGCCCA GCGCCGTGCG CGCCGTGGCC GCCGCCTGCG CCCGCAACCC CGTGGCCGTG
GTGGTGCCCT GCCACCGCGT GCTGGGCCGT GACGGCGCAC TCACCGGCTA CCGCTGGGGG
GTGGATCGAA AGGCCGAACT GCTGCGCCGC GAAGCCGCCC GAATGCCCAT CGGGTAA
 
Protein sequence
MRGREGKGIF LLGGEQRKIR MEKRTRRAYV AVMDTPTRHI AADPVTAASG TTTAPGTALR 
PDDPRLALVR AACDTLRQRA EPVPLAELAE AAGLSPSHFQ RVFTRLVGVS PRAYQQACRE
QRLRGALERG VPVAEAIYEA GFGSPSRVYE DAHGMLGMTP ARYRKGAPGR QLAVAAAQTS
LGWLVMAATE DGVCAIDIGD DREALLADLQ RRFPGAELHT PSDAVRQWLG TVVAFVEHGG
AHPALPLDVR GTAFQHAVWS ALRELPPGTT LGYAQLAARI GRPSAVRAVA AACARNPVAV
VVPCHRVLGR DGALTGYRWG VDRKAELLRR EAARMPIG
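
The sequences above are printed in blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch, not the database's own method, of how the blocked gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. The helpers `clean_sequence`, `gc_fraction` and `translate` are hypothetical names. The codon table is the standard one, which translation table 11 uses for elongation; alternative start codons allowed by table 11 are not handled here, which is adequate for this record because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11 for elongation).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(blocked: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean_sequence(blocked)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGCGTGGGC GGGAAGGCAA AGGAATTTTT"
print(translate(first_blocks))  # MRGREGKGIF
```

The printed residues match the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content field.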