Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_1950 |
Symbol | |
ID | 7173868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 2411162 |
End bp | 2412178 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643540466 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002436361 |
Protein GI | 218887040 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0350] Methylated DNA-protein cysteine methyltransferase [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | [TIGR00589] O-6-methylguanine DNA methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 109 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGGGC GGGAAGGCAA AGGAATTTTT TTGCTCGGCG GGGAGCAGCG CAAGATTCGG ATGGAAAAGC GCACGCGGCG GGCCTATGTT GCCGTCATGG ACACGCCCAC CCGCCATATA GCCGCCGATC CGGTGACTGC CGCATCTGGA ACGACCACTG CGCCGGGCAC GGCCCTTCGG CCCGACGACC CGCGTCTGGC GTTGGTCCGC GCCGCCTGCG ACACCCTGCG CCAGCGTGCC GAACCGGTGC CCCTGGCGGA ACTGGCCGAG GCGGCGGGCC TGAGCCCCAG CCATTTTCAA CGTGTGTTCA CCCGATTGGT GGGCGTTTCG CCGCGCGCCT ACCAGCAGGC CTGCCGCGAA CAGCGCCTGC GCGGCGCGCT GGAGCGGGGC GTGCCCGTGG CCGAGGCCAT CTACGAGGCC GGGTTCGGTT CACCGAGCCG GGTCTACGAG GATGCGCACG GCATGCTGGG CATGACCCCG GCCCGCTACC GCAAGGGTGC GCCCGGCAGG CAACTGGCGG TGGCCGCCGC GCAAACGTCG CTGGGCTGGC TGGTCATGGC CGCCACGGAG GACGGGGTGT GCGCCATCGA CATCGGCGAC GACCGCGAGG CCCTGCTTGC CGACCTGCAA CGCCGTTTTC CCGGCGCGGA ACTGCACACC CCCAGCGATG CGGTGCGGCA GTGGCTGGGC ACGGTGGTGG CCTTCGTCGA GCACGGCGGC GCACACCCCG CGCTGCCGCT GGACGTGCGC GGCACCGCCT TCCAGCATGC GGTATGGAGC GCCCTGCGCG AACTGCCACC CGGCACTACC CTGGGCTATG CCCAGCTTGC CGCCCGCATC GGCAGGCCCA GCGCCGTGCG CGCCGTGGCC GCCGCCTGCG CCCGCAACCC CGTGGCCGTG GTGGTGCCCT GCCACCGCGT GCTGGGCCGT GACGGCGCAC TCACCGGCTA CCGCTGGGGG GTGGATCGAA AGGCCGAACT GCTGCGCCGC GAAGCCGCCC GAATGCCCAT CGGGTAA
|
Protein sequence | MRGREGKGIF LLGGEQRKIR MEKRTRRAYV AVMDTPTRHI AADPVTAASG TTTAPGTALR PDDPRLALVR AACDTLRQRA EPVPLAELAE AAGLSPSHFQ RVFTRLVGVS PRAYQQACRE QRLRGALERG VPVAEAIYEA GFGSPSRVYE DAHGMLGMTP ARYRKGAPGR QLAVAAAQTS LGWLVMAATE DGVCAIDIGD DREALLADLQ RRFPGAELHT PSDAVRQWLG TVVAFVEHGG AHPALPLDVR GTAFQHAVWS ALRELPPGTT LGYAQLAARI GRPSAVRAVA AACARNPVAV VVPCHRVLGR DGALTGYRWG VDRKAELLRR EAARMPIG
|
| |