Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0259 |
Symbol | |
ID | 7172140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 296363 |
End bp | 297397 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643538754 |
Product | protein of unknown function DUF34 |
Protein accession | YP_002434684 |
Protein GI | 218885363 |
COG category | [S] Function unknown |
COG ID | [COG0327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.00852119 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACGCT CTGAAATAAT TAGCGTTATT GAAAAAACGG CACTTCCGGC ATCCGCCGCC CAGTGGGACA GGTCGGGCAT CCAGGTGGCC GGGCACGACG CACAGGCCAC GGCCCTCGCC GTCTGCCTCG ACCCCACCCC CACCGCCGTC GCCGCCGCGC TTGACCTTGG CGCCGACGTC ATCCTTTCCC ACCACCCGCT GGTCATGACG CCCCGCGCGC TGGACACGCT GGACGACCAC CACGCCGTGG CTGCCGCCGT ATTGGCGCGC GGTGCGTGGC TGTACGCCGC GCATACCTCG CTGGACGCCA ATCCCGACGG CCCCGTGGGC TGGCTGGCCC GAGAGCTTTC GCTGGACGCC CCCGAAGTGC TGGAACCCAC ACTGCGCCAG GAACGGCTGA CCCGGTGCAT CGTGGCGGCC ACAACGCCCG CGCCGCACTG GGCCAACCTG CCCGGCGTGC TGGCATGCCG GGTGCTGGGC AACACCGCCG TGCTCAGTTG CGAGGCCGCC GACTGGCCCG CCATCCGCGC GGCCATCACG TCAGACGCCC CGCAGGGCGC CGCGCCCGCC TTCCTGCCCG CCGAGCCGGA ACTGCCCGCG CGGGTCTTCG GCTTCGGCCT GGTGGGCGAC CTGCCCGAAC CGCTCGAATT TCCGGCCTTT GCCCGGCGCC TGCACGGCCT TGCCGGGCGC GCCCACTGCC ACGTCAGCGG CCCCGTCCCC GCCATGGTGC GGCGGGTGGG CTACTGCACC GGTTCCGGCA GTTCACTGGC CGATACAGCC TTTGCCCGTG GCGCCGACGT GTTCGTCACC GGCGACGTGA AGTACCACAC CGCGCTGGAA GCCCGTGGCT GCCTGCTCGA CGTGGGGCAC TTTGCCCTTG AAGAGGAAAT GATGCGCCGC TTCGCCAACG GCCTGCGCAC CGCGCTGCCA GGCCTGCCCG TGCATTTTCT GCCATCCAGC GATCCACTGC GGCTGCACCT GCCCGTTCCC CCCGCGGGCA CGGACGGCGG CCGCGTTCCG GAACATCCGG CCTAA
|
Protein sequence | MKRSEIISVI EKTALPASAA QWDRSGIQVA GHDAQATALA VCLDPTPTAV AAALDLGADV ILSHHPLVMT PRALDTLDDH HAVAAAVLAR GAWLYAAHTS LDANPDGPVG WLARELSLDA PEVLEPTLRQ ERLTRCIVAA TTPAPHWANL PGVLACRVLG NTAVLSCEAA DWPAIRAAIT SDAPQGAAPA FLPAEPELPA RVFGFGLVGD LPEPLEFPAF ARRLHGLAGR AHCHVSGPVP AMVRRVGYCT GSGSSLADTA FARGADVFVT GDVKYHTALE ARGCLLDVGH FALEEEMMRR FANGLRTALP GLPVHFLPSS DPLRLHLPVP PAGTDGGRVP EHPA
|
| |