Gene Information |
Locus tag | DvMF_3134 |
Symbol | |
ID | 7175080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 3953875 |
End bp | 3954912 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643541670 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002437538 |
Protein GI | 218888217 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
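
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3953875
end_bp = 3954912

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
remainder = gene_length_bp % 3
codon_count = gene_length_bp // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1038 bp) and Protein Length (345 aa).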
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTTGCGC TCACCGTGGC CGTTTCGCTT TCGCTCATCA TTTCGGCCTG CTGTTCCGTC ACCGAGGCCA TCCTGTATTC CGTGCCCTGG AGCCACGTGG AGCAGTTGCG CAAATCGGGG CGCAAGGCGG GCCAGGTGCT GTTCGAACTG CGTTCGCGCA TCGAGCAGCC CATCACCGCC GTGCTCACCC TGAACACCGT GGCCAACACG GCGGGGGCCG CCATTGCCGG GGCCTACGCG GCGGACGTGC TGGGCGACGA GCACATGGCC GCCTTTGCCG CCGGGTTCAC GGTGCTCATC CTGATCCTGG GCGAGATCAT CCCCAAGACC ATCGGGGTGG CGCATGCGCG CCCGCTTTCG GAATACGTGG CCCGCCCGGT GCGCGTGCTG GTGGCGCTGC TGATGCCGGT CATATGGTTG GGCGGGCGCA TCACCCGGCT GTTCACTCCG CCCGGCGGCG GCGCGCCGCA CGCCACGGAA GACGACATCC GGGCCATCGT CAGCCTGTCG CGCCGGGCCG GGCGCATCCA GCCCTATGAA GAACTTTCCA TCCGCAACAT CCTGTCCCTG GACCAGAAGC GGGTGCACGA AATCATGACC CCGCGCACGG TGGTCTTTTC GCTGCCTGCC GCCATGACCG TGGCCGAGGC CCACGAACAG CCGGACTTCT GGCACTACAG CCGGGTGCCG GTGTGGGGCG AGCACAACGA GGATGTGGTG GGCATCGCCA CCCGCCGCCG GGTGCTCAAG GAAGTTGCGG AGGACAACGA CACGCTGCGC CTGTCCGAGG TGATGCAGCC GGTGCACTTC GTGCCCGACA CCCAGACCCT GGACCGCACC CTGTTGCAAT TCCTGGATGC GCGCACCCAC CTGTTCGTGG TGCTGGACGA GTACGGCGGT CTAGCTGGGG TCATTTCGCT GGAAGACGTT CTGGAAGAAA TACTGGGCCG CGAAATCGTG GACGAAACCG ACAGGGTGGA CGATTTGCAG GAACTGGCCC GGCGCCGCCG CTCGGAACTG GCACGCAACA AAGAATAG
Protein sequence | MLALTVAVSL SLIISACCSV TEAILYSVPW SHVEQLRKSG RKAGQVLFEL RSRIEQPITA VLTLNTVANT AGAAIAGAYA ADVLGDEHMA AFAAGFTVLI LILGEIIPKT IGVAHARPLS EYVARPVRVL VALLMPVIWL GGRITRLFTP PGGGAPHATE DDIRAIVSLS RRAGRIQPYE ELSIRNILSL DQKRVHEIMT PRTVVFSLPA AMTVAEAHEQ PDFWHYSRVP VWGEHNEDVV GIATRRRVLK EVAEDNDTLR LSEVMQPVHF VPDTQTLDRT LLQFLDARTH LFVVLDEYGG LAGVISLEDV LEEILGREIV DETDRVDDLQ ELARRRRSEL ARNKE
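
As a quick consistency check between the two sequences above, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, the table holds only the twelve codons needed for these two fragments, and it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG as displayed). For these particular codons, translation table 11 gives the same amino acids as the standard genetic code.

```python
# Partial codon table: only the codons that occur in the two fragments below.
CODON_TABLE = {
    "ATG": "M", "CTT": "L", "GCG": "A", "CTC": "L", "ACC": "T", "GTG": "V",
    "GCA": "A", "CGC": "R", "AAC": "N", "AAA": "K", "GAA": "E", "TAG": "*",
}

def translate(dna: str) -> str:
    """Translate a DNA fragment codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 18 and last 18 nucleotides of the gene sequence listed above.
first_fragment = "ATGCTTGCGC TCACCGTG"
last_fragment = "GCACGCAACA AAGAATAG"

n_terminus = translate(first_fragment)
c_terminus = translate(last_fragment)
print(n_terminus, c_terminus)
```

The translated fragments can then be compared with the first six residues (MLALTV) and last five residues (ARNKE) of the protein sequence. A full check would need a complete codon table for translation table 11.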