Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0831 |
Symbol | |
ID | 7172720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 1003800 |
End bp | 1005158 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643539332 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002435255 |
Protein GI | 218885934 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.0300168 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACA CAGGCACATC CGGCTCCATC TACTTCGAGG CAGGCGTCAT TCTCCTGCTC ATCCTGATCA ACGGGTTCTT TTCGCTGGCG GAAATGTCGC TGGTGGCCTC GCGCAAGGTG CGACTGCGCC AGGACGCGGA ACGCGGCGTC AAGGGCGCGG CCACGGCCCT GCGCCTGCTG CGCGAGCCCG ACCGGCTGTT CTCCACCGTG CAGATCGGCA TCACCCTGGT GGGCATCCTT ACCGGTGCCT ACGGCGGCGC CGCGCTGGCC GAGCACCTTT CCGCCGTGCT GGCCCGCGTG GACGTGCTGC GCCCCTACAG CGGCCCGCTG GGCTTCGGCC TAGTGATCCT GCTCATCACC TATTTCACGC TGATTCTTGG CGAACTGGTG CCCAAGCGCA TGGCCTTCGG CAACCCGGAG GCCTGCGCCC GCCGCACCGC GCCGGTCATG GCGCTGCTGC TGCGCTTGGC CCTGCCGCTG GTGCACCTGC TCAGCGCCTC GTCGCGCGCG GCCTCGCGCC TGCTGCGCCT GCCCGAGGGC GGCGACAGGG CCGTGACGGA AGAGGACATC CGGGGGCTCA TCGGCGAGGG TGCGGCATCG GGCGTGGTGG AGCACGCCGA GCGCGACATG CTGGAACGCA TCTTCCGGCT GGGCGACCGG CGGGCGGGGT CGCTGATGAC CCACCGTTCG CAGGTGGAAT GGCTGGACCT GGACATGCCC GACGCGGAGA ACATGCAGCG CATCGCGCAG TCGTCCCATT CCTGCTTTCC CGTGGCGCGG GGCGACATCG CCGCCGCCAC CGGGGTGCTG AAGGCGCGCG ACTTCCTGGC CGCACGGCTG GTCACCCCGG ACATTCCCGT GGACGGCTTC ATCCGGCAAC CCCTGTACAT CCCCGAAACG GCCCGCGCCC TGACCCTGCT GGACCTGTTC CGCCACTCCG AAGGCCTGCC CTTCGCCCTG GTGGTGGACG AATACGGCGA GGTGCAGGGG GTGGTCACCC CCAACGACGT GCTGGAAGCC GTGGTGGGCG AACTGCCCGA CGAAGGCGGC GACCCCGACC CGGCGGCGGT GCGCCGCGAG GACGGCAGCT GGCTGCTGGA CGGGTTGCTG CCCTTCGACG AGATGTGCTC GCTGGCGGGG CTGGGCGCTG CGGAGGATCC TGACGACCGG CCCGGCTCGT ACGAAACCCT GGCCGGGTTC ATGCTGCACC GGCTGGGACG CATGCCCGCC ATGGGCGATG CCCTGCGCTG GCGCGGCCAC CGCTTCGAGA TCGTGGACAT GGATGGCCGC CGCATCGACC GCGTGCTGGT GAGCCCCGAT CCGGAACGCG CCGACGACGT GGGCGACGAC GCGCCGTAG
|
Protein sequence | MDDTGTSGSI YFEAGVILLL ILINGFFSLA EMSLVASRKV RLRQDAERGV KGAATALRLL REPDRLFSTV QIGITLVGIL TGAYGGAALA EHLSAVLARV DVLRPYSGPL GFGLVILLIT YFTLILGELV PKRMAFGNPE ACARRTAPVM ALLLRLALPL VHLLSASSRA ASRLLRLPEG GDRAVTEEDI RGLIGEGAAS GVVEHAERDM LERIFRLGDR RAGSLMTHRS QVEWLDLDMP DAENMQRIAQ SSHSCFPVAR GDIAAATGVL KARDFLAARL VTPDIPVDGF IRQPLYIPET ARALTLLDLF RHSEGLPFAL VVDEYGEVQG VVTPNDVLEA VVGELPDEGG DPDPAAVRRE DGSWLLDGLL PFDEMCSLAG LGAAEDPDDR PGSYETLAGF MLHRLGRMPA MGDALRWRGH RFEIVDMDGR RIDRVLVSPD PERADDVGDD AP
|
| |