Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_3143 |
Symbol | |
ID | 7175089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 3963731 |
End bp | 3965530 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643541679 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002437547 |
Protein GI | 218888226 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCATC CGCACGGCAT TCCCGCACTC CATCCGGCCC AGGCCCCGGC GCTGCTGGCA ACCCTCATGG ACCGCGTGCC CCTGTGGGAA TGCGGGGCCA TGGGCGCGGA CAACCAGTTC CGGCTGGCGC GTGGGGCCAT CGAAAACGCG GGGGGCGCCC CCGGCATGGT GGCCGCCGGG GTGGGCATGG CCCTATGGGG CTGGTGCGAC AACCCGCTGG ATACCCGTCT TGCCGCCCTG CTGCGGGCCA TCATGGAAGA CGCCCCGCAG GCATTCGCCC CGGCCCAGCG CGAACTGGTG GCCCGCGTGC ATGACAGGGT GCGCGTGCCC GCCGACTTCG GCGCGCTGGA TACTGCCCTG AATGCGCATG GCGGCCCGGC GGCCGCCCCG GCGCGGGCCG TACTGGACGT GGTGCTGCCC CGGCTGGCCG ACCCGGCGCA CGGGCTGTTC TGGCTGGCGC GGGGCGTGGA GTGGTGCGTG CGCCACGAGA ATTCCGTTGT CGCAACCACG GACGGAACAG ACTCCCCGGC ACTTCCGGTT GCGGACGACA CCACGCTGGA AACCCTGCTG CGCGCCGCCC TGTCGACACC GCAGCCCCTA CCGGCCCCCA TGGCCCACAG GCTGCGCGCC GAATGGGCGG TTCGTCGCCT GGACCCCGCC GAGGCACTAC GACGACTGGA GCAACTGGCG GGACAGGACA CTACCACCGC TCGCGTATCC CCCTTTGCCT CGTGGTACGC ATTGCGCCGC GCCGAACTGC TGATCCGGCT GGAACAGCCT GGCCATGCGG CGGACGCGGC CGCCCTGCTG CTGCCCCTGT GGCGCGAGGC CCCGTACCAT CCCAACCTCA CGCTGGCCCT GCACGAACTG CTGTACCCGC TGCCCGCGCC CCCGGCGGAC ATGGCCCCGC CCGCCATCCT GCTGTACACC TGGAACAAGC GCGACCTTGC GGCGCAGACC CTGCGCTCGT TGCGCGCGGC TGGCTTTCGC GGCGCGCCGG TCTTCGCGCT GGACAACGGC TCGCAGGACG GCACGGAAGC AATGTTCCGC GCCATGGCGG CGGACTGGGG TTCGCCGTTC ACCGAGGTGC GCCTGCCGGT GAACGTGGGG GCCCCCGCTG CCCGCAATTG GCTGCTGTCC CTGCCGGATG TGCGCCAGCG CGACCATGCC GTGTTCCTGG ACGACGACGT GCTGCTGAAA CCCGGCTGGC TGGATGGCCT GCTGGCCGTG GCCCACGCCC GGCCCGCCTG GGGGACCATA GGGTGCGCGG TCACCGACCA CACGCCGCCC CACGCGCTGC AATGCGCCGA CTTTTTCGCC CTGCCGCCGG ACATGGGCAC CCGCAGCTTT GCCGACATTG ACGAGCTCTG GCACATCCAC GGCAACGCGG CAGGCAGCCC CGACCCGCTG CTTACCGCCT ACACCCGGCC CTGCCTGTCC GTTTCCGGCT GCTGCCACAT GGTCTCCATG GCTTCGGTGC ACCAAGTGGG CGCCTTCGAC GTGCGCTTCA CCCCCAGCCA GTTCGACGAC CTGGAACGGG ACATCCGCTG CACGCTGGCC GGGCTGCCGG TATGGTACGC GGGCACGGTG CGGGTGCGCC ACATGCAGCA TTCCAGCCTG CGCCAGGCCA GCAGCCGGGC GCGCAGCGCA CATATCTTCG GCAACCGCAT CAAGCTGGAG CACCTGTACC CGGCCGACAA GGTCAGGCAG GCGCGCGAGA CATCCGCGGC GCTGGCCCGG CAGGATCTGC TGCGCAAAAC CGCCGCGCTG GCGGCCCTGC CCGCGCCGGG CGCCGGGTAA
|
Protein sequence | MAHPHGIPAL HPAQAPALLA TLMDRVPLWE CGAMGADNQF RLARGAIENA GGAPGMVAAG VGMALWGWCD NPLDTRLAAL LRAIMEDAPQ AFAPAQRELV ARVHDRVRVP ADFGALDTAL NAHGGPAAAP ARAVLDVVLP RLADPAHGLF WLARGVEWCV RHENSVVATT DGTDSPALPV ADDTTLETLL RAALSTPQPL PAPMAHRLRA EWAVRRLDPA EALRRLEQLA GQDTTTARVS PFASWYALRR AELLIRLEQP GHAADAAALL LPLWREAPYH PNLTLALHEL LYPLPAPPAD MAPPAILLYT WNKRDLAAQT LRSLRAAGFR GAPVFALDNG SQDGTEAMFR AMAADWGSPF TEVRLPVNVG APAARNWLLS LPDVRQRDHA VFLDDDVLLK PGWLDGLLAV AHARPAWGTI GCAVTDHTPP HALQCADFFA LPPDMGTRSF ADIDELWHIH GNAAGSPDPL LTAYTRPCLS VSGCCHMVSM ASVHQVGAFD VRFTPSQFDD LERDIRCTLA GLPVWYAGTV RVRHMQHSSL RQASSRARSA HIFGNRIKLE HLYPADKVRQ ARETSAALAR QDLLRKTAAL AALPAPGAG
|
| |