Gene Information |
Locus tag | DvMF_1927 |
Symbol | |
ID | 7173845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 2375912 |
End bp | 2377843 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643540443 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002436338 |
Protein GI | 218887017 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCACG CCTCCCTTGC CCTTGCCTGG CAGCGCCTGC CGCTGTGGCT GCGCATCCGC CTCGTGCATG GCAGCGTGGG GTCCGTCCAC CGGCTGCGCT GCGCCGCCGA CGCCATCACC CGCTCGCAAT CCCCCGCCGC TTCGCCCGCC GACACCACAG GCCCCCACGG GCGTGAAGAC CTGCTGCGCA CCGGTCGCGA ACTGCTGCTG GCCGCGTGGG AGGACGACCC CTGCAACGGC CAGTTGGCCG GGCAGGTACT GCTGCTGCAT CAGCGCCTGC CCTGGCTGCA CCCCGCGCTG GCCGCGCTGC TGGCCGCCGT GCACGGGGCC TGGCGGCGGC CCGCCGACCT GGCCCGCTAC GAACGGCTGG CCGCCCAGGC AGACTGGGTG CGCCTGCAAC GCCATGTGGA CGCCGAGAGC CAACGCGAGC CGGACAACCT GTTCTGGGTG CAGCAGGCCG TGGCCGTGGG CGAACTTTCC GGCGACCTGG ACTGGCTGGA CGGGCACCTG CACCGGGCCG CAGCCCGGCT TGCCCCGTCT GGCGGATCTG GCCTACCCCC TCTCGCCCCG CTTTTCGACC ATCTTCACGG CGGCCTGGCC ACCAACCGCG CCTCATCCTG TGTGGGACTT GCTGATGGCG GGGCCACCGC CGCCCGCCAC CATGACGCCG CGCTGGCCCA TTTTCGCGCC GCCGCGCTGG CCGTGTCCGG CCTGCCCGCG TCCGGCCAGC CTGCATCTTC CGCCACGCCG GGCAGCGGCT GCTGGCTGGC CCCGGTGGAG CACGCCGCCC ACTGCCTGGT CCGCCTGGGC GATGTCCCCG CCGCCCTGCC CCTGTGGAAT GCGGTGCTGG CCGCCCGGCC CTGGCACGTC AACCTCGCCC TGCGCGCCCA CGACGTCTGG CGCGGCGTGG ACACTCCGGA AGCCGCGCCC GCCGGGTCCA CCGCCGTGCT GCTCTATTCG TGGAACAAGG CGCAGGAGCT CGACACCGCC CTTGCCCACC TGGCACCCGG CCTGCCGGAC GTGGCCCGCA TCGCCCTGCT GGACAACGGA TCCACCGACT GCACCGGCGA CGTGGTGCGC GCCTGGGCCG ACAGGTTCGG CGCGGACCGC TGCACCGCCG TGCACCTGCC GGTCAACGTG GGGGCCGCCG CCGCGCGCAA CTGGCTGATG CGCCTGCCGG AAGTGGCCAC CTGCGACTTT GCCGCCTATC TGGACGACGA CGCCGCCGTG CCGCCCGACT GGCTGCGCCG ACTGGCCGGG GCGGTGCGCC GCCAGCCGGA TGCCGCCGCC TGGGGCGGGC GCACCCTGGA CTGGCACGCC CCGTACATGA TCCAGTCTGC CGACCTGCAC CTGACCGCGC ACTTCCGCGC GCCGGAAGGC ACCCCGCCGC ACCTCGCCGC CTTCGCGGAC GACACCCCGC CACCGCCCGA CAGCGCGACC GGCGAGCCGC AGCGCGACAC GCTGTCGCCG GAGGTGGCCT TTTCTCCGGC CCGCGCCCAT GCGCTGCCCT TTTCGGTCAG CGACCTGCAC GCCCAGGTCA CGGACATGGG CCGCTTCGAC TACCTGCGAC CGTGCATTTC GGTAACCGGG TGCTGCCACA TGTTCCGCAC CACGGAACTG CTGGAGGGCG GGGGATTTTC CCTTTCGCTG TCGCCTTCGC AATACGACGA CCTGGAACAC GACCTGCGCC GGGCCCGCGC CGGACGCCTA GCCTGCTACA CCGGCTTTCT GCCCGTGCGC CACATGAAGC GCAGCGGCAA GGCCGTGCGC ATGGGGGCGG CGCAGTTCGG CAACGGCCTT 
GGCAACCGGT ACAAGCTTTC GGGCATGTTC GACGCCGCCG AGGTGCTGAC CATGCGCCGC CGCGAATTCG AGGCGCTGGA ACGCGACCTG CTGCGCAAGC TGGCGGCCCT GGGCGCACGC AAGGCTCCCT GA
|
Protein sequence | MPHASLALAW QRLPLWLRIR LVHGSVGSVH RLRCAADAIT RSQSPAASPA DTTGPHGRED LLRTGRELLL AAWEDDPCNG QLAGQVLLLH QRLPWLHPAL AALLAAVHGA WRRPADLARY ERLAAQADWV RLQRHVDAES QREPDNLFWV QQAVAVGELS GDLDWLDGHL HRAAARLAPS GGSGLPPLAP LFDHLHGGLA TNRASSCVGL ADGGATAARH HDAALAHFRA AALAVSGLPA SGQPASSATP GSGCWLAPVE HAAHCLVRLG DVPAALPLWN AVLAARPWHV NLALRAHDVW RGVDTPEAAP AGSTAVLLYS WNKAQELDTA LAHLAPGLPD VARIALLDNG STDCTGDVVR AWADRFGADR CTAVHLPVNV GAAAARNWLM RLPEVATCDF AAYLDDDAAV PPDWLRRLAG AVRRQPDAAA WGGRTLDWHA PYMIQSADLH LTAHFRAPEG TPPHLAAFAD DTPPPPDSAT GEPQRDTLSP EVAFSPARAH ALPFSVSDLH AQVTDMGRFD YLRPCISVTG CCHMFRTTEL LEGGGFSLSL SPSQYDDLEH DLRRARAGRL ACYTGFLPVR HMKRSGKAVR MGAAQFGNGL GNRYKLSGMF DAAEVLTMRR REFEALERDL LRKLAALGAR KAP
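
The coordinate and length fields in this record are related by simple arithmetic, and the GC content can in principle be recomputed from the gene sequence field. The following is a minimal Python sketch of those consistency checks. The function names are hypothetical and not part of any database API. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2375912, 2377843)
print(length)                  # 1932, matching the Gene Length field
print(protein_length(length))  # 643, matching the Protein Length field
```

Passing the contents of the gene sequence field (including the spaces between 10-base groups) to `gc_fraction` gives a value that can be compared with the reported GC content, assuming the database rounds to a whole percent.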