Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0380 |
Symbol | |
ID | 4664072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 490408 |
End bp | 492024 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639818582 |
Product | glycosyl transferase family protein |
Protein accession | YP_965830 |
Protein GI | 120601430 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.885586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTACA GCTATCTCGA ACCGGGGCTT CATGCCGAAC TTGCAGCCAT CACGCCCGAC GAGGCAGTGG CGCACCTGCG CAACCATCTG GGCAACCTGC TTCTAGACCC GGCTGTCACC GCGGCCTACA TCAACCGGCT TGGAGTGCTC ACCCCCGAAG AGACCACCCC TTCCTCGCGG GCATGGCTCA CCTACCTGCT AGTGCGTTAC TGCGACCTGC TGCCCCTCGA CCTTGCGGCG CAACGCATCC TCGCACAGGT GACGGGCAAT GCCGCGCCGC ACCTGCGCGA CCTCGAACGC TGCGCCATCC CCGACCCCAT CGAACGCAAA CTCGAACGCC TCGGCGCAGG TGAAGATTCC GAACGCAACC GTGCCATGCT GCTGGCACTG CTGCATGAAT ACCCCTATTC GCCCGCCATC ATGGAACGGC TCATGGCCGT GGAACTCGCA CTGGACATCC CGGTGGGGGG CGAGTGGCTA CCGTCGGTCA GACCTCCGGC AGGACTGATG CGCCCCCTGC ACGAACGGCT CTTCATCCAC GCCATGATGC AGGGCGACAC CGAACGCGCC CTGCAACACG CCGCAGCCTG CGTTCCCGAC AACCCGTCTC CGCATCTGCT CAACCATCTC GCCGAACTGC ATGCCCGCCT CGGCGATGCG CAACAGGCCC TGAACCTGTA TGGAGCCTCG CTCGAAGCCG ACCCGCTCCA GCACCCGGTG AAGCTGCGCA TGGAAGCCCT CGCCAAGCCT CCGCGCGCGC ATGCCGACGC CCTGCACCAC CGCATCGCCA TCTGCCTGTA CAGCTTCAAC AAGGCTGAAC TGCTCGAGCG CACCCTCTCC TCGCTGGCGG CATCGCAAAC GGGCGAGGCG CACATCCTCG TCCTGCTGAA CGGCTGCACC GACGATTCCG CAACGCGCGT GGCTGCGGTG AACGAACGGC TGTTCAACGG GCGACTTGAA GTCATCGACA TGCCCATCAA CATCGGCGCA CCTGCGGCCC GCAACTGGCT GCTTGCCAAG CAGACCGTAC GTGAAGCCGA TTTCGTGGCA TTCCTCGACG ACGACGTGGA CGTGCCCGCC GCGTGGCTGC CCCGTCTGGT GGGAAATCTC GAAGAACATC CCGGCGCAGG GGTGGCGGGA ACGCGCGTGC GCAACCCCGG TGACGTCCCC CGTCTGCAGT ACCTCTACCG GAACATCTCG GTGGCGCGCC CCGGCCTCAT CCGCCTCAGC CTCGACACGC CCACCCTGCA CCACGATACC GGCTTCTACG ACTTCACCCG CAGCACCACC AGCGTCATGG GCTGCTGCCA CGTGTTCACG CGCCGCGCCC TCGACGAGGT GCCCGCCTTC GACCTGCGTT TCTCGCCGTC GCAGATGGAC GACATCGCCC ACGACATCGA CCTTGCCCTT GCCGGGTTCG ATGTGCTCTA CACAGGCGAC GTGGTCTGCG TACATCACCA GATGTCCGGC CTCGGCAGGG CCCATGCCAC GGACTGGAAG CGTTTCGGCA ACGTCGCTGG CAATGACGTG AAGTTCTACT ACCGCTTCGC CGACCGGCTT GACGCACTGG GTCGTCTCAA CAATCTGGGG TTCATGCCCG ACATGCCCCC TGCCTAG
|
Protein sequence | MHYSYLEPGL HAELAAITPD EAVAHLRNHL GNLLLDPAVT AAYINRLGVL TPEETTPSSR AWLTYLLVRY CDLLPLDLAA QRILAQVTGN AAPHLRDLER CAIPDPIERK LERLGAGEDS ERNRAMLLAL LHEYPYSPAI MERLMAVELA LDIPVGGEWL PSVRPPAGLM RPLHERLFIH AMMQGDTERA LQHAAACVPD NPSPHLLNHL AELHARLGDA QQALNLYGAS LEADPLQHPV KLRMEALAKP PRAHADALHH RIAICLYSFN KAELLERTLS SLAASQTGEA HILVLLNGCT DDSATRVAAV NERLFNGRLE VIDMPINIGA PAARNWLLAK QTVREADFVA FLDDDVDVPA AWLPRLVGNL EEHPGAGVAG TRVRNPGDVP RLQYLYRNIS VARPGLIRLS LDTPTLHHDT GFYDFTRSTT SVMGCCHVFT RRALDEVPAF DLRFSPSQMD DIAHDIDLAL AGFDVLYTGD VVCVHHQMSG LGRAHATDWK RFGNVAGNDV KFYYRFADRL DALGRLNNLG FMPDMPPA
|
| |