Gene Information |
Locus tag | Dole_2052 |
Symbol | |
ID | 5694895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 2501690 |
End bp | 2502928 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641264653 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001529933 |
Protein GI | 158522063 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
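
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS span includes the terminal stop codon; the record does not state either convention explicitly. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that ends with a
    stop codon, which is not translated into an amino acid.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(2501690, 2502928)
print(gene_len, protein_len)
```

Under those assumptions the result agrees with the Gene Length (1239 bp) and Protein Length (412 aa) fields.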
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000028188 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTCCA GTCTGAAGAT TCTCCACCTG ATCAGCCAGC GGCCCGATGC CACGGGCAGC GGTGTCTATG TCCAGGCCAT GCTGCGTCAC GCCGCACAAA AAGGGCATTG CAACCACCTG GTGGCCGGCA TTCAGGCGAA TAATATACCC CAGGCCCCGG TTGTCAATCA ATGTGAATGC GCTTTTGTCT GTTTCGAAGG GCCGGACACA CCCCTCCCCA TTGTGGGCAT GAGCGATGTG ATGCCTTATA AAAGCCGGCG GTTTTGCGAC CTGTCCGATA ACGCGGTGGA TGAATATGAA ACCTGCTTTG CCGAAAAACT GCGCCATGCC GTCAAACGTT TCGCTCCGGA CCTGATCCAC AGCCATCACC TGTGGCTGGT TACCTCCCTT GCCAGACGCA TGTTTCCCGG CCTGCCCATG GTCACCACCT GTCACGGCAC CGATCTGCGC CAGTTTCAGA ACTGTCCTCA CCTGCAGGCG CGAGTTCTGG AGGGATGCGC CGGGCTGGAC GCGGTCATGG CGCTGAGCCG GGCGCAAAAG ACTGAGATTG CCTCCCTGTA CGGGCTGTCC GAAGAAAAGA TTCACGTGGT GGGCGCCGGG TATGACGAGG CCCTGTTTTA TCTTCAGGCC AAGCCCGTCC CGCATCCGGT GCAGGTGGTA TATGCGGGCA AACTGTGCAA CGCCAAGGGC ACGCCCTGGC TGTTGAAAGC ATTATCCGCC ATCCATACCG TGCCGTGGCA GCTTCACCTG GTGGGCGGCG GTGCCGGCGA GGAGGCCGAT CAGTGCTGGA AAATGGCCGG CGACCTGGGA GACCGGGTGT GCGTTTACGG TGCCGTGGAC CAGTCCACGC TTGCGGCTTT GATGCGACAA AGCCATATTT TTGTGCTGCC CTCATTTTTT GAAGGCCTGC CCCTTGTGCT GCTGGAGGCC CTGGCCTGCG GATGCCGTGT TGTTGCCACC GACCTGCCCG GCGTGGCCGA GGTGCTGGAC GGCATGGATG CCGATTATAT CTCCCGGGTC CATCCGCCGG GATTGCACAC GGTAGACAAA CCCTTTACCC AGGATCTGGA CCGGTTTGTC AAAGACCTCG CAAACGTTCT TACCACACAG ATGGCCGCAG CGGTCCAACA GCCCGATATT GATCTTTCCC TTATTCAGGA CCGGCTTTCC GGTTTTACCT GGGGCCGGGT TTTTGAACGG GTGGAACGGG TCTACAGGTC GGTTTGTCGT CTATCATAA
|
Protein sequence | MVSSLKILHL ISQRPDATGS GVYVQAMLRH AAQKGHCNHL VAGIQANNIP QAPVVNQCEC AFVCFEGPDT PLPIVGMSDV MPYKSRRFCD LSDNAVDEYE TCFAEKLRHA VKRFAPDLIH SHHLWLVTSL ARRMFPGLPM VTTCHGTDLR QFQNCPHLQA RVLEGCAGLD AVMALSRAQK TEIASLYGLS EEKIHVVGAG YDEALFYLQA KPVPHPVQVV YAGKLCNAKG TPWLLKALSA IHTVPWQLHL VGGGAGEEAD QCWKMAGDLG DRVCVYGAVD QSTLAALMRQ SHIFVLPSFF EGLPLVLLEA LACGCRVVAT DLPGVAEVLD GMDADYISRV HPPGLHTVDK PFTQDLDRFV KDLANVLTTQ MAAAVQQPDI DLSLIQDRLS GFTWGRVFER VERVYRSVCR LS