Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1009 |
Symbol | |
ID | 5693844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1186967 |
End bp | 1188874 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641263606 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001528896 |
Protein GI | 158521026 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000685319 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC TTTATTTTTC CAAGAACGTG GTGATCGTGG TGGTGGTGGA CGCGCTGCTT TTTTGTTGCG CTTTCTACCT TGCCTACCTG CTGCGTTTTG ATTTTCACAT TCCCCGATTT TATCGGGTCC CGTTTCAGCA GGTCCTTCCC CTGGTAATCC TGCTGAAGCT GGCCTCTTTT TACTGGTTCG ACCTTTACCG GGGCATGTGG CGTTACACCA GCCTTTCCGA CCTTTTTAAC ATCGTTAAAG CCGCTTTGAT AACCAACCTG GTGATTGTGG GGGGGCTGCT GTTTTTCAAT CGGTTCCAGG GATTTCCCCG TTCGGTTTTC CTCATCGACG CGCTGCTTAC CATTCTGACT GTTTCCGGGT TTCGCATCCT GATCCGTATC TATTTCGAGC ACGCCGCAGG CGACAAGATG AGCCAGGTGG TGAAGAAAGC CTTTTCCCAG GTCTTTTCAA AAACTGACGG CCACACCCGT CGCGTGATCA TTCTGGGCGC CGGGGACTGC GGGGAAAAGA TATATCGGGA GATCGCCCAT AATCCTTCCC TGGGTTTTCA GGTGGTGGGA TTCCTTGATG ACAACCCGGT AAAGGTGGGT AAAAAAATCC ATGGCCTGCC CGTGCTGGGA GAAATTGCCG GGGTGTCCAA GTTTGTGGCC CGCCTGTCCA TTGATGAACT GATCATTGCC ATTCCTACGG CCACACCGGA ACAAATGCGC ACCATCGTGG CGCTTTGCGA GAAGAGCGGC ATTCCTTATA AAACCGTTCC GGGTTTTAGC GAACTGTTAA ACGGTACGCC ATCTGTCGCA GCCCTGCGCA AGGTGGCCTA TCGGGATTTG TTGGGCCGTG AGGTGGTGCG ACTGGATAAG GAGGGTATCG GCGCCTACCT GGCGGGAAAA ACCGTTCTGG TCACCGGCGC CGGCGGTTCT ATCGGTTCGG AGTTGTGTCG CCAGATACTT GAGTTTTCTC CCGGACGGAT TGTGCTGTTT GATCGGTCCG AAACGGCACT TTATGAGATC GACCTGGAAC TCAAGGGGCA GCGCCGCGAC ACCGGGCTTC GTATTTCCCC GGTGCTGGGT GATATTCAGG ACCAGCGCCA GCTTGAACAC CTGTTTCAAC TGACAGCCCC CCATGTCGTG TTTCACGCGG CCGCTTATAA GCATGTCCCC ATGCTGGAAG CACACCCCTG GAAGGCGGTA AAAAACAATA TTATCGGTAC ACGGAACCTG GTGGAACTTT CCAAACGATT TGCCGTGGAA CGATTCGTGC TGGTCTCAAC GGACAAGGCC GTGCGGCCCG CCAATGTGAT GGGGGCCTCC AAGCGGGTGG CTGAACTGCT GCTTCAGTGC GGTAACGGAG GTGCCCCCTG CACCACGCAG TTCATGATCG TCCGGTTCGG AAACGTTCTG GGCAGCGCCG GCAGCGTGAT TCCCCTGTTC CAAAAACAGA TTGGAAAAGG CGGCCCGGTC ACCGTGACCC ATCCGGAGGT GACCCGTTTT TTTATGACCG TTTCCGAGGC GTGCCAACTG ATTCTTCAGG CCGGCGCCAT TGGCAACACG GGCCGGGGCA GGGCAGAAGT GTTTGTCCTC AAGATGGGAA CGCCGGTTAA AATAGTGGAC ATGGCCCGGG ACCTGATCCG TCTGTCCGGC CTGGAGCCGG ACAAAGACAT TTCCATCGAA TTTGTCGGGC TTCGGCCCGG GGAAAAGCTG TATGAAGAAC TGATCGTTGA AGGTGAAGGT GTGGTCCCCA CCGAGCACGA GAAAATAATG GTGCTGCGAG GAGCCGAGGC CCACGCCGCC GTTCTCAATG GTGCCATTGA AGAACTGGAG CGGTCTGCTG AACTCCAGGA TGGGGGCGCG ATAAGGGTCT GGCTGAAAAA AATCGTCCCC GAATATGAAC CCGCTTAA
|
Protein sequence | MKKLYFSKNV VIVVVVDALL FCCAFYLAYL LRFDFHIPRF YRVPFQQVLP LVILLKLASF YWFDLYRGMW RYTSLSDLFN IVKAALITNL VIVGGLLFFN RFQGFPRSVF LIDALLTILT VSGFRILIRI YFEHAAGDKM SQVVKKAFSQ VFSKTDGHTR RVIILGAGDC GEKIYREIAH NPSLGFQVVG FLDDNPVKVG KKIHGLPVLG EIAGVSKFVA RLSIDELIIA IPTATPEQMR TIVALCEKSG IPYKTVPGFS ELLNGTPSVA ALRKVAYRDL LGREVVRLDK EGIGAYLAGK TVLVTGAGGS IGSELCRQIL EFSPGRIVLF DRSETALYEI DLELKGQRRD TGLRISPVLG DIQDQRQLEH LFQLTAPHVV FHAAAYKHVP MLEAHPWKAV KNNIIGTRNL VELSKRFAVE RFVLVSTDKA VRPANVMGAS KRVAELLLQC GNGGAPCTTQ FMIVRFGNVL GSAGSVIPLF QKQIGKGGPV TVTHPEVTRF FMTVSEACQL ILQAGAIGNT GRGRAEVFVL KMGTPVKIVD MARDLIRLSG LEPDKDISIE FVGLRPGEKL YEELIVEGEG VVPTEHEKIM VLRGAEAHAA VLNGAIEELE RSAELQDGGA IRVWLKKIVP EYEPA
|
| |