Gene Dole_1009 details

Gene Information

Locus tag: Dole_1009
Symbol:
ID: 5693844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1186967
End bp: 1188874
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 55%
IMG OID: 641263606
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_001528896
Protein GI: 158521026
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
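
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1186967, 1188874)
print(length)                  # 1908
print(protein_length(length))  # 635
```

Both results match the Gene Length and Protein Length fields of the record.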


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000685319
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGC TTTATTTTTC CAAGAACGTG GTGATCGTGG TGGTGGTGGA CGCGCTGCTT 
TTTTGTTGCG CTTTCTACCT TGCCTACCTG CTGCGTTTTG ATTTTCACAT TCCCCGATTT
TATCGGGTCC CGTTTCAGCA GGTCCTTCCC CTGGTAATCC TGCTGAAGCT GGCCTCTTTT
TACTGGTTCG ACCTTTACCG GGGCATGTGG CGTTACACCA GCCTTTCCGA CCTTTTTAAC
ATCGTTAAAG CCGCTTTGAT AACCAACCTG GTGATTGTGG GGGGGCTGCT GTTTTTCAAT
CGGTTCCAGG GATTTCCCCG TTCGGTTTTC CTCATCGACG CGCTGCTTAC CATTCTGACT
GTTTCCGGGT TTCGCATCCT GATCCGTATC TATTTCGAGC ACGCCGCAGG CGACAAGATG
AGCCAGGTGG TGAAGAAAGC CTTTTCCCAG GTCTTTTCAA AAACTGACGG CCACACCCGT
CGCGTGATCA TTCTGGGCGC CGGGGACTGC GGGGAAAAGA TATATCGGGA GATCGCCCAT
AATCCTTCCC TGGGTTTTCA GGTGGTGGGA TTCCTTGATG ACAACCCGGT AAAGGTGGGT
AAAAAAATCC ATGGCCTGCC CGTGCTGGGA GAAATTGCCG GGGTGTCCAA GTTTGTGGCC
CGCCTGTCCA TTGATGAACT GATCATTGCC ATTCCTACGG CCACACCGGA ACAAATGCGC
ACCATCGTGG CGCTTTGCGA GAAGAGCGGC ATTCCTTATA AAACCGTTCC GGGTTTTAGC
GAACTGTTAA ACGGTACGCC ATCTGTCGCA GCCCTGCGCA AGGTGGCCTA TCGGGATTTG
TTGGGCCGTG AGGTGGTGCG ACTGGATAAG GAGGGTATCG GCGCCTACCT GGCGGGAAAA
ACCGTTCTGG TCACCGGCGC CGGCGGTTCT ATCGGTTCGG AGTTGTGTCG CCAGATACTT
GAGTTTTCTC CCGGACGGAT TGTGCTGTTT GATCGGTCCG AAACGGCACT TTATGAGATC
GACCTGGAAC TCAAGGGGCA GCGCCGCGAC ACCGGGCTTC GTATTTCCCC GGTGCTGGGT
GATATTCAGG ACCAGCGCCA GCTTGAACAC CTGTTTCAAC TGACAGCCCC CCATGTCGTG
TTTCACGCGG CCGCTTATAA GCATGTCCCC ATGCTGGAAG CACACCCCTG GAAGGCGGTA
AAAAACAATA TTATCGGTAC ACGGAACCTG GTGGAACTTT CCAAACGATT TGCCGTGGAA
CGATTCGTGC TGGTCTCAAC GGACAAGGCC GTGCGGCCCG CCAATGTGAT GGGGGCCTCC
AAGCGGGTGG CTGAACTGCT GCTTCAGTGC GGTAACGGAG GTGCCCCCTG CACCACGCAG
TTCATGATCG TCCGGTTCGG AAACGTTCTG GGCAGCGCCG GCAGCGTGAT TCCCCTGTTC
CAAAAACAGA TTGGAAAAGG CGGCCCGGTC ACCGTGACCC ATCCGGAGGT GACCCGTTTT
TTTATGACCG TTTCCGAGGC GTGCCAACTG ATTCTTCAGG CCGGCGCCAT TGGCAACACG
GGCCGGGGCA GGGCAGAAGT GTTTGTCCTC AAGATGGGAA CGCCGGTTAA AATAGTGGAC
ATGGCCCGGG ACCTGATCCG TCTGTCCGGC CTGGAGCCGG ACAAAGACAT TTCCATCGAA
TTTGTCGGGC TTCGGCCCGG GGAAAAGCTG TATGAAGAAC TGATCGTTGA AGGTGAAGGT
GTGGTCCCCA CCGAGCACGA GAAAATAATG GTGCTGCGAG GAGCCGAGGC CCACGCCGCC
GTTCTCAATG GTGCCATTGA AGAACTGGAG CGGTCTGCTG AACTCCAGGA TGGGGGCGCG
ATAAGGGTCT GGCTGAAAAA AATCGTCCCC GAATATGAAC CCGCTTAA
 
Protein sequence
MKKLYFSKNV VIVVVVDALL FCCAFYLAYL LRFDFHIPRF YRVPFQQVLP LVILLKLASF 
YWFDLYRGMW RYTSLSDLFN IVKAALITNL VIVGGLLFFN RFQGFPRSVF LIDALLTILT
VSGFRILIRI YFEHAAGDKM SQVVKKAFSQ VFSKTDGHTR RVIILGAGDC GEKIYREIAH
NPSLGFQVVG FLDDNPVKVG KKIHGLPVLG EIAGVSKFVA RLSIDELIIA IPTATPEQMR
TIVALCEKSG IPYKTVPGFS ELLNGTPSVA ALRKVAYRDL LGREVVRLDK EGIGAYLAGK
TVLVTGAGGS IGSELCRQIL EFSPGRIVLF DRSETALYEI DLELKGQRRD TGLRISPVLG
DIQDQRQLEH LFQLTAPHVV FHAAAYKHVP MLEAHPWKAV KNNIIGTRNL VELSKRFAVE
RFVLVSTDKA VRPANVMGAS KRVAELLLQC GNGGAPCTTQ FMIVRFGNVL GSAGSVIPLF
QKQIGKGGPV TVTHPEVTRF FMTVSEACQL ILQAGAIGNT GRGRAEVFVL KMGTPVKIVD
MARDLIRLSG LEPDKDISIE FVGLRPGEKL YEELIVEGEG VVPTEHEKIM VLRGAEAHAA
VLNGAIEELE RSAELQDGGA IRVWLKKIVP EYEPA
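
As a consistency check between the gene and protein sequences above, here is a minimal Python sketch of codon-by-codon translation. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons). The function name `translate` and the variable names are hypothetical, and only the first 30 nucleotides of the gene are copied in for illustration.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these assignments with the standard code; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Alternative start codons are not special-cased; the gene here starts with ATG.
    """
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence listed above.
gene_start = "ATGAAAAAGC TTTATTTTTC CAAGAACGTG"
print(translate(gene_start))  # MKKLYFSKNV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would allow the whole record to be checked in the same way.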