Gene Dole_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_3066 
Symbol 
ID5695926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3674489 
End bp3675877 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content61% 
IMG OID641265683 
Producthypothetical protein 
Protein accessionYP_001530946 
Protein GI158523076 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG1090] Predicted nucleoside-diphosphate sugar epimerase
[COG4276] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01777] conserved hypothetical protein TIGR01777 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000836111 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCACCG ACACCTTCAC ACGGCAGTCG ATCATTGATG CCGACGCGCG CACTCTGTTT 
TTATGGCATG CCCGGCCTGG CGCCATTGAA CGGCTCAGCC CGCCCTGGGA CCCCCTGGAG
GTGATCTTTC GCACCGGCGG CATTACTGTC GGCGCCCGGG TGGTACTGAA GATGTTTGCC
GGCCCGGTGC CCTACCGGTG GCACGCCCGG CATACCGTGT ATGAAGAAAA CCAAAAATTC
GTGGACGAGC AGGTCAAAGG GCCCATGGCC TTCTGGCGTC ACACCCACGC CTTTGAGCCG
GCCGGAGAAA ACCAATGCCG GCTCATCGAC ACCATTGATT ATCGCCTGCC GCTTTATCCC
CTTACCCGTT TTCCCGGCAA GCTCCTGGTG GAGAACAAAC TTGCCCGCAT CTTTGCCTGG
CGTCACCGGA TCACCGCCTT TGACATGGCC CTGCACCGGC GGTTTAACAA AAAGGGGCCC
ATGACCGTGC TGATCTCCGG GGCCAGCGGA GTCCTGGCAT CAGCCCTGAT CCCCCTGCTC
ACCACCGGTG GCCACAGGGT GGTCAGGCTG GTGCGCCGCA AACCGTCGGC CGAAAACGAG
GTGTTCTGGA ACCCGGCCGA CAATGTCATT GACACTGATG CCTTGAAAAA CCATACCATT
GACGCGGTGA TTCACCTGGC CGGCGAGCAT GTGGGCACCG GACGGTGGAC GGACGCCAAG
AAAAAAACCA TTATCGACAG CCGGCAGCAG GGCACCCGTC TTCTGGCCGA AACTGCGGCC
CGGCTCTCCC CCAGGCCCGG GGTTTTTCTC TGCGCCTCGG CCACCGGATT TTACGGTGAA
CGGGGAGAGG CCGTGCTGAC GGAAAATGAC GGGCCGGGAA ACGATTTTCT GGCAAAGGTG
TGCAAAATAT GGGAAGCCTC GGTCCAGCCG GCAACGGACG CCGGCATTCG CACCGTTCGC
ATGCGCATCG GTGTGGTGCT TACACCAAAA GGCGGGGCCC TGCAGCGGCT GCTTCTGCCC
TTTCAGCTCG GCATGGGTGG CCGCCTGGGA AACGGCCGCC AGTATTTAAG CTGGATCGGT
ATTGATGACG CCATCGGCGC CATCTTTTAC CTGCTGATGA ATGAGACGGT CAGCGGGCCG
GTCAACGTGG TATCCCCTTC CCCGGTCACC AACGCCGAAT TTACCCGGAC CCTGGCAACG
GTGCTTTGCC GTCCGGCCTT GATGCCGGTG CCGGCAACGG CCATTGACCT TGCCTTTGGC
GAAATGGGTA CCACCGTGCT GCTCACCAGC ACCCGGGTGG CGCCGTCAAA ACTGACAGAA
TCCGGCTACT GTTTCGGCTG GCCCGATCTT GAAAGCGCAC TGCGCCACAT TCTGGGAAAG
ACACGTTAA
 
Protein sequence
MITDTFTRQS IIDADARTLF LWHARPGAIE RLSPPWDPLE VIFRTGGITV GARVVLKMFA 
GPVPYRWHAR HTVYEENQKF VDEQVKGPMA FWRHTHAFEP AGENQCRLID TIDYRLPLYP
LTRFPGKLLV ENKLARIFAW RHRITAFDMA LHRRFNKKGP MTVLISGASG VLASALIPLL
TTGGHRVVRL VRRKPSAENE VFWNPADNVI DTDALKNHTI DAVIHLAGEH VGTGRWTDAK
KKTIIDSRQQ GTRLLAETAA RLSPRPGVFL CASATGFYGE RGEAVLTEND GPGNDFLAKV
CKIWEASVQP ATDAGIRTVR MRIGVVLTPK GGALQRLLLP FQLGMGGRLG NGRQYLSWIG
IDDAIGAIFY LLMNETVSGP VNVVSPSPVT NAEFTRTLAT VLCRPALMPV PATAIDLAFG
EMGTTVLLTS TRVAPSKLTE SGYCFGWPDL ESALRHILGK TR