Gene Dole_2087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2087 
Symbol 
ID5694930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2535879 
End bp2537207 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content59% 
IMG OID641264688 
Producthypothetical protein 
Protein accessionYP_001529968 
Protein GI158522098 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000422239 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATC ATTTTTTGGT TGTCTGTGTA TGGCTGACAG TCTGTCTGGC GCCTGTCTGT 
GCCCCTGCCG GCCAGTCCTT CGAGGTATCC CAGTTCATTT CCCCTGATAC CTGCAGCGGA
TGCCACAGCG ACATCTACGC CCAGTGGAAA AACTCCATGC ACGGCCTGGC CCATAAGGAC
CCCATTTACC AGAAGGTGGC AAAGTTTTTT TTAACCGGCC TGACCGACCC CGGCGAGGTG
GAGGAGTCCG AGTCCTGCGT CAAGTGCCAC ACCCCGGTGG GCGTGGTGAG CGGCTTTCCA
AAAAAGACCT CGGACGACTG GTCAAAGACC CCGGAGATCG CCACCCACGG GATCCAGTGC
GATTTCTGCC ATTCGGCCGT GAGCGCGGAA AAGATGTACA ATAACGGCAT GGTCCTGTCG
CCGGGTAACG GCGAGGCCGA TCCCGGCATC AAACGGGGCC CGATCAAGGA TCCGGTGCCC
GAGTTCCACG AAGCCGAGTT TTCGGAATTT CATACCGGCG CTGAAATCTG CGGCACCTGC
CATAACGTCA AGCACGTGGC GTTCGGCACC GACCTGGAGA CCACCTATGA TGAGTGGGCC
GCCGGCCCCT ACAACAGCGA CGATCCGGCA AAGCGGGTAG TGTGCCAGGA ATGCCACATG
CGCCAGAAGC CCGGCCTGCC GGCCACCGGT TCCACCCCGC GGCCCGACAA CCCCGGTTAT
GCGTCGGATA TCGGCCCGGA ACGGGACCAC GTGTATACTC ATTATTTTGT GGGCGCCAAC
AATTTTGTGC CCCAACAGTT CGGCGACACG GAAAAAACGG CAATGGCCGT TGAGCGGCTG
ACCCATGCCG CTACCCTTAC CCTGGATACG ACAGGGATAA AAAAGGGGCG CCTGACGGTG
ACGGTGTCCA ATACCGGGGC CGGCCACAAG CTGCCCACCG GCCTGACAAA CGCCCGACAG
ATGTGGCTTG AGGTGACGGT GAAAAGCAAA AAGGACGGGC AGGTGCTTTA TGCTTCCGGG
GCTTTGAATG CCGATGGCTA CGTGGCAGAC AGTGCGACGG TTTACCATAC GATTTTCGGG
GACGGCAAGG GCAAACCGGT GGACAACATC TCCCTTGCGC GTGAGATCCT TACGGATCAA
CGCATACCGC CCGGGCAGGC CGTGACCGAA ACCTTTAAAC TGCCGGCCAA AACCCCCTGC
AAGGATGTGG TGGTTTCGGT CCGGCTTCAG TACCGCATCT GCTCCCAGAA ACTGCTGGAC
CTGGTGCTGG GCAAGGGCGC GCTCTCAGTA CCGGTGGTGA CCATGGCCCA GATTGAAACC
AGTCTGTAG
 
Protein sequence
MKNHFLVVCV WLTVCLAPVC APAGQSFEVS QFISPDTCSG CHSDIYAQWK NSMHGLAHKD 
PIYQKVAKFF LTGLTDPGEV EESESCVKCH TPVGVVSGFP KKTSDDWSKT PEIATHGIQC
DFCHSAVSAE KMYNNGMVLS PGNGEADPGI KRGPIKDPVP EFHEAEFSEF HTGAEICGTC
HNVKHVAFGT DLETTYDEWA AGPYNSDDPA KRVVCQECHM RQKPGLPATG STPRPDNPGY
ASDIGPERDH VYTHYFVGAN NFVPQQFGDT EKTAMAVERL THAATLTLDT TGIKKGRLTV
TVSNTGAGHK LPTGLTNARQ MWLEVTVKSK KDGQVLYASG ALNADGYVAD SATVYHTIFG
DGKGKPVDNI SLAREILTDQ RIPPGQAVTE TFKLPAKTPC KDVVVSVRLQ YRICSQKLLD
LVLGKGALSV PVVTMAQIET SL