Gene Dole_2871 details


Gene Information

Locus tag: Dole_2871
Symbol:
ID: 5695729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3455605
End bp: 3458568
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 61%
IMG OID: 641265486
Product: peptidase M16C associated domain-containing protein
Protein accession: YP_001530751
Protein GI: 158522881
COG category: [R] General function prediction only
COG ID: [COG1026] Predicted Zn-dependent peptidases, insulinase-like
TIGRFAM ID:
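
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3455605, 3458568)
print(length)                  # 2964
print(protein_length(length))  # 987
```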


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000230302
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAACG GCGCCGGTCT TGACGTCGGC TCACGGGTAA GCGGTTACCG GGTAAAACAG 
GTTATTGCGT TTGACGATAT TCAATCGCTG GTTTACGAAC TGGTGCACGA GGCCACCGGC
GCGCAACACC TGCATATCGC CAACGCCGAC AGGGAAAACA CGTTCGGCGT AGGCTTCAAG
ACGGTTCCCA GGGACTCCAC CGGGGTGGCC CATATCCTGG AACATACCAT TCTGTGCGGG
TCGGAAAAAT ATCCGGTGCG GGACCCGTTT TTCTCCATGA TCCGGCGCAG CTTGAATTCC
TTCATGAACG CCTTTACTGC GTCGGACTGG ACCCTGTATC CCTTTGCCTC TCCCAATAAA
AAAGATTTTT ACAACCTGGC CGATGTCTAC CTGGACGCCG TGTTTTTCCC CCTGCTGACC
GAGCTTGCTT TCAAGCAGGA GGGCCACCGC CTGGAGGTGG TAAAGGCGGC CGAAGGCCAG
GGAGCGCCGG AGCTTGCCTA CAAGGGCGTG GTCTATAATG AGATGAAGGG GGCCATGTCA
TCGCCGGACC AGGTGATGTC CCGTGCCCTG ACCGCCGCTC TTTGCCCGGA CACCACCTAC
AGCAACAACT CCGGCGGTGA TCCGGCGGTG ATCCCGACAC TGACCTGGGA GCAGCTGCGG
GATTTTCACC GGCGTCACTA CCATCCCAGC AACGCTTACT TTTATACCTA CGGCGACATT
CCCCTGCAGG AGCACCTGGC CTATATCAAC GACCACGCGC TTTGCCGGTT TTCCCGTATC
GACCCCGATA CGCAGGTGGC GTCCCAGCCC CGGTGGCAGG TCCCCCGGCA AGTGGCCCAC
GCCTATCCGC TGGCGCCGGA CGAAGACCCG GCAAAAAAAT ACCAGGCCTG CGTGGCCTGG
CTCATGGCCG ATGTGCAGGA GAGCTTTGAC GTGTTCGTCC TGGTATTGCT GGAGCAGGTA
CTGCTGGGCA ATCCCGGGGC ACCCCTGTAC AAGGCCCTGA TCGATTCCGG CCTGGGAAGC
GCCCTTTGCG ATGGCACGGG ATACGACCCG GACATGAAGG ACACCCTGTT TGCCTGCGGC
CTGAAGGACG TGGGAAAAGA CGACGCCGAA GCCGTGGAAA AGATCATTGT CGACACCCTG
ACCGATGTGG CGGACCGGGG TATTGATCCG GAGCTGGTGG AATCGGCCAT TCACCAGATT
GAGTTTCACC GCCGGGAGAT ATCCAACACG CCCTATCCCT ACGGCATCAA GCTGTTGCTG
ACCCTTTGCG GCAGCTGGCT TCACGGCACC GATCCCGCCG AGATCATTCA ACTTGACCCC
TACCTGGAAC GCCTGACCGG GGAAACCGGC CGGGGACCGT TTCTGGAAAA TTCGATCCGG
CGGTGGTTTT TAAACAACCC CCACCGGGTG CGTTTTACCC TGGAACCGGA CATGGAAATG
GGGGCCAGGG AACAGGCCGA AGAAGAGCGG GAACTGGCCC GTGTCGCCGC GTCCCTCTCC
CCTGAAGCAC TGGACAAGAT TCAACAGGAT GCCCGGGAAC TGGACGCCCT TCAGATGACC
GATGAGGACC TTACGGTTCT GCCCACCCTG ACGCTTTCCG ACATTGACGC GTCGGTCCGG
ACCGTGGCCC CGGTCATGGC GGCCGAGCCC CTGCGCTGTT ATGACCAGCC CACCTCCGGT
ATCCTATATT ATACGTCGGC TGTGGGCATC GACCGGCTTT CGCCGGACCT GCTGCCCCTG
GTCCCCTTCT TCTGCGCGGC ACTTCCCCGC ATGGGCACAC GCCGGCACGA TTATGTGGCC
CTGGAGCGGC TGATCGACAT GCACACCGGT GGCCTTGGCC TGTCGGCCCA GGCCCGGACC
CGGTACGGGG AAACGGGCGA ATGTATTCCC TACATCTCGT TTTCCGGAAA GTGCCTGGAC
CGGAAGATCG AGCAAATGTT CGATCTTGTG CGGGAACTGC TGTGCGACTA CAGTTTTGCC
GACCATCAGC GGCTTGGCCA GCTGGTGGCC GAGTACCGGG CCCACATGGA ATCGGCCGTG
GTCCACAACG GCCACCGGTA CGCCATCTCC CTGGCGTCAC GGCACGTTTC TTTTGCCAGC
CATCTTTCCG AGATGTGGCA CGGCATTGGC CAGCTGCAAT ACTTCAAGTC CCTGACCGCC
GACCTGGAAG GGCCCGCCCT GGCCGCCATT GCAGACAGGC TTTGCCTGAT TGGACGGAAC
CTCTTTTCAA AAGAAAACCT TCAAGTCGGC CTGGTCGGCC ATGGAAAGGG TCTGGACACG
GCCTCCGGTC TGGCCCGGGC CATGGTGGAA AACCTGGGAG CCGGCGCGAT GCCGGCTGAA
TTCAGGGGGC AGGCCATTGA ACATGACACG CAGCCGCCCA GGGAGGGATG GTACACTTCC
ACGGCAGTGT CCTTTGTGGC ATCGGTTTTT CCCACCATTC GCATGGACCA CGAAGACGCG
CCCGTGCTGG CCGTGATCAG CAAACTGCTG CGGTCCACGT TTCTTCACCG GGAGATACGG
GAAAAGGGCG GTGCCTACGG CGGGTTTGCC CTGTATAACC CGGAAGACGG CCGGTTCTGT
TTTGCCTCCT ACCGGGACCC CCACATTCGG GCCACGCTGG AGGTCTACAC ACGGGCCGTG
GCCTATATTC AGTCCGGTGA CTACACAGAC GAAGAGATCA CCGAGTCCGT GCTTCAGGTC
TGCTCGGACA TCGACAAGCC CGACACCCCG GCCGAAGCCG CTACCCGGGA TTTTTACCGA
AAGCTTGTCG GCGTTACCGA TACCTGCCGG CAGCGGTTCA AGGAAGGGGT GCTGACCGTG
ACCAGGGAAA AGGTCAGGGC CGTGGCCCTG CGCCATTTTC CCACCGGACA GGAGCACTGT
GGAACGGTGG TGATCTCTTC GGAGGCGCTG CTGAAAAAGG CCAATACCCG GCTGGCTTCC
CCCCTGGCGC TTCACCACAT CTGA
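
The record reports a GC content of 61% for this gene. As a minimal sketch of how such a figure can be derived from the sequence text, the helper below strips the spaces and line breaks used for grouping and returns the fraction of G and C bases. The function name is hypothetical, and whether the database rounds or truncates its percentage is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases; whitespace used for grouping is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence, pasted with its grouping spaces intact.
first_row = "ATGACAAACG GCGCCGGTCT TGACGTCGGC TCACGGGTAA GCGGTTACCG GGTAAAACAG"
print(f"{gc_content(first_row):.1%}")
```

Pasting the full gene sequence in place of `first_row` gives the value to compare against the reported 61%.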
 
Protein sequence
MTNGAGLDVG SRVSGYRVKQ VIAFDDIQSL VYELVHEATG AQHLHIANAD RENTFGVGFK 
TVPRDSTGVA HILEHTILCG SEKYPVRDPF FSMIRRSLNS FMNAFTASDW TLYPFASPNK
KDFYNLADVY LDAVFFPLLT ELAFKQEGHR LEVVKAAEGQ GAPELAYKGV VYNEMKGAMS
SPDQVMSRAL TAALCPDTTY SNNSGGDPAV IPTLTWEQLR DFHRRHYHPS NAYFYTYGDI
PLQEHLAYIN DHALCRFSRI DPDTQVASQP RWQVPRQVAH AYPLAPDEDP AKKYQACVAW
LMADVQESFD VFVLVLLEQV LLGNPGAPLY KALIDSGLGS ALCDGTGYDP DMKDTLFACG
LKDVGKDDAE AVEKIIVDTL TDVADRGIDP ELVESAIHQI EFHRREISNT PYPYGIKLLL
TLCGSWLHGT DPAEIIQLDP YLERLTGETG RGPFLENSIR RWFLNNPHRV RFTLEPDMEM
GAREQAEEER ELARVAASLS PEALDKIQQD ARELDALQMT DEDLTVLPTL TLSDIDASVR
TVAPVMAAEP LRCYDQPTSG ILYYTSAVGI DRLSPDLLPL VPFFCAALPR MGTRRHDYVA
LERLIDMHTG GLGLSAQART RYGETGECIP YISFSGKCLD RKIEQMFDLV RELLCDYSFA
DHQRLGQLVA EYRAHMESAV VHNGHRYAIS LASRHVSFAS HLSEMWHGIG QLQYFKSLTA
DLEGPALAAI ADRLCLIGRN LFSKENLQVG LVGHGKGLDT ASGLARAMVE NLGAGAMPAE
FRGQAIEHDT QPPREGWYTS TAVSFVASVF PTIRMDHEDA PVLAVISKLL RSTFLHREIR
EKGGAYGGFA LYNPEDGRFC FASYRDPHIR ATLEVYTRAV AYIQSGDYTD EEITESVLQV
CSDIDKPDTP AEAATRDFYR KLVGVTDTCR QRFKEGVLTV TREKVRAVAL RHFPTGQEHC
GTVVISSEAL LKKANTRLAS PLALHHI
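
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the gene information. The sketch below builds the codon table in the conventional TCAG order and translates up to the first stop codon. It is an illustration under stated assumptions, not the database's own pipeline: the function name is hypothetical, and the alternative start codons that table 11 permits are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# codon-to-residue assignments as the standard code ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the last 15 bases of the gene sequence above.
print(translate("ATGACAAACG GCGCCGGTCT TGACGTCGGC"))  # MTNGAGLDVG
print(translate("CTTCACCACA TCTGA"))                  # LHHI
```

Both outputs match the start and end of the protein sequence listed above.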