Gene Dole_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1400 
Symbol 
ID5694235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1662364 
End bp1665240 
Gene Length2877 bp 
Protein Length958 aa 
Translation table11 
GC content63% 
IMG OID641263993 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001529281 
Protein GI158521411 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGAC GTTTGACAGT CCAACTGGCA GCCGTTTTGA TGTCAGTGCT TTTGTTCAGC 
GGCTGCGCGG CCTTGTCCGG CCTGCTGCCG GATTTTGCCT GCCCCTGCCG GGCGGACCGG
TTCCCCCACC AGGCCAGCGA CCTTGCGCCG GACCCGGCCC TGGTGTTCGG CACCCTTGAC
AACGGGTTTT CCTATGTGCT GATGGAAAAC CGGCGGCCCG AAGACCGGGT TTACATGCAC
CTGGTGACGG ACGCCGGCTC TTTTCATGAA ACCGACGACC AGCAGGGGCT GGCCCATTTT
CTGGAGCACA TGCTCTTCTG CGGCTCCACC CACTTTCCTC CGGGAGAGCT GATCCGCTAT
TTTCAGGAGA TCGGCATGCG GTTCGGCAAC GACGCCAATG CCCGGACCGG GTTTTTCCGC
ACCATCTACG ATCTGCACCT GCCTGCCGGG GATGAACAGA CCCTGCGCGA GGGGCTGGTC
GTGATGACCG ATTATGCCGA AGGGGCCCTG CTGCTGCCCG AGGAGATAGA CCGGGAGCGG
GAGGTGATCC TGGCGGAAAA GCGGACCCGC GATTCAGTGG CCTACCGGAC CTTTACCGCC
ACCTTGGCCT TTGAGATGGA GGGTGCCCGG ATCGTTGACC GCCTGCCCAT CGGCATTGAA
CCGGTGATCC AGGCCGCGGA CCGGGAGACG CTGAAAAACT TTTACGATGC CTGGTATCGG
CCCGAGCGGC TGGTGCTGGT GGTGGCCGGC GACATGGATA CCGCTGCTGC CGAAGCCCTG
GTGCGGGAGG CTTTTGGTGC AATGGAATCC AGGACCCCGG CGATGCCGGT GCCCGATTAC
GGCCGGGTGG CCCACCAGGG TGAAAAAGCC TTTTACCATT TTGAAAAAGA GGCGGGCGGC
ACCGAGGTGA CCATTGAAAC CATGGTGCAG GCCGAAGAGC CGCCGGATTC AAAGGCGCGA
GTCAGGGAGC GCTTTATCCG GGACACGGCC TTCTGGATAC TGGACAACCG GCTGGACGAC
CTGGCCGAAA CCGCGTCCCC GCCCTTTACG TCGGCCTCCA CCGCCGCGGG CCGGGCCTTT
GAACAGATCG ACTATGCTCA TGTTTCCGCC ACCTGCCCGC CGGAGACATG GCAGACCGCC
CTGACCGCCA TTGAGCAGGA GCTGCGCCGG GCCCTGGCCT ACGGGTTTAC CGCGGCCGAG
GTGGAGCAGG CCCGAAGGGA AATGCTCAAG GCTTTGGAGA TAGAGGTGCG GCAGTCCCCC
ACCCGGGAGA GTGGCGGTCT GGCCGGACGG ATCATTCACA GCCTGGCCGC AGGAAGGGTG
CTTCAGTCGC CGGAGCAGGT TCAGGACCTG CTGGCGCCGG TGGCTCAACG CTTTACCCCG
GCCATGCTGC ACAAGGCACT CAAAGCGGCC TGGGCCCCGA ATAACCGGCT GGTGCTGGTC
ACGGGCAACG CGGACCTGAC GCAGGTGGGG GCACCCCCGG AAGACCATAT CCTGTCGATA
ATGGCCCAAA GCCGGACACA GAAGGTGGCC CGGCCCGATG AAAGGGAAAT CCCTGTTTTC
CCCTACCTGC CGGTACCTGA AGCGCAGGGC AGGGAGGTCC GGCAGGTTTA TATCGACGAC
CTTGACATTC ACCAGGTCGA TTTTGAAAAC GGGGTAAGGC TGAACATTCG AAAAACCGAC
TTTGAGAAAG ACCAGGTAAT GGCGCGGGTC GGTTTTGGTG ATGGCGAGTC GTCGGAACCT
GTTGACAAAC CGGCCCTGGC CGAGTTTTCC GCCGCTGTGA TCAACGAGAG CGGTTTTGAG
AAAATGACCA ACGATGAGCT GCGCCGGGCA CTGGCCGGCG CCACCGCCGA CTTTTCCTTT
GCCGTGACCC AGGAGCGGTT TGAGCTTCGG GGGGGCTGCG CGTCGGACGA GGTGGAACTG
CTTTTTCAGC TGTTTTACAC ATTTTTCAAA GATTTCGGGT GCGGCTCCGA TGCCCGGCAA
CGCAGCCTGG AACAGCTGGC CCTCCACTAT AAACAGATGG GGCACACCAT TGACGGCGCC
ATGGCCCGCT TCGGGTATCG CTTTTTCGCC GGGGGTGACT CCCGGTTCGG CATGCCGCCC
GACTATGAAT CCTTTGTTGC GGTATCGGTT GAAGACATGC GCGCGTGGGT GGACGACGCC
CTGGCCGGGT CCCCGCCGGA GATCTCGGTT GTGGGAGACC TGGATGTGGA CCAGGTGGTT
GCTGTTGCGG CCCGCTATTT CGGCGCCATG GTATTTGAGG CCGATTCTTC GGGCCCGGAA
GGCCGCCAGG GGATGCCGCT CTTTCCCGTG GGAGAATCGC TGACGCTTCA GGTGCCCACG
GCCATTGAAA AGGCCCGGGT CCAGGTGGCC TGGCCCACGG ACGACTGCTG GAATATTTAC
GCCAACCGGC GGCTTTCGGT GCTGGCCGAC ATTCTTACCG ACCGCATGCG CACCACCATC
CGGGAAGAAA CCGGCCAGTC CTACTCCCAG TATGCCTATC ACGAGGCCTA TTGCGCCTAT
CCCGGTTACG GGGCACTGCA CGCGGTGGTA AACGTGGCAC CGGGCGAGGC CGAAACAGTG
GTGGACCAGG TGCGGGCCAT TGCCGCGGAT ATTCTGGAAA ACGGGGTGTC CGAAGACGAG
GTGACCCGGG CCGTGGACCC CACGCTGACC CACATTCGGG AGATGGTCAA ACAGAACCGG
TACTGGATGG GCAGCGTGCT GGTGGGCTCC AGGGAACACC CGGAACGGCT GGCCTGGGCC
CGGACCCTGA CCGATGATTA CGCGTCCATC GCGCCGGAGG AGGTGACCGC CCTGGCCCGT
CAGTACCTGT TGGACGACAG GGGCGCTGCC GTGGTCATTA TTCCGGAATC CTTTTGA
 
Protein sequence
MRRRLTVQLA AVLMSVLLFS GCAALSGLLP DFACPCRADR FPHQASDLAP DPALVFGTLD 
NGFSYVLMEN RRPEDRVYMH LVTDAGSFHE TDDQQGLAHF LEHMLFCGST HFPPGELIRY
FQEIGMRFGN DANARTGFFR TIYDLHLPAG DEQTLREGLV VMTDYAEGAL LLPEEIDRER
EVILAEKRTR DSVAYRTFTA TLAFEMEGAR IVDRLPIGIE PVIQAADRET LKNFYDAWYR
PERLVLVVAG DMDTAAAEAL VREAFGAMES RTPAMPVPDY GRVAHQGEKA FYHFEKEAGG
TEVTIETMVQ AEEPPDSKAR VRERFIRDTA FWILDNRLDD LAETASPPFT SASTAAGRAF
EQIDYAHVSA TCPPETWQTA LTAIEQELRR ALAYGFTAAE VEQARREMLK ALEIEVRQSP
TRESGGLAGR IIHSLAAGRV LQSPEQVQDL LAPVAQRFTP AMLHKALKAA WAPNNRLVLV
TGNADLTQVG APPEDHILSI MAQSRTQKVA RPDEREIPVF PYLPVPEAQG REVRQVYIDD
LDIHQVDFEN GVRLNIRKTD FEKDQVMARV GFGDGESSEP VDKPALAEFS AAVINESGFE
KMTNDELRRA LAGATADFSF AVTQERFELR GGCASDEVEL LFQLFYTFFK DFGCGSDARQ
RSLEQLALHY KQMGHTIDGA MARFGYRFFA GGDSRFGMPP DYESFVAVSV EDMRAWVDDA
LAGSPPEISV VGDLDVDQVV AVAARYFGAM VFEADSSGPE GRQGMPLFPV GESLTLQVPT
AIEKARVQVA WPTDDCWNIY ANRRLSVLAD ILTDRMRTTI REETGQSYSQ YAYHEAYCAY
PGYGALHAVV NVAPGEAETV VDQVRAIAAD ILENGVSEDE VTRAVDPTLT HIREMVKQNR
YWMGSVLVGS REHPERLAWA RTLTDDYASI APEEVTALAR QYLLDDRGAA VVIIPESF