Gene Dole_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_3044 
Symbol 
ID5695904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3650358 
End bp3652298 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content62% 
IMG OID641265661 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001530924 
Protein GI158523054 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000897526 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGGAGG ATTCTCCATT GAATCAGTTT TCGAAAAACA TGGCGCTGTG GCTGGTCATC 
ATTCTGGTGA TGGTCATGCT TTACAACATG TTCAATCAGC AGCAGAACCT GGAGCGCACC
ATCGGCTACA GTGATTTTCT CTACATGGTG GACAACGGCA AGGTCAAGGA TGTGCTGATT
CAGGGCCAGA CCCTGCGGGC CACCAGTGAC ACCGGTGAGC GGTTCAGTGT TTACATGCCG
GAAGATCAGG ACCTGATCCC CACCCTGCGG GCCAAGGGCA TCGCCATTGA GGCCAAGCCC
CCGGCCGAAG CCCCGTGGTT TATCACGGCC CTGATCTCCT GGCTGCCCAT GATCCTGCTG
ATCGGCATAT GGATCTATTT CATGCGACAG ATGCAGTCCG GCGGCGGCAA GGCCCTCTCC
TTCGGCAAAA GCCGGGCCCG GCTCCTCTCC GAAGGGGCCA ACAAGGTCAC CTTTGCCGAC
GTGGCCGGCA TCGACGAAGT CAAAGAAGAA GTGGGCGAAA TCATCGAGTT TCTGCGGGAC
CCCCAGAAGT TCACCCGGCT GGGGGGCCGT ATTCCCAAGG GCGTGCTGCT GGTGGGCGCG
CCGGGCACGG GCAAGACCCT GCTGGCAAGG GCCATTGCCG GCGAGGCCGG AGTCCCCTTT
TTCAGCATCA GCGGCTCGGA TTTTGTCGAG ATGTTCGTGG GCGTGGGTGC CTCCCGGGTC
CGGGACCTGT TTCTTCAGGG CAAAAAGAAC GCGCCCTGCA TCATCTATAT TGATGAAATC
GATGCCGTGG GTCGCCACCG CGGGGCCGGC CTGGGCGGCG GGCACGACGA GCGGGAGCAG
ACCTTGAACC AGCTGCTGGT GGAGATGGAC GGGTTTGAGT CCAACGAGGG CGTGATCCTG
ATCTCGGCCA CCAACCGTCC GGACGTGCTG GACCCGGCCC TGCTGCGGCC GGGCCGCTTC
GACCGGCAGG TGGTGGTGCC CCTGCCCGAC ATTCGGGGCC GCCGGGCCAT CCTGGACGTC
TATATCAAAA AAATTCCCGC GGCCGACGAC GTCAAGGTCA ACAACCTGGC CAAGGGAACC
CCCGGTTTTT CCGGCGCCGA CCTGGAGAAC CTGGTCAACG AGGCGGCCCT GTTTGCCGCC
AAGCGCAACA AGGAAAAGGT TGAAATGGTG GACTTTGAAG ACGCCAAGGA CAAGGTGTAC
ATGGGCCTTG AGCGCAAGTC CAAGGTGATC AAGGAAGAAG ACAAGAAGAT GACCGCCTAC
CACGAAGGGG GCCATGCCAT CGTGGCCCGG CTCCTTCCCG ACACCGACAC GGTCAACAAG
ATCACCATCA TTCCAAGGGG GCGGGCCGCC GGCGTCACAT GGTTTCTGCC CGAAGAGCGG
GACTTTCGGT TCAAGGACCA GCTGGAAAGC GAGCTGGCCA TCTCTTTCGG CGGCCGCATC
GCCGAGGAGA TCATCTTCAA CCGGATCAGC ACGGGCGCGG CCAACGACAT CAAGCAGGCC
ACGGCCCTGG CCCAGAAGAT GGTGCGGGAG TGGGGCATGA GCGAAAATCT GGGCCTGCTC
TCCTACTCGG CCAACGAGGA GCAGATATTT CTGGGCCGGG AGATATCCCA GCACCGGGAC
TACTCGGAAG ACACGGCCCG GCGCATCGAC GCGGAGGTGG AGCGGATCAT CAAGTCCGCC
TATGACACGG CCCGGCGCCT GCTGAAAGCG AATGTGGATA TCCTGCACGC GCTGGCAGAC
CTGCTGATCG AAAAAGAGAC CGTGCTGGGG CCGGAACTGG ATGAGCTGAT CCGGTCCCTG
CGGCCGGACA TCGAGCTTTC CGTGCCGTCG GACGACGACA TGCCGGAGGA AAAGCCGGCG
AAGCCGTCCG AACCGGCGGA GAAGGCCCCG GAGGCTACCG GCACCGAACC AGAAGAACCG
GAAGAAGAGG ACAAAACATA A
 
Protein sequence
MLEDSPLNQF SKNMALWLVI ILVMVMLYNM FNQQQNLERT IGYSDFLYMV DNGKVKDVLI 
QGQTLRATSD TGERFSVYMP EDQDLIPTLR AKGIAIEAKP PAEAPWFITA LISWLPMILL
IGIWIYFMRQ MQSGGGKALS FGKSRARLLS EGANKVTFAD VAGIDEVKEE VGEIIEFLRD
PQKFTRLGGR IPKGVLLVGA PGTGKTLLAR AIAGEAGVPF FSISGSDFVE MFVGVGASRV
RDLFLQGKKN APCIIYIDEI DAVGRHRGAG LGGGHDEREQ TLNQLLVEMD GFESNEGVIL
ISATNRPDVL DPALLRPGRF DRQVVVPLPD IRGRRAILDV YIKKIPAADD VKVNNLAKGT
PGFSGADLEN LVNEAALFAA KRNKEKVEMV DFEDAKDKVY MGLERKSKVI KEEDKKMTAY
HEGGHAIVAR LLPDTDTVNK ITIIPRGRAA GVTWFLPEER DFRFKDQLES ELAISFGGRI
AEEIIFNRIS TGAANDIKQA TALAQKMVRE WGMSENLGLL SYSANEEQIF LGREISQHRD
YSEDTARRID AEVERIIKSA YDTARRLLKA NVDILHALAD LLIEKETVLG PELDELIRSL
RPDIELSVPS DDDMPEEKPA KPSEPAEKAP EATGTEPEEP EEEDKT