Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_3044 |
Symbol | |
ID | 5695904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 3650358 |
End bp | 3652298 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641265661 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_001530924 |
Protein GI | 158523054 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000897526 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGGAGG ATTCTCCATT GAATCAGTTT TCGAAAAACA TGGCGCTGTG GCTGGTCATC ATTCTGGTGA TGGTCATGCT TTACAACATG TTCAATCAGC AGCAGAACCT GGAGCGCACC ATCGGCTACA GTGATTTTCT CTACATGGTG GACAACGGCA AGGTCAAGGA TGTGCTGATT CAGGGCCAGA CCCTGCGGGC CACCAGTGAC ACCGGTGAGC GGTTCAGTGT TTACATGCCG GAAGATCAGG ACCTGATCCC CACCCTGCGG GCCAAGGGCA TCGCCATTGA GGCCAAGCCC CCGGCCGAAG CCCCGTGGTT TATCACGGCC CTGATCTCCT GGCTGCCCAT GATCCTGCTG ATCGGCATAT GGATCTATTT CATGCGACAG ATGCAGTCCG GCGGCGGCAA GGCCCTCTCC TTCGGCAAAA GCCGGGCCCG GCTCCTCTCC GAAGGGGCCA ACAAGGTCAC CTTTGCCGAC GTGGCCGGCA TCGACGAAGT CAAAGAAGAA GTGGGCGAAA TCATCGAGTT TCTGCGGGAC CCCCAGAAGT TCACCCGGCT GGGGGGCCGT ATTCCCAAGG GCGTGCTGCT GGTGGGCGCG CCGGGCACGG GCAAGACCCT GCTGGCAAGG GCCATTGCCG GCGAGGCCGG AGTCCCCTTT TTCAGCATCA GCGGCTCGGA TTTTGTCGAG ATGTTCGTGG GCGTGGGTGC CTCCCGGGTC CGGGACCTGT TTCTTCAGGG CAAAAAGAAC GCGCCCTGCA TCATCTATAT TGATGAAATC GATGCCGTGG GTCGCCACCG CGGGGCCGGC CTGGGCGGCG GGCACGACGA GCGGGAGCAG ACCTTGAACC AGCTGCTGGT GGAGATGGAC GGGTTTGAGT CCAACGAGGG CGTGATCCTG ATCTCGGCCA CCAACCGTCC GGACGTGCTG GACCCGGCCC TGCTGCGGCC GGGCCGCTTC GACCGGCAGG TGGTGGTGCC CCTGCCCGAC ATTCGGGGCC GCCGGGCCAT CCTGGACGTC TATATCAAAA AAATTCCCGC GGCCGACGAC GTCAAGGTCA ACAACCTGGC CAAGGGAACC CCCGGTTTTT CCGGCGCCGA CCTGGAGAAC CTGGTCAACG AGGCGGCCCT GTTTGCCGCC AAGCGCAACA AGGAAAAGGT TGAAATGGTG GACTTTGAAG ACGCCAAGGA CAAGGTGTAC ATGGGCCTTG AGCGCAAGTC CAAGGTGATC AAGGAAGAAG ACAAGAAGAT GACCGCCTAC CACGAAGGGG GCCATGCCAT CGTGGCCCGG CTCCTTCCCG ACACCGACAC GGTCAACAAG ATCACCATCA TTCCAAGGGG GCGGGCCGCC GGCGTCACAT GGTTTCTGCC CGAAGAGCGG GACTTTCGGT TCAAGGACCA GCTGGAAAGC GAGCTGGCCA TCTCTTTCGG CGGCCGCATC GCCGAGGAGA TCATCTTCAA CCGGATCAGC ACGGGCGCGG CCAACGACAT CAAGCAGGCC ACGGCCCTGG CCCAGAAGAT GGTGCGGGAG TGGGGCATGA GCGAAAATCT GGGCCTGCTC TCCTACTCGG CCAACGAGGA GCAGATATTT CTGGGCCGGG AGATATCCCA GCACCGGGAC TACTCGGAAG ACACGGCCCG GCGCATCGAC GCGGAGGTGG AGCGGATCAT CAAGTCCGCC TATGACACGG CCCGGCGCCT GCTGAAAGCG AATGTGGATA TCCTGCACGC GCTGGCAGAC CTGCTGATCG AAAAAGAGAC CGTGCTGGGG CCGGAACTGG ATGAGCTGAT CCGGTCCCTG CGGCCGGACA TCGAGCTTTC CGTGCCGTCG GACGACGACA TGCCGGAGGA AAAGCCGGCG AAGCCGTCCG AACCGGCGGA GAAGGCCCCG GAGGCTACCG GCACCGAACC AGAAGAACCG GAAGAAGAGG ACAAAACATA A
|
Protein sequence | MLEDSPLNQF SKNMALWLVI ILVMVMLYNM FNQQQNLERT IGYSDFLYMV DNGKVKDVLI QGQTLRATSD TGERFSVYMP EDQDLIPTLR AKGIAIEAKP PAEAPWFITA LISWLPMILL IGIWIYFMRQ MQSGGGKALS FGKSRARLLS EGANKVTFAD VAGIDEVKEE VGEIIEFLRD PQKFTRLGGR IPKGVLLVGA PGTGKTLLAR AIAGEAGVPF FSISGSDFVE MFVGVGASRV RDLFLQGKKN APCIIYIDEI DAVGRHRGAG LGGGHDEREQ TLNQLLVEMD GFESNEGVIL ISATNRPDVL DPALLRPGRF DRQVVVPLPD IRGRRAILDV YIKKIPAADD VKVNNLAKGT PGFSGADLEN LVNEAALFAA KRNKEKVEMV DFEDAKDKVY MGLERKSKVI KEEDKKMTAY HEGGHAIVAR LLPDTDTVNK ITIIPRGRAA GVTWFLPEER DFRFKDQLES ELAISFGGRI AEEIIFNRIS TGAANDIKQA TALAQKMVRE WGMSENLGLL SYSANEEQIF LGREISQHRD YSEDTARRID AEVERIIKSA YDTARRLLKA NVDILHALAD LLIEKETVLG PELDELIRSL RPDIELSVPS DDDMPEEKPA KPSEPAEKAP EATGTEPEEP EEEDKT
|
| |