Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ddes_1798 |
Symbol | |
ID | 7285511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 |
Kingdom | Bacteria |
Replicon accession | NC_011883 |
Strand | + |
Start bp | 2170805 |
End bp | 2173618 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643582618 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_002480373 |
Protein GI | 220905061 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCATC CACCACTGCA AGCCCCGCTG TTATGGCAGC TCTGCCTTTG CTTCTGGATA GGAGGCATAG CTGCGGCCGT ATGGCCCTGG CAGTCTTTGC TGTGCCTGCT GCTGCTTGTT CTCGCAGACC GCCGCCTGTG GAAAGCGGGG CGCATTGCCT TCTGTACCCT GCTGGTCCTG GCCGGGCTGT TCACAGCTCG ATGGCAACTT TACGGCAGCC CCTGGCACGG CGGTCTTCCT CCAGCTTCTC CACCCTTCTG GCTGGCTGAA GCCCCCAAAG ATATCCGCGT CTGCGGCACA GTGACCGACA GCCAGGGCCT GCCGGACCAA CGCCTGCGCC TGATTCTGTC TCACATGGCT CCTGACGCGA CAGCTGCGGC AGACGATGCG TCGCGCCGCG CAGACCTGCG AGCAGCCTTT GCCGCGCCCT TGCCCGGCCT TGTGGCCTTT ACCTGGGAAA ATCCCACACT GCGCCCGTTA CCCGGCCAGA CATTTTGCCT GGCACGCCGC CCCATGCCCA TGCAGGGCTT CGCCAATGAG GGACAATCGG CCTGGGATCT GTGGTGGGCC GCGCAGGACG TACGCTGGCG TCTATGGACG CAGGCGGAAC GCGGCGAGCC GTTCGTCTCT GGCGAAGGGG GGCGTTCCGC TCGCTGGCGA GAGTCCCTGC GCGAAAAATT CCTGGCAGCC CTGCAAGCTC CTGCGCAACC GGGCGATCTT CTTCCTGCGG CCCGGAACGG CAGCCCCACG ACGCATGGAG GGCATACAAC AGCCTCGGGG CAGATAAAAG AACTCTCCGG CGGGCATGCC GCATCCGGTG GCGAAATGCC GCCCGGCCAG AAAAGCCTGA GCCAGGGCAA AGCCATTGTA CTGGCCCTGC TTTTTGGCGA CAGGCAATAC CTCGCCCAGG CGACCCTGAA CAATTTTGCC GCTGCCACCC TGGTGCACAG CCTGGCCCTT TCGGGCCAGC ACCTTGTGGT AGCAGGCATG ATCGGGCTGT TCTGCGTACT GGCGGCATCA CGCCTGCGGC CGGGAATCTA TCTGCGCCGC CCCAAAGTCC TGTGGGTCAT GCTCGCCTCT CTGCCGCCCG CCCTGGGGTA TCTGTGGCTG GGCAACGCCC CGGCATCCCT GTTGCGCGCC ACAGTCATGA TGCTTGTGGC GGCCCTGTGG ATGGCTTCCG GCCGTCCGCG CACAACCCTG GACGCACTGG GCGCGGCACT TCTTTGTATC ACGGTATCCT TTCCCCTGAG CGTGCTGGAT ACCGGACTGC AGCTTTCAGC CCTGTGTGTG GGGGCCATCG GCCTGAGCCT GCCCTGGTTG CGGCGCATCA TTCCCCTGCC GGACCGTGGT GCGCCCGACT CGCCTTCGCG CGTCACGGCG TGGCTGCGCG CTCTTATGCG CATTTTTCTT GTTTCACTGT TCATACAGAT CGCCCTGCTA CCGCTCAATA TGCTGCTTTT CGGCAACGCG GGCTTCTGGT TTCCCCTTAA TGTGGCATGG CTGCCCGTAG CCGACCTGAT TGTACTGCCC GCAGCCGTAC TGGGCCTGCT GTGCGCGGCC CTTGGCCTTG AATCACCGGC GCGCCTGATT CTGGATGTGG CCGCCATGCC CTGCCAGTGG CTGGTGGACT CTCTGGCCTG GCTGGCCGAA AGTGGCCTGC TGGCTGCCCC GGCCATGCTG CGCCCCCACT GGACGGCCCT GCCGGCCTTT GCTGCCTTGC TTACGGCTCT TGCTTTCATG GCCGGGCGAA AAACCATGCC TGCGGGAGCA GCGCGACTGC TGCTGGCAGG GGGCGCGTTA TTGCTGGCAG GGCCGCTGCT GCGCGTGGAA CAGCGCCTTT CACCGGAAAT ACGGCTGGAG GTGCTGGATG TGGGGCAAAG CCAGGCCGTG GCCCTGCGTT TGCCGGGACA TGTGCGCCTG CTGCTGGACG GCGGCGGCAG TGCCTCACCG CGCTTTGATC CCGGCCAGGC GCTGGTGGCT CCAGTCCTCC TCTATAATGA TTCGCCCCGC CTTTCAGCCG TACTGAACAG CCATCCCGAC CTTGACCATA TGGGGGGACT GGTACATATT CTTGAGTATT TCAAGGTAAA CACCCTGTTT GACAACGGGC AGGACGGACG CAGCGACAGC GGCTGGGGCG CCCTGTGGAC ACAGGTCAGG CAACAGCACA AGGCCCGCCC CCTTGCACAG GGCGATGTGC TGCAACTGGG CGACCCGATC CGCCAGCTCC AGCTGGAAAT ACTGCACCCA CCGCGCAGCG CCCGTGCTGA AGCACATTCG CCGTGGACCG GAAACAATGC TTCGCTGGTG GCCCGTCTGA CCCACAAGGG CCGCGGCCTG GCCCTGATCC CCGGCGATGC CGAACGCCGT TCATTGCGGC ACATGCTGGA TCAGGGCATT GACCTGCGTG CCGAGGTTCT GGTTTTGCCG CACCACGGAT CAGACAGCAG CTATCTGGCT GATTTTTACA AGGCGGTGCA GCCCCGTGTG GCGGTGGCTG CCTGCGGCTT TGAAAACCGC TATGGCTACC CCGGTAAAAA AGTGCGCGCA TGGCTGGACA AAGCAGGCAT TCCCCTGCTG TTCACCGGGC GTGACGGGCA GGTGCAGCTG ACCTGGCCTG GACGTGGCAT GCAGTGGCCC CCTACCGACG CTGAAGAACG CGCGGTGCTG CATCTACGGA CACAAAGAGG CACTGCCGGG GTAAACAGTG CCGCGCCGGG CGGCACCAGG CAGGACCATG AAAAAATGCC CGCCAAAGAG CATAAAGAGG AAGGCGAACT ACGCGGCAAG GCTGATAGCG GACAGACCGA ATAG
|
Protein sequence | MRHPPLQAPL LWQLCLCFWI GGIAAAVWPW QSLLCLLLLV LADRRLWKAG RIAFCTLLVL AGLFTARWQL YGSPWHGGLP PASPPFWLAE APKDIRVCGT VTDSQGLPDQ RLRLILSHMA PDATAAADDA SRRADLRAAF AAPLPGLVAF TWENPTLRPL PGQTFCLARR PMPMQGFANE GQSAWDLWWA AQDVRWRLWT QAERGEPFVS GEGGRSARWR ESLREKFLAA LQAPAQPGDL LPAARNGSPT THGGHTTASG QIKELSGGHA ASGGEMPPGQ KSLSQGKAIV LALLFGDRQY LAQATLNNFA AATLVHSLAL SGQHLVVAGM IGLFCVLAAS RLRPGIYLRR PKVLWVMLAS LPPALGYLWL GNAPASLLRA TVMMLVAALW MASGRPRTTL DALGAALLCI TVSFPLSVLD TGLQLSALCV GAIGLSLPWL RRIIPLPDRG APDSPSRVTA WLRALMRIFL VSLFIQIALL PLNMLLFGNA GFWFPLNVAW LPVADLIVLP AAVLGLLCAA LGLESPARLI LDVAAMPCQW LVDSLAWLAE SGLLAAPAML RPHWTALPAF AALLTALAFM AGRKTMPAGA ARLLLAGGAL LLAGPLLRVE QRLSPEIRLE VLDVGQSQAV ALRLPGHVRL LLDGGGSASP RFDPGQALVA PVLLYNDSPR LSAVLNSHPD LDHMGGLVHI LEYFKVNTLF DNGQDGRSDS GWGALWTQVR QQHKARPLAQ GDVLQLGDPI RQLQLEILHP PRSARAEAHS PWTGNNASLV ARLTHKGRGL ALIPGDAERR SLRHMLDQGI DLRAEVLVLP HHGSDSSYLA DFYKAVQPRV AVAACGFENR YGYPGKKVRA WLDKAGIPLL FTGRDGQVQL TWPGRGMQWP PTDAEERAVL HLRTQRGTAG VNSAAPGGTR QDHEKMPAKE HKEEGELRGK ADSGQTE
|
| |