Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_0547 |
Symbol | |
ID | 3756494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 561796 |
End bp | 564498 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637781408 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_387043 |
Protein GI | 78355594 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGCA GCACACCCCC GCAAACCGCA CAGGGCGGTC CCCCCGCCGG TTACAACCGC AGTGCCGCCC TGCTGCCGCA CGGCGCGCCA CCTGCACTGT TGTGGCGTCA ATGGTGCCTG CTGGCTGTGC TGGCCGGTCT GGCGGCAGGA GAAACGGGCC TGCGGGACGC TGCCCCGGCG TGGGGGGCGG CAGTGCTGGT ATTTATGGTG CTGTGGTCCG GTCGTCAGAA TGCGGGCAGG CCTGCCGCGG TTTTGCGTGC GCTGGTGCTG GGGGTATGCA TGGCTGCCGG TGTCTGCTTT ATGCTGCTGC ACCGCCAGCC CGCTGCGGAA GTGCCCGGCT GGATGACCGC GCAGGAAAAG GTGCGGGTGC AGGGGGTGGT GCAGATTGTG GAAAGCCGTC CTGAAGGACG GCTGCGGATG GTGCTGGCAG ATGCGCGCTG CCGCACGGCG CAAGGCTGGC AGACGTTGAA GGCTGATGTG GTCTGGACAT GGCAGTATCC TGTTTTTCGG CCCCGCAGCG GCAGTACAGT GGAATTTGAA GGCAGAGTGA AGCCGGTACG CGGTTTTCTG AACCGCGGAA CATGGGACAG CGGAAGCTTC TGGCTGCGCA ACGGGGTAGG GTACAGAGTC TGGACACGGG CAGAGCAGGG ACAGGTGCAT GAGGTGCCTG CCGGTAAGCC CGCAGCGGAG TCAGCCTTGT TCCAGAGTGA TGCGCTGCAG TCCGCAGAGG TGATGCGGGG CCTGCGCAGT CTTGTGCTGC AGCGGGTGGA ACGGGTGCTG GCGGGCATGT CTGACCCGAT ACCGGCAGAT GCGGACGGGC AGGGGGCTTT GCGGCAGATG CCGCGCTGGC GTGACCTGCC TCAGCGGTAT GCCGTGCTGC CAGCTCTGCT GCTCGGAGAG CGTTTTTACC TTTCACAGCA GACGATGGAC AGGCTGGCCG ATGCGGGGTT GATGCACAGC TTTGCTCTTT CCGGCATGCA TCTGGGGCTG GCGGCAGCGC TGGGTGCGTT GGCGGCGCTG CTGGCCGGCA GAATTTTTCC CGGACTGTAC CTTGTGGTGC CGCGGCAGCA GCTGGCCGTA CTGTGTGCCG CACCGCCGGT GCTCGGGTAT GTCTGGCTGG GCGGCGCCAC GCCTTCACTG CTGCGGGCGG CGCTGATGTT TGCCTTCTGG GGCGGGTTGC TGCTGGCCGG CAGGCGCGGA GTACTGACAG ACGGGCTGTT GTGGGCCGTG GCGGTAGTGG TTGCATGGCA GCCGCAGGCT CTGTTTGATC TGCGGCTGCA GCTTTCTGCC GTGGCCGTGG CGGGCATTGC CTGCACCATG CCTGTGTACC GTGCTGCGGC GGGCAGGCTC GGCACCAAGC ATGGCACGGC TGCCGGTGCT GTCATGCGCC TGACGGGGGG GCTTGCCGGA GTGCTGGCTG TAAGCATGGC TGCCCAGCTG GCCCTTATGC CTCTCACGCT GGATGCGTTC GGCAGTGTGA CGCCGTGGTT TGTGCTTAAT GTCCTGTGGC TGCCCGCGCT GGGGCTGTGG GTGCTGCCGC TGGCCTTTGC CGGGCTTGTG TCGCTGGCCC TGCCGCAGGC TGCCCCTGTT GCCGTGTGGC TGTTCCATGT TGCCACAGTG CCGGTGGAAT GGTTGCTGGA GATGCTGGGC ACACTGGAGG CCGGCGGACT GCTTTATCCG GTGTTGTCTG TAAGACCGCT GCCCTGCGCT GCTGCGGGGT TCTGGATGCT GTTGGGTACG GCATGCCTGC TGTGGCGGGG AACCGCCGGT CCTGCCGTGC AGCAGAGCGG CCGCAGCGTT CTGCTGCCGT ATGGTCTGCG CGCTGTCTGC GTGACGGTGA CTGCAGGGGT GGTGCTGCTT GCCGCGCCGG TGTGTCTGCG TTTTTCCGCG TACCTGCAGG ATGATGTTTC TGTTTCCGTG CTGGATGTGG GGCAGGGGCA GGCCGTGCTG GTGGAGGCGC CGCACGGAGT GCGCGTTCTT ATTGACGGCG GGGGGTTTCC TTCGTCTTCT TTTGACACGG GCAAGGCACT GGTGGCACCC GTGCTGACGT ATAACAGACC GCCGGTGCTT TCCGCCGTGG TCAACACCCA TCCGGACAGT GATCATCTGG GCGGCTTGCC GTTTATTCTG CAGTCTTTTG ACGTGGGCGC CTTTTATACC AACGGCGAGC TGCCGCAGGA GGGGGCGCAT GCGGCTGCTC TGGAACGGGC ATGGCGGGCC GGTGCTCCCG TGCCTGCGGT TCTGGCTGCC GGTGATTCGC TGTTGCTGGG CAGCACCGCG GAACTGCAGG TGCTTGCACC GGAGGCCGGA CGCATGACGG GCAATACCAA TGACAATTCG CTGATTCTGC GGTTGACGCG CGGTGGCAGA GGGCTGGCGC TGGTGCCCGC CGATGCGGGT ACCGTAGTGC TTGACCGGCT GGCGCAGGCG GCGCGGCGGC AGAATACTCC GGTGGAAGCC GCGTTGCTTG TGGTGCCGCA TCACGGCAGC GGAAACAGCC TCTCTCCCCT TCTGTACGAT GCGGTTGCGC CCTCTCTTGC AGTGGCTTCG TGCGGATACA TGAACTACTG GCGGTTTCCG CGTCCTGAAG TGCGCGGGGC GCTGGAGGCG CGTGGAATTC CGCTGCTTAC AACGTCCGGA AGCGGACAGA TTACCGTGTG CTGGCCGGAC AGCGGACATA TGCAGGTACG CAGTGTGCGC TGA
|
Protein sequence | MTRSTPPQTA QGGPPAGYNR SAALLPHGAP PALLWRQWCL LAVLAGLAAG ETGLRDAAPA WGAAVLVFMV LWSGRQNAGR PAAVLRALVL GVCMAAGVCF MLLHRQPAAE VPGWMTAQEK VRVQGVVQIV ESRPEGRLRM VLADARCRTA QGWQTLKADV VWTWQYPVFR PRSGSTVEFE GRVKPVRGFL NRGTWDSGSF WLRNGVGYRV WTRAEQGQVH EVPAGKPAAE SALFQSDALQ SAEVMRGLRS LVLQRVERVL AGMSDPIPAD ADGQGALRQM PRWRDLPQRY AVLPALLLGE RFYLSQQTMD RLADAGLMHS FALSGMHLGL AAALGALAAL LAGRIFPGLY LVVPRQQLAV LCAAPPVLGY VWLGGATPSL LRAALMFAFW GGLLLAGRRG VLTDGLLWAV AVVVAWQPQA LFDLRLQLSA VAVAGIACTM PVYRAAAGRL GTKHGTAAGA VMRLTGGLAG VLAVSMAAQL ALMPLTLDAF GSVTPWFVLN VLWLPALGLW VLPLAFAGLV SLALPQAAPV AVWLFHVATV PVEWLLEMLG TLEAGGLLYP VLSVRPLPCA AAGFWMLLGT ACLLWRGTAG PAVQQSGRSV LLPYGLRAVC VTVTAGVVLL AAPVCLRFSA YLQDDVSVSV LDVGQGQAVL VEAPHGVRVL IDGGGFPSSS FDTGKALVAP VLTYNRPPVL SAVVNTHPDS DHLGGLPFIL QSFDVGAFYT NGELPQEGAH AAALERAWRA GAPVPAVLAA GDSLLLGSTA ELQVLAPEAG RMTGNTNDNS LILRLTRGGR GLALVPADAG TVVLDRLAQA ARRQNTPVEA ALLVVPHHGS GNSLSPLLYD AVAPSLAVAS CGYMNYWRFP RPEVRGALEA RGIPLLTTSG SGQITVCWPD SGHMQVRSVR
|
| |