Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0132 |
Symbol | |
ID | 4663366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 159776 |
End bp | 162715 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639818327 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_965583 |
Protein GI | 120601183 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.807661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0603201 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTTG CCCGTACCCC TTCACTGCTT TTCCGACAGG TGTGCATGCT GGCGTGGGTC GGGGGGCTTT TTGCGGCGCG ACACCCCCTG CCGGCACTGT GTGCGTTCAC TCTGCTGCTG GCGGGAGACT GGCCCCGCGC CCGCGTCCCT GCACGGTTCG TGCTGCTGTG CCTGTGCTAT GCCGCCGGAT GGGGCGTGGC CCTCGCAGCC CTGCCCGAGA CGCCGTCCGC CCCCGCGTGG GTGACGGGCA AGGCACAGCG GGTGACGGGC ATCGTGGATG ATGTGGATGG CCTGCCGGAT GGCCGACTTC GCATCATGCT GCGCGACGTG CACCCCGTGT TTCCGGAAGG CGGTGCGTCT CCGGTGGTGA CCGCCGGGGA CGAAGGCGAC GGTAGCGAGG CTGAAGGCCT GCCTGTTGAC GTTGCAGAGG GTGAGGACAG GGCGGACAGC AGGCCCGCGA GGCCCACCGT GGCGGAGAAC GCGGCAGGAC GCGGCGAAGG GGCGGGACAG CCGGAGACCG CACGCGCAGG TTCGCACCTG CCACCCGAAG CCCGCGCCAT GGGCGGCGAG ACCGGGCGAG GCGTTCAGCC CGCCCCCCTC GCCCCCCCTG CCCCGCCTCT ACCCGGAAGG CTCGTCTGGA CATGGGAGCA CCCCGTGGCG CTTCCCCTCA CCGGGCAGAC GGTGACGGCG ACCCTCGCCG TGAAGCCGGT GCGTGGCTTC GCCAACCCCG GCGGTCAGGA CAGTGCCGCC TACTGGCAAC GCCGCGACGC GCATTTCAGG GCATGGGCCC GGGATGACAT GCCCCGTGCC GAGGTGACGG GCGCGCCGTC CGGCCCTGCG GCCCTGCGCG CATGGCTGCG GGAGAGGCTT GTCGATACGC TTGGCGGCCC GCAGGGCATC ACGCGCGGGG GCGGTGTGCT GCTGGCCATA CTCTTCGGCG ACAGGTTCCA CCTCGACAGC GCCATGCTCG ACCTCTTCGC CCGCACCGAC CTCCTGCACA GTCTTGCGCT TTCGGGGCAG CACCTTGCCG TGGCGGGCCT CTTCGCGGGG GCGGCGGTGT TGCTGGTCGG CCGGTTCACC CCCGGTGTCT TCCTGCACCT GCCGCGCCGC AAGCTGCTAT TCGTCCTTTC GTTGCCGCCC GCCGCAGCAT ACCTCTGGCT TGGGAATGCC CCGCCGTCTC TGGTGCGTGC CGCGCTCATG CTCCTTTTCT GGACGGTGCT GGCCCTTGCG GACAGGCCCG GCGTACTGCT GGACGGCCTC TTGTGGGCCG TGGGCTGCAT CCTCCTCTTC GACCCCGATG CGGTGTACGA CCTCGGTCTG CAACTCTCGG CCCTCGCCGT GGCGTCCATC GCCCTCAGCC TGCCCTTCGC GGCGTGGCTT CACGGTGGCG GTCATCATGG AGATTTCAGA CCGGGGACCG CGCCACCGCT GGCAGACGTG ACCGGAAGCA CGGGCACTTT GAGCACAGGG ACCTTGAGCA CAGGCACGGA GGGCATTTCC GATGCGGGAG GGCCAGACAC CGCTACGCTG CGCGATGGGG TGTGGCAGCG GGTGCGGCGT GCCCTGATGC TCATGGCCCT GACCACGCTG GCAGTGCAGG TGGCCTTGCT TCCCGTGCAA CTCATGGGCT TCGGCAGGGC CAGCCCGTGG TTCGCGCTCA ACCTGCTGTG GCTGCCCTTC GCCGACCTCG TGGTACTGCC TCTGGGGGCG TTGGGCCTTG TGTGTGAAGC CGCCGACCTC ACGCGCCCGC TGACGGGGCC GTTGCTCATG GTGGCCGCCC TGCCCTGTGA AGGGTTGATG TGGCTGCTGG AGTGGATGGA GGGGGCGGGG CTGCTGGCCG TGCCCGCCAT GCTGCGCCCG CACTGGACGG CGGCACTGGG GTACGGCGCG CTGGTGGTTG CTTTCGCTTC GTTGCCCGGT CGGCTGTTCC ATTTCCCTGC CCGTGGCAGG GCTTCGCATC ATGCCCCTCA TGGCGCGACA TCCGCGGGTA CTCGCTTGGC GGGGGCAAGG GCGGCTGCCC CGTGCCTGCC GCCGCTGGCG AGACGCCTGC TGCCCTTTGC CCTCGCCCTG CTTCTGGCGG GCCCGGTGCT GCGGCTGTAT GCCGCAACGG ACGGCACCGT GCGGGTCTCC GTGCTCGATG TGGGGCAGGG GCAGGCCATC GCCATCGACC TTCCGGGTGA CAGGCGGCTT CTGGTGGACG GCGGGGGCTT CAACTCCCGC CGTTTCGACG CCGGACGCGA CCTCGTGGCC CCCGCGCTGA CTGCCAATCG TTCGCCGCGC CTTGATATGG TGCTCAACAC CCATCCGGAC ACCGACCATC TGCGCGGTCT CATCCACATC CTCGACCGTT TCGCCGTGGA CGCGTTCGCC ACCAATGGCG ACGCACCGCG CGGTCTCAAT GCCCGCGACC TTTCCCGGGT GCTGGCCCGC ACCGGCATGG AGGCGACCCC GATGTACGCT GGTGAGGTGC TGCCCCTCGG TGACGGACTG GGGTTGCGTG TGCTGCACCC GCCGCAGAAG CATCGCGGGT CGAGCAACAA CAAGGCACTG GTGTTGCGTC TTGAGCGAGA TGGCAGGGGG CTGGCGGTGT TGTGCGGTGA TGCCGAGGCC CCGGCCCTGC GAGACATCCT GCGCAGTGGC GCACCGCTGA AGGCGGAGGT GCTGGTGCTG CCGCACCACG GGTCTGCGTC GAGTCTGCTG CCCGCCTTCT ATGATGCGGT GGCCCCGCGC CTTGCCATCG CCAGTTGCGG TGTGGACAAC AGGTATGGCT TTCCCGCCAC AGGCGTGCGG GCTGCGCTGG CTGAACGTGG CGTGACGCTA CGCACCACGG GGGAGGCCGG GTGCATCATG CTGGGATGGG ATGACGGCGG TCGAGGCCCG TTGACGTTGG ATACGAGTCG CAACAGGGGG GCTGCGGACA CGTCGGCCTT CGGGGAGTGA
|
Protein sequence | MNLARTPSLL FRQVCMLAWV GGLFAARHPL PALCAFTLLL AGDWPRARVP ARFVLLCLCY AAGWGVALAA LPETPSAPAW VTGKAQRVTG IVDDVDGLPD GRLRIMLRDV HPVFPEGGAS PVVTAGDEGD GSEAEGLPVD VAEGEDRADS RPARPTVAEN AAGRGEGAGQ PETARAGSHL PPEARAMGGE TGRGVQPAPL APPAPPLPGR LVWTWEHPVA LPLTGQTVTA TLAVKPVRGF ANPGGQDSAA YWQRRDAHFR AWARDDMPRA EVTGAPSGPA ALRAWLRERL VDTLGGPQGI TRGGGVLLAI LFGDRFHLDS AMLDLFARTD LLHSLALSGQ HLAVAGLFAG AAVLLVGRFT PGVFLHLPRR KLLFVLSLPP AAAYLWLGNA PPSLVRAALM LLFWTVLALA DRPGVLLDGL LWAVGCILLF DPDAVYDLGL QLSALAVASI ALSLPFAAWL HGGGHHGDFR PGTAPPLADV TGSTGTLSTG TLSTGTEGIS DAGGPDTATL RDGVWQRVRR ALMLMALTTL AVQVALLPVQ LMGFGRASPW FALNLLWLPF ADLVVLPLGA LGLVCEAADL TRPLTGPLLM VAALPCEGLM WLLEWMEGAG LLAVPAMLRP HWTAALGYGA LVVAFASLPG RLFHFPARGR ASHHAPHGAT SAGTRLAGAR AAAPCLPPLA RRLLPFALAL LLAGPVLRLY AATDGTVRVS VLDVGQGQAI AIDLPGDRRL LVDGGGFNSR RFDAGRDLVA PALTANRSPR LDMVLNTHPD TDHLRGLIHI LDRFAVDAFA TNGDAPRGLN ARDLSRVLAR TGMEATPMYA GEVLPLGDGL GLRVLHPPQK HRGSSNNKAL VLRLERDGRG LAVLCGDAEA PALRDILRSG APLKAEVLVL PHHGSASSLL PAFYDAVAPR LAIASCGVDN RYGFPATGVR AALAERGVTL RTTGEAGCIM LGWDDGGRGP LTLDTSRNRG AADTSAFGE
|
| |