Gene Information |
Locus tag | Dvul_1185 |
Symbol | |
ID | 4664897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1452184 |
End bp | 1455054 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639819417 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_966632 |
Protein GI | 120602232 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
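
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but that reading reproduces the listed gene length). The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1452184
end_bp = 1455054

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; the final codon is a stop codon
# and contributes no amino acid to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 2871
print(remainder)       # 0
print(protein_length)  # 956
```

Under that assumption the computed values match the listed gene length (2871 bp) and protein length (956 aa).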
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0243274 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00558343 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAAAC ATTGCATCCA CATCGAAGGG GCACGCCAGC ACAACCTCAA GAATGTCGAC ATCGACATCC CGCGCGATGA ACTGGTCGTC GTGTGCGGGC CTTCGGGGTC TGGCAAGTCC ACACTGGCCT TCGATATCGT GTACGCCGAG GGACAACGCC GCTATGTGGA GTCGCTCTCC GCCTATGCAC GTCAGTTCCT GCCGCAGATG GACAAGCCCG CCGTCGACAA GATTGAGGGC CTCTCTCCCG CCATATCCCT CGAACAGCAG ACCTCTACGC GAAACCCGCG CTCCACGGTG GGCACCGTCA CGGAGGTCTA CGACTTCTTG CGCGTGTTCT ATGCACGGCT TGGTCGCATG TACTGCCCGC AGTGCGGACG CCCCATCGAG GCGCGTGCTG CGGATGAGAT CATCGCAGAC ATCCTCGCGT TGGGCGAGGG TACGCGATGC ATCATCATGG CACCGCTGGT CGAACACCAG AAGGGTACGC ACGCAGACCG CTTCAAGAAG CTGAAGGCCG AGGGCTTCGT GCGGGTGCGC GTCAACGGCG AGACGACGAC CATCGACGAC GTCCCGCCGC TGGACAAGAA CCGCAAGCAT TGCATCGACC TCGTGGTCGA CCGCATCGTG GTCAAGGAAG GCATACGCGG ACGCCTTGCC GATTCTGTCG AACTGGCGTT GCGCTACGGT AACGGAAGAC TCGTCGTCGA GGTGCCGGGG CAGGGCGAGA CGGTGCACTC CACCGAATCG GTATGCCCCT CCTGCCGCAT CAGTCTGCCC GCGCCGAGTC CGCAACTCTT TTCCTTCAAC AGTCCGCAGG GGGCGTGCCC GCATTGTTCT GGCCTTGGCA GTGTCGACTA CTTCGAACCC GCACTCCTTG CGCCCAACCG GGGGCTTTCG CTCAACACGG GCGCGTTGCT GCCGTGGAAG AACCCTCGGG TCTTCGCGCG CTGGCAGGCT GACCTTGAGA AACTGGCGAA GCGTTTCGGT GTGACTCTCT CCACCCCCCT TTCCGCATGG CCCGCTGCCG GACTGGAAGT GCTCTTTCAT GGCGACGGTT CATTGCCCCA TGCGGCTGCC GGTGACGAAG GCAGCGGCGG CGGTGCGGAC ACCCGGAAGA AGGGCAGGCG CGGGGCAGAT GAGGCGTTCG GGCCGCCCAC GGGCTGGAGC GGTGTGACAC AGCTTCTTGA AAGCGGGATG CAGTACGGTG ACGCGTGGCG TGACGAGATG TCGCGCTACC GCCAGAGTCG CCCCTGCCCT GCGTGCCATG GGGCACGGCT GCGGCCCGAG GCGCTGTCGG TGCGCGTCGA CGACCTCGAC ATCCATAGTT TCTGTTCGCT TTCAGTGGCG CGTGCCCTTG CATGGCTGCG CGAACGCAGC TTCGACGGAC GGCATACGCT GGTCGCTGAA CCCTTGCTCA AGGAACTCAC GCACCGTCTT GAGTTCATGG TCAACGTGGG GCTCGACTAC ATTTCGCTGG GCCGCAACAT GTCGACCCTC TCCGGCGGCG AGGCACAGCG CATCCGCCTC GCCTCACAAC TCGGTTCCGG ACTGGTGGGG GTGACCTACG TGCTGGACGA ACCCTCGATA GGGCTGCACC CGCGGGACAA CGAACGGCTC ATCCGCACCC TGCGCAGGTT GCAGCAGCGC GGGAACACCG TGCTCGTGGT CGAACACGAC GAAGCGACCA TCCGCGAGGC GGATACCGTC ATCGAACTTG GGCCGGGGTC GGGTGCTCTT GGCGGCGAGG TGGTGTTCAG CGGGCGCGTG CCCGACCTGC TTGGAACCGC AGACACGTTG 
ACGGCGCGCT ACCTGCGCGG TGAGATGACC ATTCCCCTGC CGGAATCCCG GCGCAAGGGC GATGGCGCGT TGACGCTTCG CGGCGTGACC ACCAACAACC TGCAAGGTCT CGATTGCTCC ATCCCCTTCG GCGTGCTGAC ATGTGTCACT GGCGTCTCCG GGTCGGGAAA GAGTTCGCTT GTGGTGGACA CGCTGTACAA GCACGTCGCG CTGGCGCGGG GTATCAAGGT CGATTCGCCG GGGAGCATCG GCGGTATCGA CGGACTCGAC AGGATAGAGC GTATCGTCGC CATCGACCAG ACGCCCATCG GGCGGACGCC GCGTTCCAAC CCCGCGACGT ATACCAAGAT ATTCGACGAG ATACGCGACA TCTTCGCCAT GACGGCAGAT GCCCGTAAGC GCGGGTACAA GCCGGGGCGC TTCAGCTTCA ACGTGCGTGG AGGACGGTGC GAAGCCTGTG GCGGAGACGG GCAACTGCGT GTCGAGATGC ACTTTCTGCC CGATGTCTTC GTCACGTGCG ACGTCTGCAA GGGGCGTCGT TACAACCACG AGACGCTCGA AGTCCGGTAC AAGGGCCTCA ACATAGCGGA GGTGCTCGAC CTCACCGTGC GACAGGCACG GCAGTTCTTC GAGAACTATC CCGTGCTGGA GCGCAGGCTT GGCGTGCTCG AGGACGTGGG CCTCGAATAC CTCAGACTGG GCCAACCGGC GACGACCCTT TCGGGTGGTG AGGCGCAACG CATCAAGATA TCACGCGAAC TCGGAAAGCG TAGCCTGCCC GGCACGCTCT ACATCCTCGA CGAACCCACC ACGGGGCTGC ACATGCACGA GGTGGGCAAG CTCATTCGCG TGTTACATCA GCTTGTGGAC AGGGGCGCGA CTGTTGTGGT CATCGAACAC AACACCGATG TCATCCTGTC GTCCGACCAT GTCATCGACC TCGGGCCGGG TGGTGGCGAG AATGGCGGGC GCATCGTCTC TGCGGGAACT CCGGAGGAGA TTATCGCAGA CTCGGCATCC GTGACCGGGG CGTTCCTCGT GCAGGAACGG GCCATCCGTA ACGGCGGGTA G
|
Protein sequence | MSKHCIHIEG ARQHNLKNVD IDIPRDELVV VCGPSGSGKS TLAFDIVYAE GQRRYVESLS AYARQFLPQM DKPAVDKIEG LSPAISLEQQ TSTRNPRSTV GTVTEVYDFL RVFYARLGRM YCPQCGRPIE ARAADEIIAD ILALGEGTRC IIMAPLVEHQ KGTHADRFKK LKAEGFVRVR VNGETTTIDD VPPLDKNRKH CIDLVVDRIV VKEGIRGRLA DSVELALRYG NGRLVVEVPG QGETVHSTES VCPSCRISLP APSPQLFSFN SPQGACPHCS GLGSVDYFEP ALLAPNRGLS LNTGALLPWK NPRVFARWQA DLEKLAKRFG VTLSTPLSAW PAAGLEVLFH GDGSLPHAAA GDEGSGGGAD TRKKGRRGAD EAFGPPTGWS GVTQLLESGM QYGDAWRDEM SRYRQSRPCP ACHGARLRPE ALSVRVDDLD IHSFCSLSVA RALAWLRERS FDGRHTLVAE PLLKELTHRL EFMVNVGLDY ISLGRNMSTL SGGEAQRIRL ASQLGSGLVG VTYVLDEPSI GLHPRDNERL IRTLRRLQQR GNTVLVVEHD EATIREADTV IELGPGSGAL GGEVVFSGRV PDLLGTADTL TARYLRGEMT IPLPESRRKG DGALTLRGVT TNNLQGLDCS IPFGVLTCVT GVSGSGKSSL VVDTLYKHVA LARGIKVDSP GSIGGIDGLD RIERIVAIDQ TPIGRTPRSN PATYTKIFDE IRDIFAMTAD ARKRGYKPGR FSFNVRGGRC EACGGDGQLR VEMHFLPDVF VTCDVCKGRR YNHETLEVRY KGLNIAEVLD LTVRQARQFF ENYPVLERRL GVLEDVGLEY LRLGQPATTL SGGEAQRIKI SRELGKRSLP GTLYILDEPT TGLHMHEVGK LIRVLHQLVD RGATVVVIEH NTDVILSSDH VIDLGPGGGE NGGRIVSAGT PEEIIADSAS VTGAFLVQER AIRNGG
|
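
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and it builds the codon table from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and are not part of any database API.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Note: this sketch does not special-case alternative start codons
    (e.g. GTG or TTG, which table 11 allows); this gene starts with ATG.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last twelve bases of the gene sequence above.
head = "ATGAGCAAAC ATTGCATCCA CATCGAAGGG"
tail = "AACGGCGGGTAG"

print(translate(head))  # MSKHCIHIEG
print(translate(tail))  # NGG
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.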