Gene Dvul_1185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1185 
Symbol 
ID4664897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1452184 
End bp1455054 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content64% 
IMG OID639819417 
Productexcinuclease ABC, A subunit 
Protein accessionYP_966632 
Protein GI120602232 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0243274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00558343 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAAAC ATTGCATCCA CATCGAAGGG GCACGCCAGC ACAACCTCAA GAATGTCGAC 
ATCGACATCC CGCGCGATGA ACTGGTCGTC GTGTGCGGGC CTTCGGGGTC TGGCAAGTCC
ACACTGGCCT TCGATATCGT GTACGCCGAG GGACAACGCC GCTATGTGGA GTCGCTCTCC
GCCTATGCAC GTCAGTTCCT GCCGCAGATG GACAAGCCCG CCGTCGACAA GATTGAGGGC
CTCTCTCCCG CCATATCCCT CGAACAGCAG ACCTCTACGC GAAACCCGCG CTCCACGGTG
GGCACCGTCA CGGAGGTCTA CGACTTCTTG CGCGTGTTCT ATGCACGGCT TGGTCGCATG
TACTGCCCGC AGTGCGGACG CCCCATCGAG GCGCGTGCTG CGGATGAGAT CATCGCAGAC
ATCCTCGCGT TGGGCGAGGG TACGCGATGC ATCATCATGG CACCGCTGGT CGAACACCAG
AAGGGTACGC ACGCAGACCG CTTCAAGAAG CTGAAGGCCG AGGGCTTCGT GCGGGTGCGC
GTCAACGGCG AGACGACGAC CATCGACGAC GTCCCGCCGC TGGACAAGAA CCGCAAGCAT
TGCATCGACC TCGTGGTCGA CCGCATCGTG GTCAAGGAAG GCATACGCGG ACGCCTTGCC
GATTCTGTCG AACTGGCGTT GCGCTACGGT AACGGAAGAC TCGTCGTCGA GGTGCCGGGG
CAGGGCGAGA CGGTGCACTC CACCGAATCG GTATGCCCCT CCTGCCGCAT CAGTCTGCCC
GCGCCGAGTC CGCAACTCTT TTCCTTCAAC AGTCCGCAGG GGGCGTGCCC GCATTGTTCT
GGCCTTGGCA GTGTCGACTA CTTCGAACCC GCACTCCTTG CGCCCAACCG GGGGCTTTCG
CTCAACACGG GCGCGTTGCT GCCGTGGAAG AACCCTCGGG TCTTCGCGCG CTGGCAGGCT
GACCTTGAGA AACTGGCGAA GCGTTTCGGT GTGACTCTCT CCACCCCCCT TTCCGCATGG
CCCGCTGCCG GACTGGAAGT GCTCTTTCAT GGCGACGGTT CATTGCCCCA TGCGGCTGCC
GGTGACGAAG GCAGCGGCGG CGGTGCGGAC ACCCGGAAGA AGGGCAGGCG CGGGGCAGAT
GAGGCGTTCG GGCCGCCCAC GGGCTGGAGC GGTGTGACAC AGCTTCTTGA AAGCGGGATG
CAGTACGGTG ACGCGTGGCG TGACGAGATG TCGCGCTACC GCCAGAGTCG CCCCTGCCCT
GCGTGCCATG GGGCACGGCT GCGGCCCGAG GCGCTGTCGG TGCGCGTCGA CGACCTCGAC
ATCCATAGTT TCTGTTCGCT TTCAGTGGCG CGTGCCCTTG CATGGCTGCG CGAACGCAGC
TTCGACGGAC GGCATACGCT GGTCGCTGAA CCCTTGCTCA AGGAACTCAC GCACCGTCTT
GAGTTCATGG TCAACGTGGG GCTCGACTAC ATTTCGCTGG GCCGCAACAT GTCGACCCTC
TCCGGCGGCG AGGCACAGCG CATCCGCCTC GCCTCACAAC TCGGTTCCGG ACTGGTGGGG
GTGACCTACG TGCTGGACGA ACCCTCGATA GGGCTGCACC CGCGGGACAA CGAACGGCTC
ATCCGCACCC TGCGCAGGTT GCAGCAGCGC GGGAACACCG TGCTCGTGGT CGAACACGAC
GAAGCGACCA TCCGCGAGGC GGATACCGTC ATCGAACTTG GGCCGGGGTC GGGTGCTCTT
GGCGGCGAGG TGGTGTTCAG CGGGCGCGTG CCCGACCTGC TTGGAACCGC AGACACGTTG
ACGGCGCGCT ACCTGCGCGG TGAGATGACC ATTCCCCTGC CGGAATCCCG GCGCAAGGGC
GATGGCGCGT TGACGCTTCG CGGCGTGACC ACCAACAACC TGCAAGGTCT CGATTGCTCC
ATCCCCTTCG GCGTGCTGAC ATGTGTCACT GGCGTCTCCG GGTCGGGAAA GAGTTCGCTT
GTGGTGGACA CGCTGTACAA GCACGTCGCG CTGGCGCGGG GTATCAAGGT CGATTCGCCG
GGGAGCATCG GCGGTATCGA CGGACTCGAC AGGATAGAGC GTATCGTCGC CATCGACCAG
ACGCCCATCG GGCGGACGCC GCGTTCCAAC CCCGCGACGT ATACCAAGAT ATTCGACGAG
ATACGCGACA TCTTCGCCAT GACGGCAGAT GCCCGTAAGC GCGGGTACAA GCCGGGGCGC
TTCAGCTTCA ACGTGCGTGG AGGACGGTGC GAAGCCTGTG GCGGAGACGG GCAACTGCGT
GTCGAGATGC ACTTTCTGCC CGATGTCTTC GTCACGTGCG ACGTCTGCAA GGGGCGTCGT
TACAACCACG AGACGCTCGA AGTCCGGTAC AAGGGCCTCA ACATAGCGGA GGTGCTCGAC
CTCACCGTGC GACAGGCACG GCAGTTCTTC GAGAACTATC CCGTGCTGGA GCGCAGGCTT
GGCGTGCTCG AGGACGTGGG CCTCGAATAC CTCAGACTGG GCCAACCGGC GACGACCCTT
TCGGGTGGTG AGGCGCAACG CATCAAGATA TCACGCGAAC TCGGAAAGCG TAGCCTGCCC
GGCACGCTCT ACATCCTCGA CGAACCCACC ACGGGGCTGC ACATGCACGA GGTGGGCAAG
CTCATTCGCG TGTTACATCA GCTTGTGGAC AGGGGCGCGA CTGTTGTGGT CATCGAACAC
AACACCGATG TCATCCTGTC GTCCGACCAT GTCATCGACC TCGGGCCGGG TGGTGGCGAG
AATGGCGGGC GCATCGTCTC TGCGGGAACT CCGGAGGAGA TTATCGCAGA CTCGGCATCC
GTGACCGGGG CGTTCCTCGT GCAGGAACGG GCCATCCGTA ACGGCGGGTA G
 
Protein sequence
MSKHCIHIEG ARQHNLKNVD IDIPRDELVV VCGPSGSGKS TLAFDIVYAE GQRRYVESLS 
AYARQFLPQM DKPAVDKIEG LSPAISLEQQ TSTRNPRSTV GTVTEVYDFL RVFYARLGRM
YCPQCGRPIE ARAADEIIAD ILALGEGTRC IIMAPLVEHQ KGTHADRFKK LKAEGFVRVR
VNGETTTIDD VPPLDKNRKH CIDLVVDRIV VKEGIRGRLA DSVELALRYG NGRLVVEVPG
QGETVHSTES VCPSCRISLP APSPQLFSFN SPQGACPHCS GLGSVDYFEP ALLAPNRGLS
LNTGALLPWK NPRVFARWQA DLEKLAKRFG VTLSTPLSAW PAAGLEVLFH GDGSLPHAAA
GDEGSGGGAD TRKKGRRGAD EAFGPPTGWS GVTQLLESGM QYGDAWRDEM SRYRQSRPCP
ACHGARLRPE ALSVRVDDLD IHSFCSLSVA RALAWLRERS FDGRHTLVAE PLLKELTHRL
EFMVNVGLDY ISLGRNMSTL SGGEAQRIRL ASQLGSGLVG VTYVLDEPSI GLHPRDNERL
IRTLRRLQQR GNTVLVVEHD EATIREADTV IELGPGSGAL GGEVVFSGRV PDLLGTADTL
TARYLRGEMT IPLPESRRKG DGALTLRGVT TNNLQGLDCS IPFGVLTCVT GVSGSGKSSL
VVDTLYKHVA LARGIKVDSP GSIGGIDGLD RIERIVAIDQ TPIGRTPRSN PATYTKIFDE
IRDIFAMTAD ARKRGYKPGR FSFNVRGGRC EACGGDGQLR VEMHFLPDVF VTCDVCKGRR
YNHETLEVRY KGLNIAEVLD LTVRQARQFF ENYPVLERRL GVLEDVGLEY LRLGQPATTL
SGGEAQRIKI SRELGKRSLP GTLYILDEPT TGLHMHEVGK LIRVLHQLVD RGATVVVIEH
NTDVILSSDH VIDLGPGGGE NGGRIVSAGT PEEIIADSAS VTGAFLVQER AIRNGG