Gene Dvul_0995 details


Gene Information

Locus tag: Dvul_0995
Symbol:
ID: 4662569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 1222957
End bp: 1225128
Gene Length: 2172 bp
Protein Length: 723 aa
Translation table: 11
GC content: 67%
IMG OID: 639819219
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_966443
Protein GI: 120602043
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
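
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1222957
end_bp = 1225128
protein_length_aa = 723

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so a CDS of N codons
# encodes N - 1 amino acid residues.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(gene_length_bp)       # 2172
print(gene_length_bp % 3)   # 0, the length is a whole number of codons
print(expected_protein_aa)  # 723
```

Under these assumptions the computed values agree with the listed gene length (2172 bp) and protein length (723 aa).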


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.5301
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGAAA CGATTCCACA CGACTCTGTC GCACATCGCC TCGCGGCCGA CTTGGGCATC 
ACGACCGCAC AGGCAAGTGC GGCACTCAAG CTTTTCGACG AGGGGGGCAC CATCCCCTTC
GTGGCCCGCT ATCGCAAGGA GGCCACTGGC GGCCTCGACG AGGTGGCCCT CACCGCCTTG
CGTGACGGCT GCGAACGGCT TCGTACGCTG GACAAGCGAC GTGACGCCAT CATCACCTCC
ATGACGGAGC GCGAACAGCT TACCCCCGAC CTCGCCAAGG CCCTCCATGC GGCGACCACT
CTCACCGCGC TGGAAGACAT CTACCTGCCC TTCCGCCCCA AACGGGTGAC AAGGGCCGCC
AAGGCGCGTG AACGTGGCCT CGCCCCCCTT GCTGAACGGC TGCTCGAACA ACGCGGCGCG
CAGGCGGGGC AACTTGCCGC CCCGTTCGTC GACGAGGCCA AGGGGGTTCC CGACACGGTA
GCCGCCCTTG CAGGGGCACG TGACATCATC GCCGAGACGG TGAGCGAAGA CAGGGCCACA
CGCGCCACCC TGCGCGACAT CTTCGTCCGC CGTGCGACGC TGCGTTCGAA GGTCGCACGT
GGCAAGGAGG AACAGGCCGC AACCTATCGC GACCACTTCG ACCGCAGCGA ACATGCAGCC
GCCGCCCCGG CGCACAGGCT GCTCGCCATG TTCCGTGGCG AACGCGAAGA GCTTCTCGAC
GTGCGGGTGC GGCCTGATGA CGACCTTGCC GTGGGGGCCC TTCACCGTCG CTGGCTGAAA
GGGCGTGCAT CTTCAGCCGG GACGGATGCG GCTGAAGTGG CCACGGCCCT GACCGACGCA
TGGAATCGTC TTCTGGCCCC GTCACTCGAG AACGAATTCA GAACCGCCCT GCGCGAACGG
GCCGAAGCGG AGGCCATAGC CGTCTTCGCC GCCAACCTTC GGGAATTGCT ACTGGCCCCG
CCGATGGGGC CACGACGCAC ACTCGCACTC GACCCCGGCT GGCGCACCGG GGCCAAGCTT
GTATGCCTCG ACGCGCAAGG GGCCCTGCTG CACCACGAGG TCATCCACCC CCTCACCGGT
GGAGACCGGG CGACCCGCGC CGCCGCAACC TTGCGAGAAT TGTGCTCACG CCATGGCATC
GAAGCGGTGG CCGTCGGCAA CGGTACAGCG GGGCGCGAAA CCGAAGCCTT CGTCAAGTCG
GCCGGGCTTC CACCGCATGT GACGGTGGCC CTCGTGGATG AGCGCGGCGC CTCGGTCTAC
TCCGCCTCCG AGGTCGCACG CGCCGAATTC CCCGACCACG ACGTCACTGT GCGCGGGGCC
GTATCCATCG GGCGCAGGCT CATGGACCCG CTGGCCGAAC TCGTCAAGGT CGACCCTCGC
TCGCTGGGCG TGGGGCAGTA CCAGCATGAT GTCGACCAGA ACGCCCTGCG CCGCGCCCTC
GAAGAGGTCG TGGCCTCGTG CGTCAACGCC GTGGGTGTGG ATGTCAACAC CGCCAGCCCC
GAATTGCTCG CCTACGTCTC GGGCATCGGA CCTGCGCTGG CGAAGGGCAT CGTGGCATGG
CGTGCCGCCA ACGGCCCGTT CCGTACACGG CGCGAGCTTC TCAAGGTGCC TCGCCTCGGC
CCCAAGGCCT TCGAACAGGC TGCGGGCTTC CTGCGTGTAC ATGGCGGCCC CGAACCGCTT
GACGCGAGCG CCGTCCACCC TGAAAGCTAT GCCCTCGTGC GCCGCATGGC AGAGGATACC
GGGTGCAGCG TGCCCGACCT CATGCGCGAT GCGGCCCGAC GCGAGGCCCT GCGGCTGGAA
CGCTATGTTG ACGACAGGGT AGGTCTGCCC ACCCTGCACG ACATCATGTC GGAACTGGCG
CGGCCGGGCC GTGACCCGCG CCCGGCTTTC GAGGTCTTCG CCTTCGCCGA AGGCGTGAAC
AAGGTCGATG ACGTTCAGCC CGGCATGGAG CTTCCCGGCA TCGTCACCAA CGTCACCAAG
TTCGGTGCCT TCGTCGACAT CGGCGTCCAC CGTGACGGTC TCGTGCATGT GAGCCAACTG
GCCGACCGTT TCGTACGCGA CCCTGCCGAG GTCGTGGCGC CGGGACGCAA GGTGCGGGTA
CGGGTGCTCG ATGTGGACAG GCAGCGTGAA CGCATCAACC TCACGCTCAA AGGGGTTCCG
CAGCAGGATT GA
 
Protein sequence
MTETIPHDSV AHRLAADLGI TTAQASAALK LFDEGGTIPF VARYRKEATG GLDEVALTAL 
RDGCERLRTL DKRRDAIITS MTEREQLTPD LAKALHAATT LTALEDIYLP FRPKRVTRAA
KARERGLAPL AERLLEQRGA QAGQLAAPFV DEAKGVPDTV AALAGARDII AETVSEDRAT
RATLRDIFVR RATLRSKVAR GKEEQAATYR DHFDRSEHAA AAPAHRLLAM FRGEREELLD
VRVRPDDDLA VGALHRRWLK GRASSAGTDA AEVATALTDA WNRLLAPSLE NEFRTALRER
AEAEAIAVFA ANLRELLLAP PMGPRRTLAL DPGWRTGAKL VCLDAQGALL HHEVIHPLTG
GDRATRAAAT LRELCSRHGI EAVAVGNGTA GRETEAFVKS AGLPPHVTVA LVDERGASVY
SASEVARAEF PDHDVTVRGA VSIGRRLMDP LAELVKVDPR SLGVGQYQHD VDQNALRRAL
EEVVASCVNA VGVDVNTASP ELLAYVSGIG PALAKGIVAW RAANGPFRTR RELLKVPRLG
PKAFEQAAGF LRVHGGPEPL DASAVHPESY ALVRRMAEDT GCSVPDLMRD AARREALRLE
RYVDDRVGLP TLHDIMSELA RPGRDPRPAF EVFAFAEGVN KVDDVQPGME LPGIVTNVTK
FGAFVDIGVH RDGLVHVSQL ADRFVRDPAE VVAPGRKVRV RVLDVDRQRE RINLTLKGVP
QQD
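
The sequences above are listed in blocks of ten characters separated by spaces, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listing, used here only as a small sample.
sample = "GTGACTGAAA CGATTCCACA CGACTCTGTC GCACATCGCC TCGCGGCCGA CTTGGGCATC"
dna = clean_sequence(sample)

print(len(dna))                    # six blocks of ten bases
print(round(gc_fraction(dna), 2))  # GC fraction of the sample line only
```

Applying the same two helpers to the complete gene sequence would allow a comparison with the listed gene length and GC content.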